Commit Graph

909 Commits

Author SHA1 Message Date
ines c815ff65f6 Update feature list 2017-10-24 21:49:11 +02:00
ines d71702b827 Fix formatting 2017-10-24 20:11:04 +02:00
ines 6686e53530 Allow GitHub embeds to specify optional language 2017-10-24 16:00:56 +02:00
ines 56a47f137f Add title description for tokenizer 2017-10-24 16:00:56 +02:00
ines 3944c1d6e7 Document lemmatizer 2017-10-24 16:00:56 +02:00
ines c9dc88ddfc Document current JSON format for training 2017-10-24 16:00:56 +02:00
Matthew Honnibal ef3e5a361b Merge pull request #1442 from explosion/feature/fix-sp
💫Fix SP tag, tweak Vectors.__init__, fix Morphology
2017-10-24 10:24:07 +02:00
Matthew Honnibal fdf25d10ba Merge pull request #1440 from ramananbalakrishnan/develop
Support single value for attribute list in doc.to_array
2017-10-24 10:23:12 +02:00
ines 7701984f13 Document Span.as_doc 2017-10-23 10:38:27 +02:00
ines db15902e84 Tidy up 2017-10-23 10:38:21 +02:00
ines 3f0a157b33 Fix typo 2017-10-23 10:38:13 +02:00
Matthew Honnibal ebecaddb76 Make 'data_or_width' two keyword args in Vectors.__init__
Previously the data and width options were one argument in Vectors,
which meant you couldn't say vectors = Vectors(strings, width=300).
It's better to have two keywords.
2017-10-20 14:17:15 +02:00
ines 108f1f786e Update symbols and document missing token attributes (see #1439) 2017-10-20 13:08:44 +02:00
ines 4acab77a8a Add missing symbol for LAW entities (resolves #1427) 2017-10-20 13:07:57 +02:00
Ramanan Balakrishnan d44a079fe3
Update documentation on doc.to_array 2017-10-20 14:25:38 +05:30
Matthew Honnibal 61bc203f3f Merge pull request #1438 from explosion/feature/fast-parser
💫 Improve runtime CPU efficiency of parser/NER
2017-10-19 02:42:21 +02:00
Matthew Honnibal d4cfff0476 Comment out currently hard-coded hyper-params 2017-10-19 00:47:24 +02:00
Ines Montani f0d577e460 Merge pull request #1425 from explosion/feature/hindi-tokenizer
💫 Basic Hindi tokenization support
2017-10-18 13:34:52 +02:00
ines a74cba2ffa Remove Binder from docs (now covered by Doc API) 2017-10-17 16:27:19 +02:00
ines 8ca344712d Add Language.has_pipe method 2017-10-17 11:20:07 +02:00
ines 4cfe259266 Fix formatting 2017-10-16 20:36:41 +02:00
ines 18793efef1 Remove Russian from v2.0 docs for now 2017-10-16 20:36:36 +02:00
ines d383612225 Add note about word vectors in example (see #1117) 2017-10-16 20:31:58 +02:00
Matthew Honnibal 010a7309ff Merge pull request #1402 from explosion/feature/fix-matcher-operators
💫 Fix Matcher variable-length operators
2017-10-16 17:53:19 +02:00
ines 63393b4e0d Update matcher docs to reflect operator changes 2017-10-16 13:44:12 +02:00
ines 15514dc333 Add section on upgrading 2017-10-14 22:14:47 +02:00
ines c0aceb9fbe Add Hindi to supported languages 2017-10-14 15:16:41 +02:00
ines a5da683578 Add Russian to alpha docs and update tokenizer dependencies 2017-10-14 12:52:41 +02:00
ines a69f4e56e5 Remove outdated aside 2017-10-14 12:52:07 +02:00
ines bb6ecb82e5 Ensure long file paths in code examples break if needed 2017-10-14 12:51:52 +02:00
ines bfd9506f1d Update extensions docs and add resources 2017-10-13 00:18:13 +02:00
ines 5f5d6897e8 Increment version 2017-10-13 00:18:02 +02:00
ines 9fd68334ab Add validate command docs 2017-10-12 23:36:48 +02:00
Ines Montani 37aa523a8e Merge pull request #1408 from explosion/feature/dot-underscore
💫 Custom attributes via Doc._, Token._ and Span._
2017-10-11 18:35:56 +02:00
ines eac9e99086 Update docs on adding lemmatization to languages 2017-10-11 14:21:15 +02:00
ines f4ae6763b9 Fix consistency of imports from spacy.tokens in examples 2017-10-11 02:30:40 +02:00
ines 19598ebfee Update migration guide 2017-10-10 06:38:11 +02:00
ines 9c96a6e131 Update pipelines section in v2 overview 2017-10-10 06:33:53 +02:00
Matthew Honnibal 09d61ada5e Merge pull request #1396 from explosion/feature/pipeline-management
💫 Improve pipeline and factory management
2017-10-10 04:29:54 +02:00
ines 6679117000 Add pipeline component examples 2017-10-10 04:26:06 +02:00
ines 7a592d01dc Update pipeline component usage docs 2017-10-10 04:24:39 +02:00
ines 3d5154811a Fix typo 2017-10-10 04:24:22 +02:00
ines 43b70651fb Document extension methods on Doc, Token and Span
set_extension, get_extension, has_extension
2017-10-10 04:23:37 +02:00
ines b4fc6b203c Rename mixin 2017-10-10 04:22:23 +02:00
ines de374dc72a Merge branch 'feature/pipeline-management' into feature/dot-underscore 2017-10-09 14:37:51 +02:00
ines 6c253db3fe Add section for developing spaCy extensions 2017-10-09 14:36:56 +02:00
ines 6550d0547c Fix typo 2017-10-09 14:36:36 +02:00
ines 4d248ea920 Fix spacing on bulleted lists 2017-10-09 14:36:30 +02:00
ines 2ac8b5c622 Add wrapper for before/after code examples 2017-10-09 14:36:20 +02:00
ines ca6769fd48 Update spacy functions and remove removed set_factory 2017-10-07 15:28:01 +02:00
ines 743d1df1fe Update pipelines docs and add user hooks to custom components 2017-10-07 15:27:28 +02:00
Matthew Honnibal eb0595bea9 Merge pull request #1392 from explosion/feature/parser-history-model
💫 Parser history features
2017-10-07 15:07:02 +02:00
ines d70cf19158 Fix formatting 2017-10-07 15:06:38 +02:00
ines c970b4f226 Add missing token attribute 2017-10-07 15:04:16 +02:00
ines 37f755897f Update rule-based matching docs 2017-10-07 15:04:09 +02:00
Matthew Honnibal e22067e3b5 Document new hyper-parameters 2017-10-07 07:10:10 -05:00
ines feaf353051 Update processing pipelines usage docs 2017-10-07 14:05:59 +02:00
ines 58dfde7c02 Remove redundante deprecation note 2017-10-07 04:54:57 +02:00
ines ed8e0085b0 Update docs for spacy.load() 2017-10-07 03:06:55 +02:00
ines e370332fb1 Update Language API docs 2017-10-07 03:00:20 +02:00
ines 3468d535ad Update model benchmarks 2017-10-06 21:39:06 +02:00
ines 96a4e79d13 Fix PhraseMatcher example 2017-10-06 18:22:10 +02:00
ines bb13aa4bf3 Fix typos in PhraseMatcher docs 2017-10-04 16:12:09 +02:00
ines 33cf9cecdd Port over changes from #1386 2017-10-04 13:34:03 +02:00
ines 36ff525ff5 Add NER P and NER R scores to model overview 2017-10-04 00:37:15 +02:00
ines 15ec7ddd09 Add docs for new spacy evaluate command 2017-10-04 00:19:03 +02:00
ines 464f14019d Fix typos 2017-10-04 00:18:47 +02:00
ines bfb512f45a Add website package.json and fix gitignore 2017-10-04 00:18:41 +02:00
ines 80a2fb6193 Update visualizers docs and add submenu 2017-10-03 19:40:39 +02:00
ines 5fb057b575 Fix secondary font stack 2017-10-03 15:45:07 +02:00
ines b24fbd8aad Fix titles for social cards 2017-10-03 14:54:33 +02:00
ines 23019d1daa Add styleguide 2017-10-03 14:28:24 +02:00
ines 319fac14fe Update global config and landing page 2017-10-03 14:28:18 +02:00
ines 22dd929b65 Add models documentation 2017-10-03 14:28:03 +02:00
ines 808f7ee417 Update API documentation 2017-10-03 14:27:22 +02:00
ines 3f4fd2c5d5 Update usage documentation 2017-10-03 14:26:20 +02:00
ines 9af604f0da Update layout templates, partials and mixins 2017-10-03 14:20:13 +02:00
ines 49b58d35fd Update JavaScript 2017-10-03 14:18:49 +02:00
ines a8ff8423bb Update image assets, icons and SVGs
Move SVG sprite to Jade file and include in template. Only use SVG
symbols for logos.
2017-10-03 14:17:41 +02:00
ines 7d01d7411b Update web fonts 2017-10-03 14:15:36 +02:00
ines 3e1b971b16 Update CSS 2017-10-03 14:14:52 +02:00
Reza Gharibi 0461b82158 Fix typos 2017-09-27 03:56:20 +03:30
Reza Gharibi fa1844b132 Fix typo 2017-09-27 03:55:54 +03:30
Reza Gharibi b5dd7e7cc4 Fix typo 2017-09-27 03:55:28 +03:30
Ines Montani b8e81daccf Fix typo (closes #1312) 2017-09-14 12:49:59 +02:00
ines d15775c3ad Fix typos and commands in alpha docs 2017-08-21 13:40:11 +02:00
ines 3c33003078 Port over typo corrections from #1245 2017-08-20 12:00:17 +02:00
ines 1261b01e46 Update Doc.char_span docs 2017-08-19 16:34:32 +02:00
ines 5cb0200e63 Document new Span.to_array() method 2017-08-19 12:45:28 +02:00
ines 471eed4126 Add example to Span.merge() 2017-08-19 12:45:16 +02:00
ines 404d3067b8 Document new Doc.char_span() method 2017-08-19 12:45:00 +02:00
ines d53cbf369f Document as_tuples kwarg on Language.pipe() 2017-08-19 12:44:50 +02:00
ines 6a37c93311 Update argument type 2017-08-19 12:44:33 +02:00
ines 4731d50220 Add break utility for long nowrap items (e.g. code) 2017-08-19 12:44:23 +02:00
ines 0aba11b64b Update package command docs 2017-08-14 16:45:44 +02:00
ines 52c6302223 Allow prompt setting on code mixin 2017-08-14 13:05:01 +02:00
ines a29f132ffd Change python -m spacy to spacy
Reflects latest change to entry point or auto-alias
2017-08-14 13:04:48 +02:00
Nikolai Kruglikov 08e443e083 Fix small typo in documentation 2017-08-14 12:19:04 +02:00
ines ab8ffbaab7 Add text classification to v2 overview 2017-07-22 17:56:51 +02:00
ines f085b88f9d Add TextCategorizer API docs stub 2017-07-22 17:56:33 +02:00
ines ab1a4e8b3c Add Tensorizer API docs stub 2017-07-22 17:56:25 +02:00
ines 0fb89dd204 Add text classification usage guide template 2017-07-22 17:56:07 +02:00
ines d05ab1b3a0 Add text classification to 101 overview and change order 2017-07-22 17:55:53 +02:00
ines d2a7e5b8e5 Add GoldParse.cats attribute 2017-07-22 17:55:35 +02:00
ines 23d976ed00 Add Doc.cats attribute and missing v2 tag 2017-07-22 17:55:14 +02:00
Ines Montani 1ddbeddca2 Fix typo 2017-07-22 15:00:58 +02:00
Jarle Mathiesen f20533ec0c fix small typo 2017-06-24 12:31:33 +02:00
Savva Kolbachev 800a8faff4 Changed the capital of Lithuania to Vilnius
Hi,
There is a typo about the capital of Lithuania.

Vilnius is the capital of Lithuania https://en.wikipedia.org/wiki/Vilnius
Ljubljana is the capital of Slovenia https://en.wikipedia.org/wiki/Ljubljana
2017-06-12 23:27:00 +03:00
Ines Montani 57f64b9e1c Merge pull request #1124 from v3t3a/patch-3
docs - Fix url error for Displacy Ent visualizer
2017-06-12 21:20:32 +02:00
Ines Montani b2a28028cf Merge pull request #1115 from v3t3a/patch-2
docs - Add read() method when opening file (Lightning tour)
2017-06-12 21:19:25 +02:00
Ines Montani fe8d136ae0 Merge pull request #1114 from v3t3a/patch-1
docs - Update doc.jade (Just remove a duplicate 'doc =')
2017-06-12 21:19:02 +02:00
Vetea eae1f7b19c Fix url error for Displacy Ent visualizer 2017-06-12 14:30:02 +02:00
ines 49026a1346 Fix typos in example (see #1105) 2017-06-08 19:15:50 +02:00
Vetea cc3aee1189 Add read() method when opening file
Add read() method for 

to avoid :
```TypeError: Argument 'string' has incorrect type (expected str, got _io.TextIOWrapper)```

Test with:
spaCy : v2.0.0 Alpha
python : 3.5.2+ (default, Sep 22 2016, 12:18:14)
2017-06-08 11:27:09 +02:00
Vetea 8e20cf6368 Update doc.jade
Just remove a duplicate 'doc ='
2017-06-08 10:35:58 +02:00
ines 6b799bac54 Fix formatting and details 2017-06-06 14:37:49 +02:00
ines 6c34b1a65b Update alpha thread link 2017-06-06 00:58:12 +02:00
ines c921ba109a Fix robots and meta 2017-06-05 20:07:52 +02:00
ines fd9ae0f0e0 Update v2 comparison table 2017-06-05 16:39:11 +02:00
ines a3f9745a14 Update similarity usage guide and examples 2017-06-05 15:37:33 +02:00
ines fd35d910b8 Update v2 docs and benchmarks 2017-06-05 14:13:38 +02:00
ines 9f55c0d4f6 Add Vectors class 2017-06-05 13:33:11 +02:00
ines 040553ca59 Update architecture and features table 2017-06-05 13:33:01 +02:00
ines e204788c30 Add docs for util.load_model_from_path 2017-06-05 13:18:22 +02:00
ines efc37ea3de Update train CLI 2017-06-04 23:45:14 +02:00
ines 505d43b832 Update norms example 2017-06-04 23:33:26 +02:00
ines f8e93b6d0a Update norms example 2017-06-04 23:24:29 +02:00
ines a857b2b511 Update norms example 2017-06-04 23:21:37 +02:00
ines 47d066b293 Add under construction 2017-06-04 23:17:54 +02:00
ines e9816daa6a Add details on syntax iterators 2017-06-04 23:16:33 +02:00
ines 6438428ce8 Update v2 infobox 2017-06-04 22:09:33 +02:00
ines 990cb81556 Add info on syntax iterators 2017-06-04 21:47:22 +02:00
ines e4eb33daf7 Add links to production use guide 2017-06-04 20:56:58 +02:00
ines 63cd539d04 Add more details on model packages and requirements.txt (see #1099) 2017-06-04 20:52:10 +02:00
ines 97ff83d163 Fix docs on model loading 2017-06-04 20:44:59 +02:00
ines b6002db797 Add v2 label 2017-06-04 18:53:03 +02:00
ines e76baccd51 Add alpha social image 2017-06-04 18:43:14 +02:00
ines 468ff1a7dd Update v2 docs and add benchmarks stub 2017-06-04 15:34:28 +02:00
Matthew Honnibal 23fd6b1782 Add intro narrative for v2 2017-06-04 15:10:37 +02:00
ines 3419ecbfdd Update docs on model shortcut links 2017-06-04 13:55:00 +02:00
ines 586e901143 Add v2 intro stub 2017-06-04 13:42:37 +02:00
ines 4f8f62d9b3 Merge branch 'v2-docs-edits' into develop 2017-06-04 13:40:58 +02:00
ines 809903dcad Fix link and update wording 2017-06-04 13:29:20 +02:00
ines 22dd18c364 Remove redundant CPU commands 2017-06-04 13:29:13 +02:00
ines 1d6377218a Update architecture blurb and move other info 2017-06-04 13:28:58 +02:00
ines eb66625c69 Also add disallow robots.txt for alpha mode 2017-06-04 13:14:32 +02:00
ines 7a66c9f039 Fix formatting 2017-06-04 13:14:00 +02:00
Matthew Honnibal f2c4a9f690 Edits to spacy-101 page 2017-06-04 13:10:27 +02:00
Matthew Honnibal aca53b95e1 Link architecture blurb 2017-06-04 13:10:06 +02:00
Matthew Honnibal 64ca5123bb Add Architecture 101 blurb 2017-06-04 13:09:19 +02:00
Matthew Honnibal e77ed953f4 Update GPU instructions 2017-06-04 12:03:22 +02:00
ines 1d3b012e56 Update adding languages docs and add 101 2017-06-03 23:54:23 +02:00
ines a3715a81d5 Update adding languages guide 2017-06-03 22:16:38 +02:00
ines ec6d2bc81d Add table of contents mixin 2017-06-03 22:16:26 +02:00
ines 9acf8686f7 Update note on compact mode issues 2017-06-03 13:31:16 +02:00
ines b0225183c2 Update displaCy defaults 2017-06-03 13:27:06 +02:00
ines c60431357d Port over docs typo corrections 2017-06-03 11:31:30 +02:00
ines 9064fbbf1e Fix empty arguments in mixins 2017-06-01 18:57:02 +02:00
ines 8bee34126d Update model size 2017-06-01 18:22:35 +02:00
ines 6c908700c4 Add alpha badge 2017-06-01 18:20:33 +02:00
ines c6dc2fafc0 Add Spanish and move example sentences to meta 2017-06-01 17:49:56 +02:00
ines 1bebc6392c Add source files to pipeline components 2017-06-01 17:38:06 +02:00
ines b577ed79ee Move social image logic out to function and move files 2017-06-01 14:27:44 +02:00
ines 8fc52878f7 Make graphic smaller 2017-06-01 13:03:54 +02:00
ines 5e60b09dcd Fix custom tokenizer example 2017-06-01 13:02:50 +02:00
ines 706cec6d58 Move annotation specs up 2017-06-01 13:02:43 +02:00
ines fd77917c5a Remove bottom padding from sidebar 2017-06-01 13:02:36 +02:00
ines 8274dffad6 Update NER training draft 2017-06-01 12:51:36 +02:00
ines 04fac3f52a Add NER training example code 2017-06-01 12:47:47 +02:00
ines 7f5e7e7320 Fix typo 2017-06-01 12:47:36 +02:00
ines 5cef1dd305 Always use develop branch of GitHub links in ALPHA mode 2017-06-01 12:47:30 +02:00
ines 4a927154d8 Update v2 docs 2017-06-01 11:56:32 +02:00
ines 03bbb96db8 Remove outdated examples 2017-06-01 11:56:02 +02:00
ines 789e69b73f Update training guide 2017-06-01 11:53:23 +02:00
ines 2f40d6e7e7 Add training 101 2017-06-01 11:53:16 +02:00
ines abed463bbb Update serialization 101 2017-06-01 11:52:58 +02:00
ines 72380c952a Update training section in NER guide and add links 2017-06-01 11:52:49 +02:00
ines 77dca25c7f Update Language API docs 2017-06-01 11:51:31 +02:00
ines 9c975c4882 Add training illustrations 2017-06-01 11:51:22 +02:00
ines bea6e6bfad Allow annotation row to take children 2017-06-01 11:51:14 +02:00
ines 22b1f72870 Add spaCy 101 intro 2017-05-31 12:44:09 +02:00
ines a18b95ca12 Update docs on testing 2017-05-31 12:43:40 +02:00
ines baa6070548 Fix ID of quickstart group to avoid conflicts 2017-05-31 12:43:30 +02:00
ines 981196c181 Fix typo 2017-05-31 11:34:31 +02:00
ines f86289566a Update new in v2 section and add note on Matcher acceptors 2017-05-30 13:53:06 +02:00
ines ce4e45d0bb Update 101 intro 2017-05-29 22:15:06 +02:00
ines b5bfab8699 Add description 2017-05-29 15:27:16 +02:00
ines 687ed28340 Update processing pipelines guide 2017-05-29 14:21:00 +02:00
ines d5992f408f Update note on vocab consistency 2017-05-29 14:14:26 +02:00
ines 567485a818 Fix and document model loading with pipeline and overrides 2017-05-29 14:10:10 +02:00
ines a2134951f2 Update 101 and add note on pipeline order and tensors 2017-05-29 11:45:32 +02:00
ines 17b635eaab Update alpha docs note and fix typo 2017-05-29 11:09:24 +02:00
ines fbe105f1eb Add note on L in long integers in Python 2 2017-05-29 11:05:05 +02:00
ines 9d74810f6f Update examples 2017-05-29 01:09:52 +02:00
ines 42cf414138 Update Matcher example 2017-05-29 01:09:52 +02:00
ines 00b2094dc3 Fix typos, long integers and tests 2017-05-29 01:09:52 +02:00
ines 18b8050b07 Revert "Update syntax highlighting regex for long integers"
This reverts commit 11f2e80c6a.
2017-05-29 01:09:52 +02:00
ines d71c6db76e Add missing Chainer install for GPU if building spaCy from source 2017-05-28 23:34:59 +02:00
ines e0f9ccdaa3 Update texts and rename vectorizer to tensorizer 2017-05-28 23:26:13 +02:00
ines 606879b217 Update hash strings examples 2017-05-28 19:42:44 +02:00