3674 Commits

Author SHA1 Message Date
Matthew Honnibal 6652f2a135 Test #656, #624: special case rules for tokenizer with attributes. 2016-11-25 12:44:13 +01:00
Matthew Honnibal 1e0f566d95 Fix #656, #624: Support arbitrary token attributes when adding special-case rules. 2016-11-25 12:43:24 +01:00
Matthew Honnibal 87613edf8f Add set_struct_attr staticmethod to token 2016-11-25 12:41:47 +01:00
Matthew Honnibal fb69aa648f Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-11-25 11:35:44 +01:00
Matthew Honnibal 9a03a3f85e Add get_struct_attr staticmethod to Token, to match Lexeme.get_struct_attr. 2016-11-25 11:35:17 +01:00
Matthew Honnibal 53d8ca8f51 Add spacy.attrs.intify_attrs function, to normalize strings in token attribute dictionaries. 2016-11-25 11:34:30 +01:00
Ines Montani e0c7a22f09 Add usage workflow for entity recognizer 2016-11-25 02:30:31 +01:00
Ines Montani c8e69b98cc Update tutorial tags 2016-11-25 02:30:31 +01:00
Ines Montani bf65d070ef Add CodePen embed mixin 2016-11-25 02:30:31 +01:00
Ines Montani 3092efdbeb Update CONTRIBUTORS.md 2016-11-24 22:08:36 +01:00
Ines Montani 6f7835bb70 Add tutorial 2016-11-24 19:25:21 +01:00
Ines Montani 427e942e84 Ignore temporary files 2016-11-24 19:21:27 +01:00
Matthew Honnibal 1f247959f3 Merge pull request #658 from pokey/master
Add noun_chunks to Span
2016-11-24 23:33:57 +11:00
Matthew Honnibal b8c4f5ea76 Allow German noun chunks to work on Span
Update the German noun chunks iterator, so that it also works on Span objects.
2016-11-24 23:30:15 +11:00
Pokey Rule 3e3bda142d Add noun_chunks to Span 2016-11-24 10:47:20 +00:00
Ines Montani a98da29232 Update CONTRIBUTORS.md 2016-11-24 11:37:08 +01:00
Matthew Honnibal 09f68bc641 Fix Issue #639: stop words in language class not used. This patch is messy, but it's better not to change too much until the language data loading can be properly refactored. 2016-11-24 00:13:55 +01:00
Matthew Honnibal 48e1dc29d4 Fix default path loading. 2016-11-23 23:48:55 +01:00
Matthew Honnibal e01c1875ee Work on test for #615 2016-11-23 23:48:41 +01:00
Matthew Honnibal 1b77932ba5 Merge pull request #654 from ExplodingCabbage/patch-1
Fix syntax mistake
2016-11-24 09:31:36 +11:00
ExplodingCabbage 6c4f488e89 Fix syntax mistake 2016-11-23 15:12:45 +00:00
Matthew Honnibal 60eb2343ce Only try to load vectors if they exist. 2016-11-23 13:50:24 +01:00
Matthew Honnibal 618ac36093 Fix use of path argument in Language.__init__. Needs to be keyword arg, not positional. 2016-11-23 13:26:34 +01:00
Ines Montani a7b5fba132 Merge pull request #642 from ExplodingCabbage/specify-data-path
Let --data-path be specified when running download.py scripts
2016-11-23 13:05:03 +01:00
Ines Montani ede2baba19 Merge pull request #647 from wjt/patch-1
Fix typos in docs
2016-11-21 14:33:50 +01:00
Will Thompson e896466dcf docs: processing-text: fix missing line wrap 2016-11-21 10:43:16 +00:00
Will Thompson 1adc96f0a6 docs: fix "installaton" typo 2016-11-21 10:37:57 +00:00
Matthew Honnibal ba2cd3d1e7 Merge pull request #646 from ExplodingCabbage/ignore-more-stuff
Ignore entire data folder
2016-11-21 09:52:18 +11:00
Matthew Honnibal d0c999e0ad Add config.py for paddle example 2016-11-20 23:24:51 +01:00
Matthew Honnibal 605144398b Merge branch 'master' of https://github.com/explosion/spaCy 2016-11-20 23:23:59 +01:00
Matthew Honnibal d75fe7c19a Update paddle example 2016-11-20 21:45:08 +01:00
Matthew Honnibal 1ef541ddff Add train.sh for paddle 2016-11-20 21:44:33 +01:00
Mark Amery bc368e4237 Ignore entire data folder
Previously only some of its content was ignored, so running

    python -m spacy.en.download all

after installing from a local repo would create unstaged changes.
2016-11-20 20:33:23 +00:00
Mark Amery fbe19680a6 Fix another bug related to Language.__init__'s path parameter 2016-11-20 20:31:34 +00:00
Mark Amery 2dc305f46b Merge remote-tracking branch 'origin/master' into specify-data-path 2016-11-20 18:29:06 +00:00
Ines Montani 20c8fc5255 Merge pull request #645 from ExplodingCabbage/formatting-mistake
Fix another typo on the website
2016-11-20 19:13:53 +01:00
Ines Montani b89946e95d Merge pull request #644 from ExplodingCabbage/missing-spaces
Fix a bunch of missing spaces of the website
2016-11-20 19:13:29 +01:00
Mark Amery 270d42e73a Fix another typo on the website 2016-11-20 17:08:04 +00:00
Mark Amery b4e1dc0e3f Fix a bunch of missing spaces of the website 2016-11-20 17:02:45 +00:00
Mark Amery a0c4b29dcb Document new --data-path argument 2016-11-20 16:52:56 +00:00
Mark Amery b0a07c21a0 Fix `path` param of `Language.__init__` always being ignored
There was an explicitly-declared `path` keyword argument, so 'path'
would never be present in `**overrides`. This line just overwrote
any manually-specified value the user might've passed to the `path`
parameter.
2016-11-20 16:29:57 +00:00
Mark Amery 1988fce389 Merge remote-tracking branch 'origin/master' into specify-data-path 2016-11-20 16:07:14 +00:00
Ines Montani bcc76e42de Merge pull request #643 from ExplodingCabbage/patch-1
Fix spelling error on website front page
2016-11-20 17:04:48 +01:00
ExplodingCabbage b6e507e026 Fix spelling error on website front page 2016-11-20 16:02:54 +00:00
Mark Amery 3871007c72 Let --data-path be specified when running download.py scripts
Resolves https://github.com/explosion/spaCy/issues/637
2016-11-20 15:48:04 +00:00
Ines Montani dad2c6cae9 Strip trailing whitespace 2016-11-20 16:45:51 +01:00
Ines Montani 3082e49326 Update and reformat German stopwords 2016-11-20 16:45:26 +01:00
Ines Montani d24aaadbb8 Merge pull request #638 from souravsingh/add-stopwords
Add German Stopwords
2016-11-20 16:09:12 +01:00
Sourav Singh 6745eac309 Update language_data.py 2016-11-20 19:52:02 +05:30
Ines Montani c413ffe26d Merge pull request #640 from ExplodingCabbage/missing-gitignore-entry
Add cythonize.json to .gitignore
2016-11-20 15:07:24 +01:00
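
Note on commit b0a07c21a0 (Fix `path` param of `Language.__init__` always being ignored): the message describes a general Python pitfall, where a name declared as an explicit keyword parameter can never also appear in `**overrides`. The following is a minimal sketch of that behaviour, not the actual spaCy code. The names `DEFAULT_PATH`, `load_buggy`, and `load_fixed` are hypothetical and exist only for illustration.

```python
# Hypothetical default location, used only for this illustration.
DEFAULT_PATH = "/default/data"


def load_buggy(path=None, **overrides):
    # 'path' is an explicitly declared parameter, so Python binds any
    # path=... argument to it and never places 'path' in **overrides.
    # This lookup therefore always returns the default and overwrites
    # whatever the caller passed.
    path = overrides.get("path", DEFAULT_PATH)
    return path


def load_fixed(path=None, **overrides):
    # Fall back to the default only when the caller gave no path.
    if path is None:
        path = DEFAULT_PATH
    return path


if __name__ == "__main__":
    print(load_buggy(path="/custom/data"))
    print(load_fixed(path="/custom/data"))
    print(load_fixed())
```

In this sketch, `load_buggy` discards the caller's value while `load_fixed` keeps it, which is the distinction the commit message draws.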