Commit Graph

6354 Commits

Author SHA1 Message Date
Matthew Honnibal 5384fff5ce Add test for 1305: Incorrect lemmatization of VBZ for English 2017-09-06 18:40:18 +02:00
Matthew Honnibal 95bca20c17 Revert changes to spacy/cli/train.py from branch 2017-09-06 05:52:32 -05:00
Matthew Honnibal 24ff6b0ad9 Fix parsing and tok2vec models 2017-09-06 05:50:58 -05:00
Matthew Honnibal c537154b21 Revert gold pre-processing to True 2017-09-06 04:59:08 -05:00
Matthew Honnibal 167f6a8938 Revert noise-level back to default 0.0 2017-09-06 04:58:33 -05:00
Matthew Honnibal 1b65115bc2 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-09-04 20:02:53 -05:00
Matthew Honnibal 33fa91feb7 Restore correctness of parser model 2017-09-04 21:19:30 +02:00
Matthew Honnibal e88a42e460 Increment version 2017-09-04 21:14:39 +02:00
Matthew Honnibal 48f4abdcf2 Update travis, removing pypi build 2017-09-04 20:05:37 +02:00
Matthew Honnibal 6bd0a0df9a Update travis 2017-09-04 19:49:35 +02:00
Matthew Honnibal d9c609c0f5 Update travis 2017-09-04 19:01:38 +02:00
Matthew Honnibal 3ba9994f1f Update travis 2017-09-04 18:44:23 +02:00
Matthew Honnibal d47af99561 Update travis.yml 2017-09-04 18:43:33 +02:00
Matthew Honnibal 66646ead26 Update travis 2017-09-04 18:14:15 +02:00
Matthew Honnibal 9d65d67985 Preserve model compatibility in parser, for now 2017-09-04 16:46:22 +02:00
Matthew Honnibal d5fbf27335 Fix test 2017-09-04 16:45:11 +02:00
Matthew Honnibal 7fdafcc4c4 Fix config loading in tagger 2017-09-04 16:38:49 +02:00
Matthew Honnibal 058372d120 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-09-04 16:27:53 +02:00
Matthew Honnibal 16e25ce3b5 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-09-04 09:26:53 -05:00
Matthew Honnibal 9f512e657a Fix drop_layer calculation 2017-09-04 09:26:38 -05:00
Matthew Honnibal cb4839033c Fix loader for EN tests 2017-09-04 15:19:18 +02:00
Matthew Honnibal 382ce566eb Fix deserialization bug 2017-09-04 15:19:01 +02:00
Matthew Honnibal bfddf50081 Fix #1296: Incorrect lemmatization of base form verbs 2017-09-04 15:18:41 +02:00
Matthew Honnibal b29e6bff46 Improve lemmatization rule for am|VBP 2017-09-04 15:18:10 +02:00
Matthew Honnibal 644d6c9e1a Improve lemmatization tests, re #1296 2017-09-04 15:17:44 +02:00
Matthew Honnibal 3cf3fa1704 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-09-02 12:46:11 -05:00
Matthew Honnibal e920885676 Fix pickle during train 2017-09-02 12:46:01 -05:00
Matthew Honnibal c0eaba8b28 Fix low-data textcat 2017-09-02 15:17:32 +02:00
Matthew Honnibal 9e378bdac5 Fix textcat serialization 2017-09-02 15:17:20 +02:00
Matthew Honnibal e3ea6ee02b Increment version 2017-09-02 15:17:01 +02:00
Matthew Honnibal a3b69bcb3d Add low_data mode in textcat 2017-09-02 14:56:30 +02:00
Matthew Honnibal ead78c7b9b Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-09-02 12:55:25 +02:00
Matthew Honnibal 5e6a9e7dcc Add rule-based SBD 2017-09-02 12:53:38 +02:00
Matthew Honnibal a824cf8f9a Adjust text classification model 2017-09-02 11:41:00 +02:00
Matthew Honnibal ac040b99bb Add support for pre-trained vectors in text classifier 2017-09-01 16:39:55 +02:00
Matthew Honnibal 7742a6d559 Add GloVe vectors reader 2017-09-01 16:39:22 +02:00
Matthew Honnibal 789e1a3980 Use 13 parser features, not 8 2017-08-31 14:13:00 -05:00
Matthew Honnibal 30e35d9666 Fix syntax error 2017-08-30 17:35:39 -05:00
Matthew Honnibal 4ceebde523 Fix gradient bug in parser 2017-08-30 17:32:56 -05:00
ines 173089a45a Add more validation for model meta 2017-08-29 11:21:46 +02:00
Matthew Honnibal 2e28982e28 Merge pull request #1288 from geovedi/indonesian
Indonesian language support
2017-08-26 21:31:13 +02:00
ines 7e04b7f89c Fix info text on pipeline in package cli 2017-08-26 18:30:59 +02:00
ines 40afa13a8a Increment version 2017-08-26 18:30:49 +02:00
Matthew Honnibal 876f38c548 Merge pull request #1279 from oroszgy/model_cli_v2
Added vector loading to model cli
2017-08-26 15:57:50 +02:00
Matthew Honnibal cfc055734e Split % in units, for compatibility with corpus 2017-08-25 20:03:37 -05:00
Matthew Honnibal 4bb6bc3f9e Add support for sent_start to GoldParse 2017-08-25 20:03:14 -05:00
Matthew Honnibal 44589fb38c Fix Break oracle 2017-08-25 19:50:55 -05:00
Matthew Honnibal 6d4e8e14ca Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-08-25 12:37:16 -05:00
Matthew Honnibal 4ce5531389 Use layer norm instead of batch norm 2017-08-25 12:37:10 -05:00
Matthew Honnibal 20dd66ddc2 Constrain sentence boundaries to IS_PUNCT and IS_SPACE tokens 2017-08-25 19:35:47 +02:00