Commit Graph

2784 Commits

Author SHA1 Message Date
Matthew Honnibal 5188f6d9d8 * Fix parseC function 2016-02-01 08:48:48 +01:00
Matthew Honnibal bcf8f7ba40 * Add a parse_batch method to Parser, that releases the GIL around a batch of documents. 2016-02-01 08:34:55 +01:00
Matthew Honnibal bd47cb3290 Merge branch 'rethinc2' of https://github.com/honnibal/spaCy into rethinc2 2016-02-01 08:33:52 +01:00
Matthew Honnibal 80caba28c7 Merge branch 'master' of ssh://github.com/honnibal/spaCy into rethinc2 2016-02-01 08:33:26 +01:00
Matthew Honnibal d5579cd0d8 Merge branch 'rethinc2' of https://github.com/honnibal/spaCy into rethinc2 2016-02-01 03:08:49 +01:00
Matthew Honnibal 490ba65398 * Use openmp in parser 2016-02-01 03:08:42 +01:00
Matthew Honnibal cb78d91ec5 * Fix ArcEager.set_valid 2016-02-01 03:07:37 +01:00
Matthew Honnibal 9c34ca9e5d * Add _stack to mod_names 2016-02-01 03:00:53 +01:00
Matthew Honnibal 28e5ad62bc * Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents 2016-02-01 03:00:15 +01:00
Matthew Honnibal a47f00901b * Pass a StateC pointer into the transition and validation methods in the parser, so that the GIL can be released over a batch of documents 2016-02-01 02:58:14 +01:00
Matthew Honnibal daaad66448 * Now fully proxied 2016-02-01 02:37:08 +01:00
Matthew Honnibal 7a0e3bb9c1 * Continue proxying. Some problem currently 2016-02-01 02:22:21 +01:00
Matthew Honnibal 2169bbb7ea * Shadow StateClass with StateC, to start proxying 2016-02-01 01:16:14 +01:00
Matthew Honnibal 2fa228458e * Add _state file, which StateClass will proxy to 2016-02-01 01:09:21 +01:00
Matthew Honnibal bc0f0d284c * Require different thinc version 2016-01-30 20:29:24 +01:00
Matthew Honnibal 6bb007d16e * Make set_parse nogil 2016-01-30 20:27:52 +01:00
Matthew Honnibal 9410e74c92 * Switch parser to use nogil functions 2016-01-30 20:27:07 +01:00
Matthew Honnibal 10877a7791 * Update for thinc 5.0, including changing cost from int to weight_t, and updating the tagger and parser 2016-01-30 14:31:36 +01:00
Matthew Honnibal 6c633f2edc Fix Issue #243: Incorrect gazetteer entry 2016-01-30 06:58:29 +11:00
Matthew Honnibal ea4ff94cde * Whitespace 2016-01-29 03:59:22 +01:00
Matthew Honnibal b0718b6ee1 * Move to thinc 5.0 2016-01-29 03:58:55 +01:00
Henning Peters ed3ebf9e43 remove unnecessary compiler flags (see #237) 2016-01-28 19:12:00 +01:00
Matthew Honnibal 9721502c81 * Update version 2016-01-25 15:52:59 +01:00
Matthew Honnibal 907e8cf07d * Add u prefix to string in web example 2016-01-25 15:51:38 +01:00
Matthew Honnibal eba03695ef * Comment out pickle tests 2016-01-25 15:51:13 +01:00
Matthew Honnibal de94e6c525 * Mark pickle tests as xfail, due to temp files problem 2016-01-25 15:24:17 +01:00
Matthew Honnibal 87172a15c6 * Fix runtime error bug that arose from updated Span.root function. 2016-01-25 15:22:42 +01:00
Matthew Honnibal 2c8dd91785 * Fix first code example on the website 2016-01-23 18:09:19 +01:00
Matthew Honnibal af332f5095 * Add some stream of consciousness about NER 2016-01-23 13:41:01 +01:00
Matthew Honnibal 3af84cfd6e * Increment version 2016-01-21 17:49:27 +01:00
Matthew Honnibal 571d26b773 Merge branch 'master' of ssh://github.com/honnibal/spaCy 2016-01-21 17:48:32 +01:00
Matthew Honnibal 6842f681e5 Merge pull request #234 from henningpeters/master
remove package version constraint
2016-01-22 03:48:12 +11:00
Henning Peters 65aeac24cb remove package version constraint 2016-01-21 17:40:51 +01:00
Matthew Honnibal 0ec4df6d7c * Write more notes about spaCy's NER 2016-01-21 16:37:13 +01:00
Matthew Honnibal 7d16f25218 * Update release notes 2016-01-21 00:24:21 +01:00
Matthew Honnibal 1270506f7e * Update release notes 2016-01-21 00:23:43 +01:00
Matthew Honnibal 792c98a438 * Increment version for OSX-fixed release of v0.100 2016-01-21 00:23:04 +01:00
Matthew Honnibal 110304f62e * Start writing bootstrap word2vec tutorial 2016-01-20 13:51:36 +01:00
Matthew Honnibal 82d011ac43 * Fix test for whitespace 2016-01-19 20:38:26 +01:00
Matthew Honnibal e89069dcae * Fix matcher test 2016-01-19 20:24:01 +01:00
Matthew Honnibal 63e3d4e27f * Add comment on Vocab.__reduce__ 2016-01-19 20:11:25 +01:00
Matthew Honnibal e1282b7f2f * Require user-custom NER classes to work without adding the label. 2016-01-19 20:11:03 +01:00
Matthew Honnibal 84c5dfbfc3 * Clean up debugging python list 2016-01-19 20:10:32 +01:00
Matthew Honnibal 04d0686b26 * Make TransitionSystem.add_action idempotent, i.e. ignore duplicate added actions. 2016-01-19 20:10:04 +01:00
Matthew Honnibal c4a89d56bd * Automatically register any entity types pre-set on the tokens, so that the NER works with user-given entity types. 2016-01-19 20:09:26 +01:00
Matthew Honnibal f0f92793f6 * Add test for user NER classes in matcher blocking the NER model. Re Issue #178 and Issue #217 2016-01-19 19:23:16 +01:00
Matthew Honnibal 65c5bc4988 * Add add_label method, to allow users to register new entity types and dependency labels. 2016-01-19 19:11:02 +01:00
Matthew Honnibal 151aa0b0e2 * Allow users to add_label, in order to extend the entity recogniser to new classes. Does not by itself add a class to the model 2016-01-19 19:09:33 +01:00
Matthew Honnibal c8e0011ebc * Add iterators to the NER and parser transition systems, to get the action types 2016-01-19 19:07:43 +01:00
Matthew Honnibal 515493c675 * Add xfail test for Issue #225: tokenization with non-whitespace delimiters 2016-01-19 13:20:14 +01:00