| Matthew Honnibal | 049c937540 | Have the matcher return character offsets, to handle the match better. | 2016-10-17 15:58:57 +02:00 |
| Matthew Honnibal | 9b60186266 | Fix doc class | 2016-10-17 15:23:47 +02:00 |
| Matthew Honnibal | 6cbdc94959 | Lots of updates to Matcher, to make entity handling sane. | 2016-10-17 15:23:31 +02:00 |
| Matthew Honnibal | 7fd98fc91c | Remove deprecation shim around str/bytes in Token. | 2016-10-17 14:02:47 +02:00 |
| Matthew Honnibal | b67697a97b | Improve API for doc.merge() and span.merge(), to use keyword arguments. | 2016-10-17 14:02:13 +02:00 |
| Matthew Honnibal | fbb7f3f15c | Add user_data attribute to Doc object. | 2016-10-17 11:43:22 +02:00 |
| Matthew Honnibal | c1abc8f6ed | Fix deprecation stuff in Token: Remove the shim for the str/unicode semantics, and raise for has_repvec and repvec | 2016-10-17 11:18:41 +02:00 |
| Matthew Honnibal | 4ba9eadf3d | Merge branch 'v1.0.0-rc1' of ssh://github.com/explosion/spaCy into v1.0.0-rc1 | 2016-10-17 02:45:44 +02:00 |
| Matthew Honnibal | 09ab447a18 | Remove tensor property from token. | 2016-10-17 02:45:09 +02:00 |
| Matthew Honnibal | 5d10e2005c | Defer some attributes to Doc, via getters_for_tokens attribute. | 2016-10-17 02:44:49 +02:00 |
| Matthew Honnibal | 8829984efb | Remove tensor attribute from Span and Token. | 2016-10-17 02:44:04 +02:00 |
| Matthew Honnibal | d15a88c66a | Defer some attributes to Doc via getters_for_spans | 2016-10-17 02:43:35 +02:00 |
| Matthew Honnibal | 62230dd13a | Add getters_for_spans and getters_for_tokens attributes to Doc. Fix docstring | 2016-10-17 02:42:51 +02:00 |
| Matthew Honnibal | ae11ea8240 | Add getters_for_tokens and getters_for_spans attributes to Doc object. | 2016-10-17 02:42:05 +02:00 |
| Matthew Honnibal | be48a7b4f3 | Fix conftest for website tests. | 2016-10-17 01:54:26 +02:00 |
| Matthew Honnibal | 8951bf6989 | Update matcher tests | 2016-10-17 01:53:24 +02:00 |
| Matthew Honnibal | 0cf4aff470 | Set default path in EN/DE tests. | 2016-10-17 01:52:49 +02:00 |
| Matthew Honnibal | cd71b6b0a9 | Remove test of parser pickle | 2016-10-17 01:52:10 +02:00 |
| Matthew Honnibal | 5bc101006e | Add cfg field to Tagger | 2016-10-17 01:03:41 +02:00 |
| Matthew Honnibal | 517f090cbf | Use GoldParse in tagger.update | 2016-10-17 00:55:15 +02:00 |
| Matthew Honnibal | 59038f7efa | Restore support for prior data format -- specifically, the labels field of the config. | 2016-10-17 00:53:26 +02:00 |
| Matthew Honnibal | 7887ab3b36 | Fix default use of feature_templates in parser | 2016-10-16 21:41:56 +02:00 |
| Matthew Honnibal | f787cd29fe | Refactor the pipeline classes to make them more consistent, and remove the redundant blank() constructor. | 2016-10-16 21:34:57 +02:00 |
| Matthew Honnibal | 311a985fe0 | Add input error handling in Doc | 2016-10-16 18:16:42 +02:00 |
| Matthew Honnibal | 06322ba99d | Add words and spaces keyword arguments to Doc. | 2016-10-16 18:13:03 +02:00 |
| Matthew Honnibal | ca51f3b77e | Use DependencyParser and EntityRecognizer in the Language class. | 2016-10-16 17:58:12 +02:00 |
| Matthew Honnibal | 195d998a12 | Fix GoldParse argument to tagger.update | 2016-10-16 17:05:09 +02:00 |
| Matthew Honnibal | 274a4d4272 | Fix queue Python property in StateClass | 2016-10-16 17:04:41 +02:00 |
| Matthew Honnibal | e8c8aa08ce | Make action_name optional in StepwiseState | 2016-10-16 17:04:16 +02:00 |
| Matthew Honnibal | 4bb73b1a93 | Fix parser labels in pipeline | 2016-10-16 17:03:22 +02:00 |
| Matthew Honnibal | a81c5a7abf | Fix name of labels keyword to 'actions'. | 2016-10-16 12:00:27 +02:00 |
| Matthew Honnibal | a079677984 | Fix omission of O action when creating blank entity recognizer | 2016-10-16 11:43:25 +02:00 |
| Matthew Honnibal | 5444d38cc6 | Update test for biluo tags | 2016-10-16 11:42:45 +02:00 |
| Matthew Honnibal | 4fc56d4a31 | Rename 'labels' to 'actions' in parser options | 2016-10-16 11:42:26 +02:00 |
| Matthew Honnibal | 8a6b35d266 | Delay binding in MakeDoc | 2016-10-16 11:41:55 +02:00 |
| Matthew Honnibal | 52b48b415e | Fix GoldParse class | 2016-10-16 11:41:36 +02:00 |
| Matthew Honnibal | 3259a63779 | Whitespace | 2016-10-16 01:47:28 +02:00 |
| Matthew Honnibal | 509b30834f | Add a pipeline module, to collect and wrap processes for annotation | 2016-10-16 01:47:12 +02:00 |
| Matthew Honnibal | 0317cea0ad | Fix GoldParse | 2016-10-15 23:55:07 +02:00 |
| Matthew Honnibal | 1c62573a41 | Fix spacy.train | 2016-10-15 23:53:46 +02:00 |
| Matthew Honnibal | a48aa15384 | Improve the API for the GoldParse class. | 2016-10-15 23:53:29 +02:00 |
| Matthew Honnibal | e07fe92b27 | Draft a refactored init for the GoldParse class | 2016-10-15 22:09:52 +02:00 |
| Matthew Honnibal | 47afef7d6b | Add init.py for gold tests | 2016-10-15 21:51:28 +02:00 |
| Matthew Honnibal | 86ae665c78 | Add function for entity->biluo transformation | 2016-10-15 21:51:04 +02:00 |
| Matthew Honnibal | 2163fd238f | Add tests for entity->biluo transformation | 2016-10-15 21:50:43 +02:00 |
| Matthew Honnibal | 5e923b9bfa | Return None in match_best_version if not path exists. | 2016-10-15 14:47:29 +02:00 |
| Matthew Honnibal | 2516382106 | Fix loading of English in span test | 2016-10-15 14:44:37 +02:00 |
| Matthew Honnibal | dda2fc6bef | Add empty data directory | 2016-10-15 14:25:25 +02:00 |
| Matthew Honnibal | 049197e0ae | Update tests, somewhat messily. | 2016-10-15 14:14:04 +02:00 |
| Matthew Honnibal | 1e1a1d9517 | Update matcher test | 2016-10-15 14:13:41 +02:00 |