spaCy

Adriane Boyd 960d9cfadc Officially support DependencyMatcher	2020-09-02 17:45:29 +02:00

Add official support for the `DependencyMatcher`. Redesign the pattern specification. Fix and extend operator implementations. Update API docs and add usage docs.

Patterns
--------

Refactor pattern structure to:

```
{
    "LEFT_ID": str,
    "REL_OP": str,
    "RIGHT_ID": str,
    "RIGHT_ATTRS": dict,
}
```

The first node contains only `RIGHT_ID` and `RIGHT_ATTRS` and all subsequent nodes contain all four keys.

New operators
-------------

Because of the way patterns are constructed from left to right, it's helpful to have `follows` operators along with `precedes` operators. Add operators for simple precedes / follows alongside immediate precedes / follows.

* `.*`: precedes
* `;`: immediately follows
* `;*`: follows

Operator fixes
--------------

* `<` and `<<` do not include the node itself
* Fix reversed order for all operators involving linear precedence (`.`, all sibling operators)
* Linear precedence operators do not match nodes outside the same parse

Additional fixes
----------------

* Use v3 Matcher API
* Support `get` and `remove`
* Support pickling
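
The pattern shape described in the commit message above can be illustrated with a small, library-free sketch. The helper `check_pattern` is hypothetical and is not part of spaCy. It only checks the key layout stated in the commit message (first node: `RIGHT_ID` and `RIGHT_ATTRS`; later nodes: all four keys). The extra check that each `LEFT_ID` names an earlier node is an assumption drawn from the remark that patterns are constructed from left to right. The example pattern and its attribute values (`POS`, `DEP`, `nsubj`) are illustrative only.

```python
from typing import Any, Dict, List

# Key layout as described in the commit message.
FIRST_NODE_KEYS = {"RIGHT_ID", "RIGHT_ATTRS"}
LATER_NODE_KEYS = {"LEFT_ID", "REL_OP", "RIGHT_ID", "RIGHT_ATTRS"}


def check_pattern(pattern: List[Dict[str, Any]]) -> List[str]:
    """Hypothetical helper: return a list of problems with the pattern shape.

    An empty list means every node has the expected keys. This is a sketch,
    not spaCy's own validation.
    """
    problems: List[str] = []
    seen_ids = set()
    for i, node in enumerate(pattern):
        expected = FIRST_NODE_KEYS if i == 0 else LATER_NODE_KEYS
        if set(node) != expected:
            problems.append(
                f"node {i}: expected keys {sorted(expected)}, got {sorted(node)}"
            )
            continue
        # Assumption: LEFT_ID must refer to a node defined earlier in the list.
        if i > 0 and node["LEFT_ID"] not in seen_ids:
            problems.append(
                f"node {i}: LEFT_ID {node['LEFT_ID']!r} is not defined by an earlier node"
            )
        seen_ids.add(node["RIGHT_ID"])
    return problems


# Illustrative pattern: a verb node, then a subject node attached to it.
pattern = [
    {"RIGHT_ID": "verb", "RIGHT_ATTRS": {"POS": "VERB"}},
    {
        "LEFT_ID": "verb",
        "REL_OP": ">",
        "RIGHT_ID": "subject",
        "RIGHT_ATTRS": {"DEP": "nsubj"},
    },
]

print(check_pattern(pattern))
```

A pattern whose first node carries `LEFT_ID` or `REL_OP`, or whose later nodes omit any of the four keys, would be reported by this sketch as a shape problem.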
..
architectures.md	Rename Transformer listener (#6001)	2020-08-31 12:41:39 +02:00
attributeruler.md	Update docs [ci skip]	2020-08-29 18:43:19 +02:00
cli.md	Add more info to debug config	2020-08-27 18:17:58 +02:00
corpus.md	…
cython-classes.md	…
cython-structs.md	…
cython.md	…
data-formats.md	Update docs	2020-08-29 12:36:05 +02:00
dependencymatcher.md	Officially support DependencyMatcher	2020-09-02 17:45:29 +02:00
dependencyparser.md	…
doc.md	…
docbin.md	…
entitylinker.md	…
entityrecognizer.md	…
entityruler.md	…
example.md	…
index.md	…
kb.md	…
language.md	Update docs	2020-08-29 12:36:05 +02:00
lemmatizer.md	Update usage docs for lemmatization and morphology	2020-08-29 15:56:50 +02:00
lexeme.md	…
lookups.md	…
matcher.md	Deprecate (Phrase)Matcher.pipe	2020-08-31 17:01:24 +02:00
morphologizer.md	…
morphology.md	…
phrasematcher.md	Deprecate (Phrase)Matcher.pipe	2020-08-31 17:01:24 +02:00
pipe.md	…
pipeline-functions.md	…
scorer.md	…
sentencerecognizer.md	Update sentence segmentation usage docs	2020-08-28 10:58:16 +02:00
sentencizer.md	…
span.md	…
stringstore.md	…
tagger.md	Add AttributeRuler.score (#5963)	2020-08-26 15:39:30 +02:00
textcategorizer.md	…
tok2vec.md	…
token.md	…
tokenizer.md	…
top-level.md	Update docs [ci skip]	2020-08-31 17:06:33 +02:00
transformer.md	small fixes	2020-08-31 14:46:00 +02:00
vectors.md	…
vocab.md	…