spaCy/spacy
Adriane Boyd a77c4c3465
Add strings and ENT_KB_ID to Doc serialization (#5691)
* Add strings for all writeable Token attributes to `Doc.to/from_bytes()`.
* Add ENT_KB_ID to default attributes.
2020-07-02 17:11:57 +02:00
..
cli prevent loading a pretrained Tok2Vec layer AND pretrained components 2020-05-29 17:38:33 +02:00
data
displacy Add missing import 2020-04-28 13:48:37 +02:00
lang Revert "Convert custom user_data to token extension format for Japanese tokenizer (#5652)" (#5665) 2020-06-29 14:34:15 +02:00
matcher Switch to new add API in PhraseMatcher unpickle 2020-05-25 11:22:47 +02:00
ml Replace function registries with catalogue (#4584) 2019-11-07 11:45:22 +01:00
pipeline Disregard special tag _SP in check for new tag map (#5641) 2020-06-26 09:23:21 +02:00
syntax Fix and add warnings related to spacy-lookups-data (#5588) 2020-06-15 14:58:29 +02:00
tests Add strings and ENT_KB_ID to Doc serialization (#5691) 2020-07-02 17:11:57 +02:00
tokens Add strings and ENT_KB_ID to Doc serialization (#5691) 2020-07-02 17:11:57 +02:00
__init__.pxd
__init__.py Simplify warnings 2020-04-28 13:37:37 +02:00
__main__.py Use latest wasabi 2019-11-04 02:38:45 +01:00
_ml.py Skip duplicate lexeme rank setting (#5401) 2020-05-14 18:26:12 +02:00
about.py Set version to v2.3.0 2020-06-15 18:22:25 +02:00
analysis.py Simplify warnings 2020-04-28 13:37:37 +02:00
attrs.pxd Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
attrs.pyx Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
compat.py Replace function registries with catalogue (#4584) 2019-11-07 11:45:22 +01:00
errors.py Fix and add warnings related to spacy-lookups-data (#5588) 2020-06-15 14:58:29 +02:00
glossary.py Update tag maps and docs for English and German (#4501) 2019-10-24 12:56:05 +02:00
gold.pxd
gold.pyx Updates to docstrings (#5589) 2020-06-15 14:58:36 +02:00
kb.pxd Tidy up and avoid absolute spacy imports in core 2020-05-21 20:05:03 +02:00
kb.pyx Merge pull request #5264 from lfiedler/issue-5230 2020-05-22 00:31:07 +02:00
language.py Include git commit in package and model meta (#5694) 2020-07-02 17:10:27 +02:00
lemmatizer.py Move lemmatizer is_base_form to language settings (#5663) 2020-06-29 14:16:57 +02:00
lexeme.pxd Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
lexeme.pyx Fix polarity of Token.is_oov and Lexeme.is_oov (#5634) 2020-06-23 13:29:51 +02:00
lookups.py Reduce memory usage of Lookup's BloomFilter (#5606) 2020-06-26 14:09:10 +02:00
morphology.pxd
morphology.pyx Prefer _SP over SP for default tag map space attrs 2020-05-26 14:57:13 +02:00
parts_of_speech.pxd
parts_of_speech.pyx
scorer.py Fix GoldParse init when token count differs (#5191) 2020-03-26 10:46:23 +01:00
strings.pxd
strings.pyx
structs.pxd Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
symbols.pxd Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
symbols.pyx Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
tokenizer.pxd Rename to url_match 2020-05-22 12:41:03 +02:00
tokenizer.pyx Rename to url_match 2020-05-22 12:41:03 +02:00
typedefs.pxd
typedefs.pyx
util.py Skip vocab in component config overrides (#5624) 2020-06-23 23:21:11 +02:00
vectors.pyx fix deserialization order 2020-05-30 12:53:32 +02:00
vocab.pxd Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
vocab.pyx Updates to docstrings (#5589) 2020-06-15 14:58:36 +02:00