spaCy/spacy
Adriane Boyd 6f314f99c4
Use Latin normalization for Serbian attrs (#12608)
* Use Latin normalization for Serbian attrs

Use Latin normalization for Serbian `NORM`, `PREFIX`, and `SUFFIX`.

* Update NORMs in tokenizer exceptions and related tests

* Add tests for all custom lex attrs

* Remove unused imports
2023-05-08 12:33:56 +02:00
cli Remove #egg from download URLs (#12567) 2023-05-04 17:13:12 +02:00
displacy Allow passing a Span to displacy.parse_deps (#12477) 2023-03-31 09:44:01 +02:00
kb rely on is_empty property instead of __len__ (#12347) 2023-03-01 12:06:07 +01:00
lang Use Latin normalization for Serbian attrs (#12608) 2023-05-08 12:33:56 +02:00
matcher perf(REL_OP): Replace some token.children with token.rights or token.lefts (#12528) 2023-04-17 13:16:34 +02:00
ml Spancat speed improvement (#12577) 2023-04-27 15:27:13 +02:00
pipeline Fix pickle for ngram suggester (#12486) 2023-03-31 13:43:51 +02:00
tests Use Latin normalization for Serbian attrs (#12608) 2023-05-08 12:33:56 +02:00
tokens Add default option to MorphAnalysis.get (#12545) 2023-04-20 14:06:32 +02:00
training Add model-last saving mechanism to pretraining (#12459) 2023-04-03 15:24:03 +02:00
__init__.pxd
__init__.py Simplify and clarify enable/disable behavior of spacy.load() (#11459) 2022-09-27 14:22:36 +02:00
__main__.py
about.py Set version to v3.6.0.dev0 2023-05-08 09:10:35 +02:00
attrs.pxd
attrs.pyx Intify IOB (#9738) 2022-01-20 13:19:38 +01:00
compat.py Custom component types in spacy.ty (#9469) 2021-10-21 15:31:06 +02:00
default_config.cfg Add `training.before_update` callback (#11739) 2022-11-23 17:54:58 +01:00
default_config_pretraining.cfg Add new parameter for saving every n epoch in pretraining (#8912) 2021-08-12 11:14:48 +02:00
errors.py Support floret for PretrainVectors (#12435) 2023-03-24 16:28:51 +01:00
glossary.py Add glossary entry for root (#10821) 2022-05-20 09:56:32 +02:00
language.py Have logging calls use string formatting types (#12215) 2023-02-02 11:15:22 +01:00
lexeme.pxd
lexeme.pyi fix types (#12365) 2023-03-07 13:29:08 +01:00
lexeme.pyx fix types (#12365) 2023-03-07 13:29:08 +01:00
lookups.py Fix issues for Mypy 0.950 and Pydantic 1.9.0 (#10786) 2022-05-25 09:33:54 +02:00
morphology.pxd Clean up Morphology imports and definitions (#7441) 2021-04-26 16:54:23 +02:00
morphology.pyx Clean up Morphology imports and definitions (#7441) 2021-04-26 16:54:23 +02:00
parts_of_speech.pxd
parts_of_speech.pyx
pipe_analysis.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00
py.typed Add py.typed 2021-03-16 09:48:31 +01:00
schemas.py Auto-format code with black (#12100) 2023-01-13 10:12:10 +01:00
scorer.py Restore v2 token_acc score implementation (#12073) 2023-01-11 08:01:47 +01:00
strings.pxd `StringStore`-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
strings.pyi Fix StringStore.__getitem__ return type depending on parameter types (#10741) 2022-05-03 17:57:07 +02:00
strings.pyx `StringStore`-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
structs.pxd
symbols.pxd
symbols.pyx
tokenizer.pxd Add tokenizer option to allow Matcher handling for all rules (#10452) 2022-03-24 13:21:32 +01:00
tokenizer.pyx Load exceptions last in Tokenizer.from_bytes (#12553) 2023-04-20 11:30:34 +02:00
ty.py Custom component types in spacy.ty (#9469) 2021-10-21 15:31:06 +02:00
typedefs.pxd
typedefs.pyx
util.py Add tests for projects to master (#12303) 2023-02-23 10:22:57 +01:00
vectors.pyx Add equality definition for vectors (#11806) 2022-11-16 09:44:42 +01:00
vocab.pxd Add support for floret vectors (#8909) 2021-10-27 14:08:31 +02:00
vocab.pyi Add vector deduplication (#10551) 2022-03-30 08:54:23 +02:00
vocab.pyx fix comparison of constants (#11834) 2022-11-21 08:12:03 +01:00