spaCy

Sofie Van Landeghem 06f0a8daa0 Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
* fix grad_clip naming
* cleaning up pretrained_vectors out of cfg
* further refactoring Model init's
* move Model building out of pipes
* further refactor to require a model config when creating a pipe
* small fixes
* making cfg in nn_parser more consistent
* fixing nr_class for parser
* fixing nn_parser's nO
* fix printing of loss
* architectures in own file per type, consistent naming
* convenience methods default_tagger_config and default_tok2vec_config
* let create_pipe access default config if available for that component
* default_parser_config
* move defaults to separate folder
* allow reading nlp from package or dir with argument 'name'
* architecture spacy.VocabVectors.v1 to read static vectors from file
* cleanup
* default configs for nel, textcat, morphologizer, tensorizer
* fix imports
* fixing unit tests
* fixes and clean up
* fixing defaults, nO, fix unit tests
* restore parser IO
* fix IO
* 'fix' serialization test
* add .cfg to manifest fix example configs with additional arguments
* replace Morpohologizer with Tagger
* add IO bit when testing overfitting of tagger (currently failing)
* fix IO - don't initialize when reading from disk
* expand overfitting tests to also check IO goes OK
* remove dropout from HashEmbed to fix Tagger performance
* add defaults for sentrec
* update thinc
* always pass a Model instance to a Pipe
* fix piped_added statement
* remove obsolete W029
* remove obsolete errors
* restore byte checking tests (work again)
* clean up test
* further test cleanup
* convert from config to Model in create_pipe
* bring back error when component is not initialized
* cleanup
* remove calls for nlp2.begin_training
* use thinc.api in imports
* allow setting charembed's nM and nC
* fix for hardcoded nM/nC + unit test
* formatting fixes
* trigger build
..
cli	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
displacy	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
lang	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
matcher	Add better schemas and validation using Pydantic (#4831)	2019-12-25 12:39:49 +01:00
ml	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
pipeline	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
syntax	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
tests	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
tokens	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
__init__.pxd	…
__init__.py	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
__main__.py	Update spaCy for thinc 8.0.0 (#4920)	2020-01-29 17:06:46 +01:00
_ml.py	Update spaCy for thinc 8.0.0 (#4920)	2020-01-29 17:06:46 +01:00
about.py	Set dev version	2020-01-30 15:41:45 +01:00
analysis.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
attrs.pxd	Merge branch 'master' into develop	2020-02-18 14:47:23 +01:00
attrs.pyx	Merge branch 'master' into develop	2020-02-18 14:47:23 +01:00
compat.py	Merge branch 'develop' into refactor/remove-symlinks	2020-02-18 17:22:20 +01:00
errors.py	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
glossary.py	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
gold.pxd	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
gold.pyx	Tidy up and fix issues	2020-02-18 15:17:03 +01:00
kb.pxd	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
kb.pyx	More formatting changes	2019-12-25 17:59:52 +01:00
language.py	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
lemmatizer.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
lexeme.pxd	…
lexeme.pyx	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
lookups.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
morphology.pxd	Modify morphology to support arbitrary features (#4932)	2020-01-23 22:01:54 +01:00
morphology.pyx	Modify morphology to support arbitrary features (#4932)	2020-01-23 22:01:54 +01:00
parts_of_speech.pxd	…
parts_of_speech.pyx	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
schemas.py	Allow spacy attr in token pattern	2020-02-18 14:32:53 +01:00
scorer.py	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
strings.pxd	…
strings.pyx	Drop Python 2.7 and 3.5 (#4828)	2019-12-22 01:53:56 +01:00
structs.pxd	Modify morphology to support arbitrary features (#4932)	2020-01-23 22:01:54 +01:00
symbols.pxd	Merge branch 'master' into develop	2020-02-18 14:47:23 +01:00
symbols.pyx	Merge branch 'master' into develop	2020-02-18 14:47:23 +01:00
tokenizer.pxd	Generalize handling of tokenizer special cases (#4259)	2019-11-13 21:24:35 +01:00
tokenizer.pyx	Improve setup.py and call into Cython directly (#4952)	2020-02-11 17:46:18 -05:00
typedefs.pxd	Update spaCy for thinc 8.0.0 (#4920)	2020-01-29 17:06:46 +01:00
typedefs.pyx	…
util.py	Default settings to configurations (#4995)	2020-02-27 18:42:27 +01:00
vectors.pyx	Tidy up and auto-format	2020-02-18 15:38:18 +01:00
vocab.pxd	💫 WIP: Basic lookup class scaffolding and JSON for all lemmati… (#4167)	2019-08-22 14:21:32 +02:00
vocab.pyx	Tidy up and auto-format	2020-02-18 15:38:18 +01:00