spaCy/spacy/tests/pipeline
adrianeboyd b71a11ff6d
Update morphologizer (#5108)
* Add pos and morph scoring to Scorer

Add pos, morph, and morph_per_type to `Scorer`. Report pos and morph
accuracy in `spacy evaluate`.

* Update morphologizer for v3

* switch to tagger-based morphologizer
* use `spacy.HashCharEmbedCNN` for morphologizer defaults
* add `Doc.is_morphed` flag

* Add morphologizer to train CLI

* Add basic morphologizer pipeline tests

* Add simple morphologizer training example

* Remove subword_features from CharEmbed models

Remove `subword_features` argument from `spacy.HashCharEmbedCNN.v1` and
`spacy.HashCharEmbedBiLSTM.v1` since in these cases `subword_features`
is always `False`.

* Rename setting in morphologizer example

Use `with_pos_tags` instead of `without_pos_tags`.

* Fix kwargs for spacy.HashCharEmbedBiLSTM.v1

* Remove defaults for spacy.HashCharEmbedBiLSTM.v1

Remove default `nM/nC` for `spacy.HashCharEmbedBiLSTM.v1`.

* Set random seed for textcat overfitting test
2020-04-02 14:46:32 +02:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_analysis.py Default settings to configurations (#4995) 2020-02-27 18:42:27 +01:00
test_entity_linker.py Unit test for NEL functionality (#5114) 2020-03-06 14:42:23 +01:00
test_entity_ruler.py Tidy up and auto-format 2020-02-18 15:38:18 +01:00
test_factories.py Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
test_functions.py Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
test_morphologizer.py Update morphologizer (#5108) 2020-04-02 14:46:32 +02:00
test_pipe_methods.py More formatting changes 2019-12-25 17:59:52 +01:00
test_sentencizer.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
test_senter.py Tok2Vec: extract-embed-encode (#5102) 2020-03-08 13:23:18 +01:00
test_tagger.py Default settings to configurations (#4995) 2020-02-27 18:42:27 +01:00
test_textcat.py Update morphologizer (#5108) 2020-04-02 14:46:32 +02:00