spaCy/spacy/tokens
adrianeboyd b71a11ff6d
Update morphologizer (#5108)
* Add pos and morph scoring to Scorer

Add pos, morph, and morph_per_type to `Scorer`. Report pos and morph
accuracy in `spacy evaluate`.

* Update morphologizer for v3

* switch to tagger-based morphologizer
* use `spacy.HashCharEmbedCNN` for morphologizer defaults
* add `Doc.is_morphed` flag

* Add morphologizer to train CLI

* Add basic morphologizer pipeline tests

* Add simple morphologizer training example

* Remove subword_features from CharEmbed models

Remove `subword_features` argument from `spacy.HashCharEmbedCNN.v1` and
`spacy.HashCharEmbedBiLSTM.v1` since in these cases `subword_features`
is always `False`.

* Rename setting in morphologizer example

Use `with_pos_tags` instead of `without_pos_tags`.

* Fix kwargs for spacy.HashCharEmbedBiLSTM.v1

* Remove defaults for spacy.HashCharEmbedBiLSTM.v1

Remove default `nM/nC` for `spacy.HashCharEmbedBiLSTM.v1`.

* Set random seed for textcat overfitting test
2020-04-02 14:46:32 +02:00
| File | Last commit | Date |
| --- | --- | --- |
| `__init__.pxd` | | |
| `__init__.py` | Modify morphology to support arbitrary features (#4932) | 2020-01-23 22:01:54 +01:00 |
| `_retokenize.pyx` | Merge branch 'master' into tmp/sync | 2020-03-26 13:38:14 +01:00 |
| `_serialize.py` | fix serialization of empty doc + unit test | 2020-03-13 16:07:56 +01:00 |
| `doc.pxd` | Update morphologizer (#5108) | 2020-04-02 14:46:32 +02:00 |
| `doc.pyx` | bugfix in span similarity (#5155) | 2020-03-29 13:56:07 +02:00 |
| `morphanalysis.pxd` | Modify morphology to support arbitrary features (#4932) | 2020-01-23 22:01:54 +01:00 |
| `morphanalysis.pyx` | Modify morphology to support arbitrary features (#4932) | 2020-01-23 22:01:54 +01:00 |
| `span.pxd` | annotate kb_id through ents in doc | 2019-03-22 11:36:44 +01:00 |
| `span.pyx` | bugfix in span similarity (#5155) | 2020-03-29 13:56:07 +02:00 |
| `token.pxd` | Tidy up compiler flags and imports (#5071) | 2020-03-02 11:48:10 +01:00 |
| `token.pyx` | Merge branch 'master' into tmp/sync | 2020-03-26 13:38:14 +01:00 |
| `underscore.py` | Merge branch 'master' into tmp/sync | 2020-03-26 13:38:14 +01:00 |