spaCy/examples/training
adrianeboyd b71a11ff6d
Update morphologizer (#5108)
* Add pos and morph scoring to Scorer

Add pos, morph, and morph_per_type to `Scorer`. Report pos and morph
accuracy in `spacy evaluate`.

* Update morphologizer for v3

* switch to tagger-based morphologizer
* use `spacy.HashCharEmbedCNN` for morphologizer defaults
* add `Doc.is_morphed` flag

* Add morphologizer to train CLI

* Add basic morphologizer pipeline tests

* Add simple morphologizer training example

* Remove subword_features from CharEmbed models

Remove `subword_features` argument from `spacy.HashCharEmbedCNN.v1` and
`spacy.HashCharEmbedBiLSTM.v1` since in these cases `subword_features`
is always `False`.

* Rename setting in morphologizer example

Use `with_pos_tags` instead of `without_pos_tags`.

* Fix kwargs for spacy.HashCharEmbedBiLSTM.v1

* Remove defaults for spacy.HashCharEmbedBiLSTM.v1

Remove default `nM/nC` for `spacy.HashCharEmbedBiLSTM.v1`.

* Set random seed for textcat overfitting test
2020-04-02 14:46:32 +02:00
..
ner_example_data Updates/bugfixes for NER/IOB converters (#4186) 2019-08-29 12:04:01 +02:00
textcat_example_data Add textcat to train CLI (#4226) 2019-09-15 22:31:31 +02:00
conllu-config.json Generalize handling of tokenizer special cases (#4259) 2019-11-13 21:24:35 +01:00
conllu.py Merge branch 'master' into develop 2019-12-21 18:55:03 +01:00
ner_multitask_objective.py Example class for training data (#4543) 2019-11-11 17:35:27 +01:00
pretrain_kb.py Friendly error warning for NEL example script (#4881) 2020-01-14 01:51:14 +01:00
pretrain_textcat.py Default settings to configurations (#4995) 2020-02-27 18:42:27 +01:00
rehearsal.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_entity_linker.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_intent_parser.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_morphologizer.py Update morphologizer (#5108) 2020-04-02 14:46:32 +02:00
train_ner.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_new_entity_type.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_parser.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
train_tagger.py Example class for training data (#4543) 2019-11-11 17:35:27 +01:00
train_textcat.py Train textcat with config (#5143) 2020-03-29 19:40:36 +02:00
train_textcat_config.cfg Train textcat with config (#5143) 2020-03-29 19:40:36 +02:00
training-data.json Revert training example edit from #4327 (#4403) 2019-10-10 17:00:26 +02:00
vocab-data.jsonl Use even smaller examle size 2017-10-30 19:46:45 +01:00