spaCy/spacy/cli
Matthew Honnibal ecb3c4e8f4
Create corpus iterator and batcher from registry during training (#5865)
* Move batchers into their own module (and registry)

* Update CLI

* Update Corpus and batcher

* Update tests

* Update one config

* Merge 'evaluation' block back under [training]

* Import batchers in gold __init__

* Fix batchers

* Update config

* Update schema

* Update util

* Don't assume train and dev are actually paths

* Update onto-joint config

* Fix missing import

* Format

* Format

* Update spacy/gold/corpus.py

Co-authored-by: Ines Montani <ines@ines.io>

* Fix name

* Update default config

* Fix get_length option in batchers

* Update test

* Add comment

* Pass path into Corpus

* Update docstring

* Update schema and configs

* Update config

* Fix test

* Fix paths

* Fix print

* Fix create_train_batches

* [training.read_train] -> [training.train_corpus]

* Update onto-joint config

Co-authored-by: Ines Montani <ines@ines.io>
2020-08-04 15:09:37 +02:00
..
project Clean up link_vectors_to_models unused stuff 2020-07-29 14:01:11 +02:00
__init__.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00
_util.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00
convert.py Tidy up, autoformat, add types 2020-07-25 15:01:15 +02:00
debug_data.py Create corpus iterator and batcher from registry during training (#5865) 2020-08-04 15:09:37 +02:00
debug_model.py Merge pull request #5855 from svlandeg/fix/cli-debug 2020-08-03 13:09:20 +02:00
download.py Update CLI commans to use one shared util file 2020-07-10 17:57:40 +02:00
evaluate.py Create corpus iterator and batcher from registry during training (#5865) 2020-08-04 15:09:37 +02:00
info.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
init_config.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00
init_model.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00
package.py Add flag to toggle sdist creation on package [ci skip] 2020-07-27 16:52:23 +02:00
pretrain.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00
profile.py Refactor pipeline components, config and language data (#5759) 2020-07-22 13:42:59 +02:00
train.py Create corpus iterator and batcher from registry during training (#5865) 2020-08-04 15:09:37 +02:00
validate.py Add init CLI and init config (#5854) 2020-08-02 15:18:30 +02:00