spaCy/spacy/cli
adrianeboyd a58cb023d7 WIP: Extending debug-data (#4114)
* Extending debug-data with dependency checks, etc.

* Modify debug-data to load with GoldCorpus to iterate over .json/.jsonl
files within directories

* Add GoldCorpus iterator train_docs_without_preprocessing to load
original train docs without shuffling and projectivizing

* Report number of misaligned tokens

* Add more dependency checks and messages

* Update spacy/cli/debug_data.py

Co-Authored-By: Ines Montani <ines@ines.io>

* Fixed conflict

* Move counts to _compile_gold()

* Move all dependency nonproj/sent/head/cycle counting to
_compile_gold()

* Unclobber previous merges

* Update variable names

* Update more variable names, fix misspelling

* Don't clobber loading error messages

* Only warn about misaligned tokens if present
2019-08-16 10:52:46 +02:00
converters
__init__.py
_schemas.py
convert.py
debug_data.py WIP: Extending debug-data (#4114) 2019-08-16 10:52:46 +02:00
download.py Require downloaded model in pkg_resources (#4090) 2019-08-07 13:18:11 +02:00
evaluate.py
info.py
init_model.py Fix init_model if there's no vocab (closes #4048) (#4049) 2019-08-01 17:26:09 +02:00
link.py
package.py Also support "requirements" in model.json 2019-07-27 13:34:57 +02:00
pretrain.py
profile.py
train.py
validate.py