spaCy

History

Sofie Van Landeghem e48a09df4e Example class for training data (#4543 ) * OrigAnnot class instead of gold.orig_annot list of zipped tuples * from_orig to replace from_annot_tuples * rename to RawAnnot * some unit tests for GoldParse creation and internal format * removing orig_annot and switching to lists instead of tuple * rewriting tuples to use RawAnnot (+ debug statements, WIP) * fix pop() changing the data * small fixes * pop-append fixes * return RawAnnot for existing GoldParse to have uniform interface * clean up imports * fix merge_sents * add unit test for 4402 with new structure (not working yet) * introduce DocAnnot * typo fixes * add unit test for merge_sents * rename from_orig to from_raw * fixing unit tests * fix nn parser * read_annots to produce text, doc_annot pairs * _make_golds fix * rename golds_to_gold_annots * small fixes * fix encoding * have golds_to_gold_annots use DocAnnot * missed a spot * merge_sents as function in DocAnnot * allow specifying only part of the token-level annotations * refactor with Example class + underlying dicts * pipeline components to work with Example objects (wip) * input checking * fix yielding * fix calls to update * small fixes * fix scorer unit test with new format * fix kwargs order * fixes for ud and conllu scripts * fix reading data for conllu script * add in proper errors (not fixed numbering yet to avoid merge conflicts) * fixing few more small bugs * fix EL script		2019-11-11 17:35:27 +01:00
..
converters	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
__init__.py	Move UD scripts to bin	2019-03-20 01:19:34 +01:00
_schemas.py	Store JSON schemas in Python and tidy up (#3235 )	2019-02-07 19:44:31 +11:00
convert.py	Auto-format [ci skip]	2019-10-24 16:21:08 +02:00
debug_data.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
download.py	Improve usage of pkg_resources and handling of entry points (#4387 )	2019-10-07 17:22:09 +02:00
evaluate.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
info.py	Small CLI improvements (#3030 )	2018-12-08 11:49:43 +01:00
init_model.py	Support model name in init-model	2019-09-26 03:01:32 +02:00
link.py	Small CLI improvements (#3030 )	2018-12-08 11:49:43 +01:00
package.py	Also support "requirements" in model.json	2019-07-27 13:34:57 +02:00
pretrain.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
profile.py	pulling tqdm imports in functions to avoid bug (tmp fix) (#4263 )	2019-09-09 16:32:11 +02:00
train.py	Example class for training data (#4543 )	2019-11-11 17:35:27 +01:00
validate.py	Improve usage of pkg_resources and handling of entry points (#4387 )	2019-10-07 17:22:09 +02:00