Sofie Van Landeghem
33ba9ff464
set encodings explicitly to utf8 ( #4551 )
2019-10-29 13:16:55 +01:00
Sofie Van Landeghem
4e7259c6cf
Bugfix initializing DocBin with attributes ( #4368 )
...
* docbin init fix + documentation fix + unit tests
* newline
* try with zlib instead of gzip (python 2 incompatibilities)
2019-10-03 14:48:45 +02:00
Ines Montani
b6670bf0c2
Use consistent spelling
2019-10-02 10:37:39 +02:00
adrianeboyd
d844030fd8
Update UD bin scripts ( #4315 )
...
* Update imports for `bin/`
* Add all currently supported languages
* Update subtok merger for new Matcher validation
* Modify blinded check to look at tokens instead of lemmas (for corpora
with tokens but not lemmas like Telugu)
2019-09-27 16:20:38 +02:00
Matthew Honnibal
7b858ba606
Update from master
2019-09-10 20:14:08 +02:00
Sofie Van Landeghem
482c7cd1b9
pulling tqdm imports in functions to avoid bug (tmp fix) ( #4263 )
2019-09-09 16:32:11 +02:00
Matthew Honnibal
bcd08f20af
Merge changes from master
2019-08-21 14:18:52 +02:00
svlandeg
b58bace84b
small fixes
2019-06-24 10:55:04 +02:00
Ines Montani
7400c7f8a7
Move UD scripts to bin
2019-03-20 01:19:34 +01:00