spaCy/spacy
Kamolsit Mongkolsrisawat dcc67f3f51 Update Thai tokenizer_exception list (#3529)
* add tokenizer_exceptions word (ก-น) from https://goo.gl/JpJ2qq

* update tokenizer_exceptions word list

* add contributor file
2019-04-03 09:13:36 +02:00
..
cli Auto-format 2019-04-01 12:11:27 +02:00
data
displacy 💫 Fix displaCy support for RTL languages (#3393) 2019-03-11 18:52:50 +01:00
lang Update Thai tokenizer_exception list (#3529) 2019-04-03 09:13:36 +02:00
matcher Add actual deprecation warning for n_threads (resolves #3410) 2019-03-15 16:38:44 +01:00
pipeline Merge branch 'master' into feature/el-framework 2019-03-26 11:00:02 +01:00
syntax 💫 Fix class mismap on parser deserializing (closes #3433) (#3470) 2019-03-23 13:46:25 +01:00
tests Auto-format 2019-04-01 12:11:27 +02:00
tokens Merge branch 'master' into feature/el-framework 2019-03-26 11:00:02 +01:00
__init__.pxd
__init__.py Fix formatting (hopefully also restarts build properly) 2019-03-20 09:55:45 +01:00
__main__.py Update __main__.py 2019-03-20 09:43:26 +01:00
_align.pyx
_ml.py Auto-format 2019-04-01 12:11:27 +02:00
about.py Set version to v2.1.3 2019-03-23 16:47:57 +01:00
attrs.pxd
attrs.pyx
compat.py Fix tokenizer on Python2.7 (#3460) 2019-03-22 13:42:47 +01:00
errors.py error and warning messages 2019-03-22 16:55:05 +01:00
glossary.py
gold.pxd
gold.pyx Fix jsonl to json conversion (#3419) 2019-03-17 22:12:54 +01:00
kb.pxd entity as one field instead of both ID and name 2019-03-25 18:10:41 +01:00
kb.pyx entity as one field instead of both ID and name 2019-03-25 18:10:41 +01:00
language.py Merge branch 'master' into feature/el-framework 2019-03-26 11:00:02 +01:00
lemmatizer.py Tidy up and improve docs and docstrings (#3370) 2019-03-08 11:42:26 +01:00
lexeme.pxd 💫 Support lexical attributes in retokenizer attrs (closes #2390) (#3325) 2019-02-24 21:13:51 +01:00
lexeme.pyx Tidy up property code style (#3391) 2019-03-11 15:59:09 +01:00
morphology.pxd annotate kb_id through ents in doc 2019-03-22 11:36:44 +01:00
morphology.pyx annotate kb_id through ents in doc 2019-03-22 11:36:44 +01:00
parts_of_speech.pxd
parts_of_speech.pyx
scorer.py
strings.pxd
strings.pyx 💫 Make serialization methods consistent (#3385) 2019-03-10 19:16:45 +01:00
structs.pxd annotate kb_id through ents in doc 2019-03-22 11:36:44 +01:00
symbols.pxd
symbols.pyx
tokenizer.pxd
tokenizer.pyx DOC: Update tokenizer docs to include default value for batch_size in pipe (#3492) 2019-03-28 12:48:02 +01:00
typedefs.pxd
typedefs.pyx
util.py fix(util): fix decaying function output (#3495) 2019-03-28 13:24:47 +01:00
vectors.pyx Update Vectors.find docs [ci skip] 2019-03-16 17:10:57 +01:00
vocab.pxd
vocab.pyx Tidy up property code style (#3391) 2019-03-11 15:59:09 +01:00