mirror of https://github.com/explosion/spaCy.git
e1f777b151
* don't split on a colon. Colon is used to attach suffixes for abbreviations * tokenize on any of LIST_HYPHENS (except a single hyphen), not just on -- * simplify infix rules by merging similar rules |
||
---|---|---|
.. | ||
__init__.py | ||
examples.py | ||
lex_attrs.py | ||
punctuation.py | ||
stop_words.py | ||
tokenizer_exceptions.py |