mirror of https://github.com/explosion/spaCy.git
923a453449
Modifications to Portuguese tokenization for UD_Portuguese-Bosque. Instead of splitting contractions as exceptions, they are kept as merged tokens. |
||
---|---|---|
__init__.py | ||
examples.py | ||
lex_attrs.py | ||
norm_exceptions.py | ||
punctuation.py | ||
stop_words.py | ||
tag_map.py | ||
tokenizer_exceptions.py |
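
The difference described in the commit message can be illustrated with a minimal, spaCy-free sketch. The `CONTRACTION_PARTS` table and the `tokenize` function below are hypothetical names made up for this illustration; they are not taken from `tokenizer_exceptions.py`, and the real exception data and tokenizer logic may look quite different.

```python
# Hypothetical exception table mapping a Portuguese contraction to its parts.
# Illustrative only; this is NOT the data shipped in tokenizer_exceptions.py.
CONTRACTION_PARTS = {
    "do": ["de", "o"],
    "na": ["em", "a"],
    "pelo": ["por", "o"],
}


def tokenize(text, split_contractions):
    """Whitespace tokenizer that optionally expands known contractions.

    split_contractions=True mimics treating contractions as exceptions
    that are split; False keeps each contraction as one merged token.
    """
    tokens = []
    for word in text.split():
        parts = CONTRACTION_PARTS.get(word.lower())
        if split_contractions and parts:
            # Replace the contraction with its component words.
            tokens.extend(parts)
        else:
            # Keep the surface form as a single token.
            tokens.append(word)
    return tokens


sentence = "Ela mora na casa do pai"
merged = tokenize(sentence, split_contractions=False)
split = tokenize(sentence, split_contractions=True)
print(merged)
print(split)
```

With `split_contractions=False`, `na` and `do` stay as single tokens, which corresponds to the merged-token behavior the commit message describes.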