spaCy/spacy/lang/pt
adrianeboyd 923a453449
Modifications/updates to Portuguese tokenization (#5203)
Modifications to Portuguese tokenization for UD_Portuguese-Bosque.
Instead of splitting contractions as exceptions, they are kept as merged
tokens.
2020-03-25 11:27:53 +01:00
__init__.py Move lookup tables out of the core library (#4346) 2019-10-01 00:01:27 +02:00
examples.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
lex_attrs.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
norm_exceptions.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
punctuation.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
stop_words.py 💫 Tidy up and auto-format .py files (#2983) 2018-11-30 17:03:03 +01:00
tag_map.py Add missing tags to el/es/pt tag maps (#4696) 2019-11-23 14:57:21 +01:00
tokenizer_exceptions.py Modifications/updates to Portuguese tokenization (#5203) 2020-03-25 11:27:53 +01:00