Pavle Vidanović
|
d03401f532
|
Lemmatizer lookup dictionary for Serbian and basic tag set adde… (#4251)
* Serbian stopwords added. (Cyrillic alphabet)
* spaCy Contribution agreement included.
* Test initialize updated
* Serbian language code update. --bugfix
* Tokenizer exceptions added. Init file updated.
* Norm exceptions and lexical attributes added.
* Examples added.
* Tests added.
* sr_lang examples update.
* Tokenizer exceptions updated. (Serbian)
* Lemmatizer created. Licence included.
* Test updated.
* Tag map basic added.
* tag_map.py file removed since it uses default spaCy tags.
|
2019-09-08 14:19:15 +02:00 |
Pavle Vidanović
|
60e10a9f93
|
Serbian language improvement (#4169)
* Serbian stopwords added. (Cyrillic alphabet)
* spaCy Contribution agreement included.
* Test initialize updated
* Serbian language code update. --bugfix
* Tokenizer exceptions added. Init file updated.
* Norm exceptions and lexical attributes added.
* Examples added.
* Tests added.
* sr_lang examples update.
* Tokenizer exceptions updated. (Serbian)
|
2019-08-22 11:43:07 +02:00 |
Pavle Vidanović
|
4fe9329bfb
|
Serbian language code update "rs" -> "sr" (#4159)
* Serbian stopwords added. (Cyrillic alphabet)
* spaCy Contribution agreement included.
* Test initialize updated
* Serbian language code update. --bugfix
|
2019-08-21 19:57:37 +02:00 |