spaCy/spacy/lang
Daniel Vasic 20d72de986
Added Multext-East V5 tagset for Croatian language (#6248)
* Added Multext-East V5 tagset for Croatian language

* Create danielvasic.md

* Update danielvasic.md

* Update danielvasic.md

* Add tag map to CroatianDefaults

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2020-11-05 12:19:22 +01:00
af
ar
bg
bn Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
ca
cs Add tag map to cs language (#6284) 2020-11-05 10:13:11 +01:00
da Tidy up and auto-format 2020-05-21 14:14:01 +02:00
de Fix overlapping German noun chunks (#6112) 2020-09-22 21:52:42 +02:00
el span / noun chunk has +1 because end is exclusive 2020-05-21 19:56:56 +02:00
en add oprd to the list of accepted deps for noun chunking (#6302) 2020-11-05 09:17:35 +01:00
es Fix span boundary handling in Spanish noun_chunks (#5860) 2020-08-03 13:53:15 +02:00
et
eu Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
fa span / noun chunk has +1 because end is exclusive 2020-05-21 19:56:56 +02:00
fi
fr Remove is_base_form from French lemmatizer (#5733) 2020-07-09 22:11:13 +02:00
ga
gu Tidy up and auto-format 2020-05-21 14:14:01 +02:00
he Hebrew like num (#5952) 2020-08-24 14:30:05 +02:00
hi Hindi: Adds tests for lexical attributes (norm and like_num) (#5829) 2020-10-07 10:23:32 +02:00
hr Added Multext-East V5 tagset for Croatian language (#6248) 2020-11-05 12:19:22 +01:00
hu
hy Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
id Update Indonesian Example Phrases (#6124) 2020-09-23 14:02:26 +02:00
is
it
ja fix ja leading spaces (#5969) 2020-08-25 14:16:24 +02:00
kn
ko fix bug in Korean language, resulting in 100x speedup by reducing overhead of mecab (#5701) 2020-07-06 17:03:33 +02:00
lb Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
lij
lt
lv
mk Include Macedonian language (#6230) 2020-10-15 15:55:01 +02:00
ml Tidy up and auto-format 2020-05-21 14:14:01 +02:00
mr
nb span / noun chunk has +1 because end is exclusive 2020-05-21 19:56:56 +02:00
ne Add Nepali Language (#5622) 2020-06-22 10:25:46 +02:00
nl
pl Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
pt Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
ro Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
ru Update invalid tag maps (#5796) 2020-07-22 16:02:51 +02:00
sa Added support for Sanskrit language (#5956) 2020-08-25 10:56:29 +02:00
si
sk
sl
sq
sr Reduce stored lexemes data, move feats to lookups (#5238) 2020-05-19 15:59:14 +02:00
sv Update morph_rules.py (#6102) 2020-10-06 15:14:47 +02:00
ta Update spacy/lang/ta/examples.py 2020-10-13 11:03:35 +02:00
te
th Add Thai tag map (LST20 Corpus) (#6163) 2020-10-07 11:12:01 +02:00
tl
tr Turkish tokenization improvements (#6268) 2020-10-29 09:43:17 +01:00
tt
uk
ur Tidy up and auto-format 2020-05-21 14:14:01 +02:00
vi
xx
yo
zh Update pkuseg version (#5774) 2020-07-19 11:09:49 +02:00
__init__.py
char_classes.py Include Macedonian language (#6230) 2020-10-15 15:55:01 +02:00
lex_attrs.py Hebrew like num (#5952) 2020-08-24 14:30:05 +02:00
norm_exceptions.py
punctuation.py
tag_map.py
tokenizer_exceptions.py Fix raw strings in URL pattern (#5972) 2020-08-26 04:00:49 +02:00