spaCy

History

Daniel Vasic 20d72de986 Added Multext-East V5 tagset for Croatian language (#6248 ) * Added Multext-East V5 tagset for Croatian language * Create danielvasic.md * Update danielvasic.md * Update danielvasic.md * Add tag map to CroatianDefaults Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>		2020-11-05 12:19:22 +01:00
..
af	…
ar	…
bg	…
bn	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
ca	…
cs	Add tag map to cs language (#6284 )	2020-11-05 10:13:11 +01:00
da	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
de	Fix overlapping German noun chunks (#6112 )	2020-09-22 21:52:42 +02:00
el	span / noun chunk has +1 because end is exclusive	2020-05-21 19:56:56 +02:00
en	add oprd to the list of accepted deps for noun chunking (#6302 )	2020-11-05 09:17:35 +01:00
es	Fix span boundary handling in Spanish noun_chunks (#5860 )	2020-08-03 13:53:15 +02:00
et	…
eu	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
fa	span / noun chunk has +1 because end is exclusive	2020-05-21 19:56:56 +02:00
fi	…
fr	Remove is_base_form from French lemmatizer (#5733 )	2020-07-09 22:11:13 +02:00
ga	…
gu	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
he	Hebrew like num (#5952 )	2020-08-24 14:30:05 +02:00
hi	Hindi: Adds tests for lexical attributes (norm and like_num) (#5829 )	2020-10-07 10:23:32 +02:00
hr	Added Multext-East V5 tagset for Croatian language (#6248 )	2020-11-05 12:19:22 +01:00
hu	…
hy	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
id	Update Indonesian Example Phrases (#6124 )	2020-09-23 14:02:26 +02:00
is	…
it	…
ja	fix ja leading spaces (#5969 )	2020-08-25 14:16:24 +02:00
kn	…
ko	fix bug in Korean language, resulting in 100x speedup by reducing overhead of mecab (#5701 )	2020-07-06 17:03:33 +02:00
lb	Reduce stored lexemes data, move feats to lookups (#5238 )	2020-05-19 15:59:14 +02:00
lij	…
lt	…
lv	…
mk	Include Macedonian language (#6230 )	2020-10-15 15:55:01 +02:00
ml	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
mr	…
nb	span / noun chunk has +1 because end is exclusive	2020-05-21 19:56:56 +02:00
ne	Add Nepali Language (#5622 )	2020-06-22 10:25:46 +02:00
nl	…
pl	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
pt	Reduce stored lexemes data, move feats to lookups (#5238 )	2020-05-19 15:59:14 +02:00
ro	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
ru	Update invalid tag maps (#5796 )	2020-07-22 16:02:51 +02:00
sa	Added support for Sanskrit language (#5956 )	2020-08-25 10:56:29 +02:00
si	…
sk	…
sl	…
sq	…
sr	Reduce stored lexemes data, move feats to lookups (#5238 )	2020-05-19 15:59:14 +02:00
sv	Update morph_rules.py (#6102 )	2020-10-06 15:14:47 +02:00
ta	Update spacy/lang/ta/examples.py	2020-10-13 11:03:35 +02:00
te	…
th	Add Thai tag map (LST20 Corpus) (#6163 )	2020-10-07 11:12:01 +02:00
tl	…
tr	Turkish tokenization improvements (#6268 )	2020-10-29 09:43:17 +01:00
tt	…
uk	…
ur	Tidy up and auto-format	2020-05-21 14:14:01 +02:00
vi	…
xx	…
yo	…
zh	Update pkuseg version (#5774 )	2020-07-19 11:09:49 +02:00
__init__.py	…
char_classes.py	Include Macedonian language (#6230 )	2020-10-15 15:55:01 +02:00
lex_attrs.py	Hebrew like num (#5952 )	2020-08-24 14:30:05 +02:00
norm_exceptions.py	…
punctuation.py	…
tag_map.py	…
tokenizer_exceptions.py	Fix raw strings in URL pattern (#5972 )	2020-08-26 04:00:49 +02:00