..
af
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
am
…
ar
…
bg
Handle Cyrillic combining diacritics ( #10837 )
2022-06-28 15:35:32 +02:00
bn
…
ca
Update Catalan tokenizer ( #9297 )
2021-09-27 14:42:30 +02:00
cs
…
da
…
de
…
dsb
Add Lower Sorbian support. ( #10431 )
2022-03-07 16:57:14 +01:00
el
…
en
Remove English exceptions with mismatched features ( #10873 )
2022-06-03 09:44:04 +02:00
es
Migrate regression tests into the main test suite ( #9655 )
2021-12-04 20:34:48 +01:00
et
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
eu
…
fa
…
fi
Auto-format code with black ( #10333 )
2022-02-21 09:15:42 +01:00
fr
Revert "Bump sudachipy version ( #9917 )" ( #10071 )
2022-01-17 10:38:37 +01:00
ga
…
grc
add punctuation to grc ( #11426 )
2022-09-27 11:38:56 +02:00
gu
…
he
…
hi
Migrate regression tests into the main test suite ( #9655 )
2021-12-04 20:34:48 +01:00
hr
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
hsb
Add Upper Sorbian support. ( #10432 )
2022-03-07 16:20:39 +01:00
hu
🏷 Add Mypy check to CI and ignore all existing Mypy errors ( #9167 )
2021-10-14 15:21:40 +02:00
hy
…
id
…
is
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
it
Revert "Bump sudachipy version ( #9917 )" ( #10071 )
2022-01-17 10:38:37 +01:00
ja
Migrate regression tests into the main test suite ( #9655 )
2021-12-04 20:34:48 +01:00
ko
Handle unknown tags in KoreanTokenizer tag map ( #10536 )
2022-03-24 11:25:36 +01:00
ky
Update Cython string types ( #9143 )
2021-09-13 17:02:17 +02:00
la
Update LatinDefaults for lang 'la' ( #12538 )
2023-04-20 16:55:40 +02:00
lb
…
lg
luganda language extension ( #10847 )
2022-08-23 13:09:36 +02:00
lt
…
lv
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
mk
…
ml
…
nb
…
ne
…
nl
Fix Dutch noun chunks to skip overlapping spans ( #11275 )
2022-08-10 09:49:08 +02:00
pl
…
pt
Portuguese noun chunks review ( #9559 )
2021-11-04 23:55:49 +01:00
ro
…
ru
Update Russian and Ukrainian lemmatizers ( #11811 )
2022-11-25 11:12:46 +01:00
sa
…
sk
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
sl
Updates to Slovenian language ( #11162 )
2022-08-05 10:10:18 +02:00
sq
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
sr
Revert "Use Latin normalization for Serbian attrs ( #12608 )" ( #12621 )
2023-05-11 11:54:16 +02:00
sv
Bugfix/swedish tokenizer ( #12315 )
2023-02-27 10:53:45 +01:00
ta
Basic tests for the Tamil language ( #10629 )
2022-04-07 14:47:37 +02:00
th
Update custom tokenizer APIs and pickling ( #8972 )
2021-08-19 14:37:47 +02:00
ti
Update Tigrinya ትግርኛ language support ( #8900 )
2021-08-10 13:55:08 +02:00
tl
Add initial Tagalog (tl) tests ( #9582 )
2021-11-02 08:35:49 +01:00
tr
removing print statements from the test suite ( #10712 )
2022-04-27 09:14:25 +02:00
tt
…
uk
Update Russian and Ukrainian lemmatizers ( #11811 )
2022-11-25 11:12:46 +01:00
ur
…
vi
Update custom tokenizer APIs and pickling ( #8972 )
2021-08-19 14:37:47 +02:00
xx
New tests for a number of alpha languages ( #9703 )
2021-11-28 21:59:23 +01:00
yo
…
zh
…
__init__.py
…
test_attrs.py
Intify IOB ( #9738 )
2022-01-20 13:19:38 +01:00
test_initialize.py
Fix Azerbaijani init, extend lang init tests ( #8656 )
2021-07-09 15:36:35 +02:00
test_lemmatizers.py
Update Catalan language data ( #8308 )
2021-06-11 10:21:22 +02:00