.. |
bn
|
Add symbols class to punctuation rules to handle emoji (see #1088)
|
2017-05-27 17:57:10 +02:00 |
da
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
de
|
Don't split hyphenated words in German
|
2017-09-16 20:40:15 +02:00 |
en
|
Improve lemmatization rule for am|VBP
|
2017-09-04 15:18:10 +02:00 |
es
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
fi
|
Add norm exceptions to other Language classes
|
2017-06-03 22:29:21 +02:00 |
fr
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
he
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
hu
|
Merge pull request #1101 from oroszgy/hu_tokenizer_fix
|
2017-06-04 22:37:51 +02:00 |
id
|
reworked
|
2017-08-20 13:43:21 +07:00 |
it
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
ja
|
…
|
|
nb
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
nl
|
Add norm exceptions to other Language classes
|
2017-06-03 22:29:21 +02:00 |
pl
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
pt
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
sv
|
Add language example sentences (see #1107)
|
2017-08-19 12:22:29 +02:00 |
th
|
fix newline in file
|
2017-09-26 21:49:43 +07:00 |
xx
|
Add norm exceptions to other Language classes
|
2017-06-03 22:29:21 +02:00 |
zh
|
Port over change from #1323 and tidy up
|
2017-09-14 19:23:13 +02:00 |
__init__.py
|
Remove imports in /lang/__init__.py
|
2017-05-08 23:58:07 +02:00 |
char_classes.py
|
Split % in units, for compatibility with corpus
|
2017-08-25 20:03:37 -05:00 |
entity_rules.py
|
Reorganise entity rules
|
2017-05-09 01:37:10 +02:00 |
lex_attrs.py
|
Fix PROB attr for OOV words
|
2017-08-23 12:11:52 +02:00 |
norm_exceptions.py
|
Add base norm exceptions
|
2017-06-03 20:27:05 +02:00 |
punctuation.py
|
Add symbols class to punctuation rules to handle emoji (see #1088)
|
2017-05-27 17:57:10 +02:00 |
tag_map.py
|
Fix formatting
|
2017-05-09 11:08:14 +02:00 |
tokenizer_exceptions.py
|
Use consistent regex imports
|
2017-05-09 00:34:31 +02:00 |