spaCy

Jim O'Regan 1ee75ae337 Merge remote-tracking branch 'origin/develop' into develop-irish		2017-09-11 08:40:11 +01:00
bn	Add symbols class to punctuation rules to handle emoji (see #1088)	2017-05-27 17:57:10 +02:00
da	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
de	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
en	Improve lemmatization rule for am|VBP	2017-09-04 15:18:10 +02:00
es	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
fi	Add norm exceptions to other Language classes	2017-06-03 22:29:21 +02:00
fr	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
ga	fix in UD submitted; map either way	2017-08-08 19:22:14 +01:00
he	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
hu	Merge pull request #1101 from oroszgy/hu_tokenizer_fix	2017-06-04 22:37:51 +02:00
id	reworked	2017-08-20 13:43:21 +07:00
it	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
ja	Fix relative imports	2017-05-08 22:29:04 +02:00
nb	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
nl	Add norm exceptions to other Language classes	2017-06-03 22:29:21 +02:00
pl	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
pt	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
sv	Add language example sentences (see #1107)	2017-08-19 12:22:29 +02:00
xx	Add norm exceptions to other Language classes	2017-06-03 22:29:21 +02:00
zh	Port over changes from #1168	2017-07-01 11:43:54 +02:00
__init__.py	Remove imports in /lang/__init__.py	2017-05-08 23:58:07 +02:00
char_classes.py	Split % in units, for compatibility with corpus	2017-08-25 20:03:37 -05:00
entity_rules.py	Reorganise entity rules	2017-05-09 01:37:10 +02:00
lex_attrs.py	Fix PROB attr for OOV words	2017-08-23 12:11:52 +02:00
norm_exceptions.py	Add base norm exceptions	2017-06-03 20:27:05 +02:00
punctuation.py	Add symbols class to punctuation rules to handle emoji (see #1088)	2017-05-27 17:57:10 +02:00
tag_map.py	Fix formatting	2017-05-09 11:08:14 +02:00
tokenizer_exceptions.py	Use consistent regex imports	2017-05-09 00:34:31 +02:00