spaCy

Jani Monoses 0e08e49e87 Lemmatizer ro (#2319)	2018-05-12 15:20:04 +02:00

* Add Romanian lemmatizer lookup table.

  Adapted from http://www.lexiconista.com/datasets/lemmatization/
  by replacing cedillas with commas (ș and ț).

  The original dataset is licensed under the Open Database License.

* Fix one blatant issue in the Romanian lemmatizer

* Romanian examples file

* Add ro_tokenizer in conftest

* Add Romanian lemmatizer test
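
The commit message says the lookup table was adapted by replacing cedilla letters with their comma-below forms, but it does not show how. A minimal sketch of one way to do that substitution, assuming a plain code-point mapping is enough; the function name `cedilla_to_comma` is hypothetical and is not taken from the commit:

```python
# Hypothetical helper, not part of the spaCy commit: map the legacy
# cedilla letters to the comma-below letters used in standard Romanian.
CEDILLA_TO_COMMA = str.maketrans({
    "\u015f": "\u0219",  # ş (s with cedilla) -> ș (s with comma below)
    "\u0163": "\u021b",  # ţ (t with cedilla) -> ț (t with comma below)
    "\u015e": "\u0218",  # Ş -> Ș
    "\u0162": "\u021a",  # Ţ -> Ț
})


def cedilla_to_comma(text: str) -> str:
    """Return text with cedilla s/t replaced by comma-below s/t."""
    return text.translate(CEDILLA_TO_COMMA)


if __name__ == "__main__":
    # Input is written with escapes so the cedilla forms are unambiguous.
    print(cedilla_to_comma("\u015ftiin\u0163\u0103"))
```

Applying such a mapping to both the keys and the values of a lookup table would keep inflected forms and lemmas consistent with each other.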
__init__.py	Lemmatizer ro (#2319)	2018-05-12 15:20:04 +02:00
test_lemmatizer.py	Lemmatizer ro (#2319)	2018-05-12 15:20:04 +02:00
