spaCy/spacy
Jani Monoses 0e08e49e87 Lemmatizer ro (#2319)
* Add Romanian lemmatizer lookup table.

Adapted from http://www.lexiconista.com/datasets/lemmatization/
by replacing the cedilla letters (ş and ţ) with their comma-below forms (ș and ț).

The original dataset is licensed under the Open Database License.

* Fix one blatant issue in the Romanian lemmatizer

* Romanian examples file

* Add ro_tokenizer in conftest

* Add Romanian lemmatizer test
2018-05-12 15:20:04 +02:00
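
The cedilla-to-comma adaptation described in the commit message can be sketched as below. This is a minimal illustration, not code from the spaCy repository: the names `CEDILLA_TO_COMMA`, `normalize_romanian`, `lookup_lemma` and `LOOKUP` are hypothetical, and the single table entry is toy data rather than a row from the lexiconista dataset.

```python
# Hypothetical sketch: map the legacy cedilla code points to the
# comma-below code points used in standard Romanian orthography.
CEDILLA_TO_COMMA = str.maketrans({
    "\u015f": "\u0219",  # ş -> ș
    "\u015e": "\u0218",  # Ş -> Ș
    "\u0163": "\u021b",  # ţ -> ț
    "\u0162": "\u021a",  # Ţ -> Ț
})


def normalize_romanian(text):
    """Return text with cedilla s/t replaced by comma-below s/t."""
    return text.translate(CEDILLA_TO_COMMA)


def lookup_lemma(word, table):
    """Look the word up in the table; fall back to the word itself."""
    return table.get(word, word)


# Toy source entry written with cedillas (assumed, for illustration only).
RAW_TABLE = {"\u015ftiin\u0163e": "\u015ftiin\u0163\u0103"}

# Normalize both keys and values so lookups work on comma-below text.
LOOKUP = {
    normalize_romanian(form): normalize_romanian(lemma)
    for form, lemma in RAW_TABLE.items()
}
```

With this table, `lookup_lemma(normalize_romanian(word), LOOKUP)` resolves a form whichever encoding the input used, and words missing from the table are returned unchanged.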
cli Fix formatting and consistency 2018-05-07 23:02:11 +02:00
data
displacy Add collapse_phrases option to displacy (closes #2266) 2018-04-28 23:06:50 +02:00
lang
syntax Fix loading of models when custom vectors are added 2018-04-10 22:19:20 +02:00
tests Lemmatizer ro (#2319) 2018-05-12 15:20:04 +02:00
tokens Test and fix for Issue #2219 (#2272) 2018-05-03 18:40:46 +02:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py
__main__.py
_ml.py
about.py
attrs.pxd
attrs.pyx
compat.py
errors.py
glossary.py
gold.pxd
gold.pyx rename SP to _SP (#2289) 2018-05-03 18:33:49 +02:00
language.py Fix vector-name loading fix 2018-04-04 01:31:25 +02:00
lemmatizer.py
lexeme.pxd
lexeme.pyx
matcher.pyx
morphology.pxd fix typo/missing here too 2018-02-18 14:38:27 +00:00
morphology.pyx
parts_of_speech.pxd Add support for Universal Dependencies v2.0 2017-03-03 13:17:34 +01:00
parts_of_speech.pyx
pipeline.pxd
pipeline.pyx
scorer.py
strings.pxd Try to fix StringStore clean up (see #1506) 2017-11-11 03:11:27 +03:00
strings.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
structs.pxd Make TokenC.sent_start an int, to allow ternary value 2017-10-08 19:58:54 +02:00
symbols.pxd
symbols.pyx
tokenizer.pxd
tokenizer.pyx
typedefs.pxd
typedefs.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
util.py 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
vectors.pyx 💫 New system for error messages and warnings (#2163) 2018-04-03 15:50:31 +02:00
vocab.pxd
vocab.pyx