spaCy/spacy/lang/fr
Paul O'Leary McCann fd759a881b
Fix inconsistent lemmas (#9405)
* Add util function to unique lists and preserve order

* Use unique function instead of list(set())

list(set()) has the issue that it's not consistent between runs of the
Python interpreter, so order can vary.

list(set()) calls were left in a few places where they were behind calls
to sorted(). I think in this case the calls to list() can be removed,
but this commit doesn't do that.

* Use the existing pattern for this
2021-10-11 11:38:45 +02:00
..
__init__.py Implement overwrite param for all custom lemmatizers (#6794) 2021-01-26 14:53:43 +11:00
_tokenizer_exceptions_list.py Merge branch 'develop' into master-tmp 2020-06-20 15:52:00 +02:00
examples.py Tidy up and auto-format 2020-02-18 15:38:18 +01:00
lemmatizer.py Fix inconsistent lemmas (#9405) 2021-10-11 11:38:45 +02:00
lex_attrs.py Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
punctuation.py Merge branch 'master' into tmp/sync 2020-03-26 13:38:14 +01:00
stop_words.py Improvements to French stopwords list (#7941) 2021-06-02 11:50:49 +02:00
syntax_iterators.py Refactor Docs.is_ flags (#6044) 2020-09-17 00:14:01 +02:00
tokenizer_exceptions.py Fix verbs list in lang/fr/tokenizer_exceptions.py (#9033) 2021-08-25 15:55:09 +02:00