mirror of https://github.com/explosion/spaCy.git
Reorganise English tokenizer exceptions (as discussed in #718)
Add logic to generate exceptions that follow a consistent pattern (like verbs and pronouns) and allow certain tokens to be excluded explicitly.
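To illustrate the idea, here is a minimal sketch of pattern-based exception generation with an explicit exclusion list. It is not the code from this commit: the names `PRONOUNS`, `SUFFIXES`, `EXCLUDE` and `generate_exceptions` are hypothetical, and the plain `"ORTH"` string keys are an assumption standing in for whatever attribute IDs the tokenizer actually uses.

```python
# Hypothetical sketch: names and data are illustrative, not spaCy's actual code.
PRONOUNS = ["i", "you", "he", "she", "it", "we", "they"]
SUFFIXES = ["'ll", "'d"]

# Apostrophe-less forms that collide with ordinary words and must not be split.
EXCLUDE = {"ill", "well", "hell", "shell", "id", "wed", "shed"}


def generate_exceptions(pronouns, suffixes, exclude):
    """Build {surface string: list of token dicts} for pronoun contractions."""
    exceptions = {}
    for pronoun in pronouns:
        for suffix in suffixes:
            bare = suffix.replace("'", "")  # variant typed without the apostrophe
            for head in (pronoun, pronoun.title()):  # lower-case and capitalised
                for tail in (suffix, bare):
                    text = head + tail
                    if text.lower() in exclude:
                        continue  # explicitly excluded token, leave it whole
                    exceptions[text] = [{"ORTH": head}, {"ORTH": tail}]
    return exceptions


exceptions = generate_exceptions(PRONOUNS, SUFFIXES, EXCLUDE)
```

With these inputs, `she'll` is split into `she` and `'ll`, while `shell` and `wed` get no entry because they appear in `EXCLUDE`.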
parent fb9d3bb022
commit 35b39f53c3