spaCy/spacy/de/data/tokenizer/infix.txt

\.\.\.
(?<=[a-z])\.(?=[A-Z])
(?<=[a-zA-Z])-(?=[a-zA-Z])