spaCy/spacy
Matthew Honnibal 7b66ec896a Revert "Revert "Improve parser oracle around sentence breaks.""
This reverts commit 36e481c584.
2018-02-26 10:57:37 +01:00
..
cli Fix CLI for multitask objectives 2018-02-18 10:59:11 +01:00
data Make spacy/data a package 2017-03-18 20:04:22 +01:00
displacy Don't use deprecated Doc.merge call in displaCy 2018-01-27 11:25:05 +01:00
lang Add option to not use Janome for Japanese tokenization 2018-02-26 09:39:46 +01:00
syntax Revert "Revert "Improve parser oracle around sentence breaks."" 2018-02-26 10:57:37 +01:00
tests Update alignment tests 2018-02-24 16:03:58 +01:00
tokens Add doc.is_sentenced attribute, re #1959 2018-02-18 14:16:55 +01:00
__init__.pxd
__init__.py Remove dummy variable from function calls 2018-01-05 09:37:05 +01:00
__main__.py Don't pass CLI command name as dummy argument 2018-01-04 21:33:47 +01:00
_align.pyx Fix many-to-one alignment 2018-02-24 16:03:50 +01:00
_matcher2_notes.py Update notes on matcher2 2018-02-13 11:45:45 +01:00
_ml.py Create a preprocess function that gets bigrams 2017-11-12 00:43:41 +01:00
about.py Set version to 2.1.0.dev1 2018-02-23 16:22:24 +01:00
attrs.pxd Fix LANG symbol 2018-02-17 18:10:50 +01:00
attrs.pyx missing PrepCase attribute 2018-02-18 14:46:12 +00:00
compat.py Drop six and related hacks as a dependency 2018-02-18 13:29:56 +01:00
glossary.py Fix typo in glossary (resolves #1964) 2018-02-10 11:58:41 +01:00
gold.pxd Add support for sent_start to GoldParse 2017-08-25 20:03:14 -05:00
gold.pyx Fix alignment for words with spaces 2018-02-25 14:55:00 +01:00
language.py Fix issue #1959 2018-02-15 12:51:49 -08:00
lemmatizer.py Don't lower-case lemmas of proper nouns 2018-02-21 16:01:16 +01:00
lexeme.pxd WIP on stringstore change. 27 failures 2017-05-28 14:06:40 +02:00
lexeme.pyx added new lexical feat to lexeme 2018-02-11 18:51:48 +01:00
matcher.pyx Move cython declarations in matcher.pyx 2018-02-24 10:32:18 +01:00
morphology.pxd fix typo/missing here too 2018-02-18 14:38:27 +00:00
morphology.pyx fix typo/missing here too 2018-02-18 14:38:27 +00:00
parts_of_speech.pxd Add support for Universal Dependencies v2.0 2017-03-03 13:17:34 +01:00
parts_of_speech.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
pipeline.pxd Fix names of pipeline components 2017-10-26 12:38:23 +02:00
pipeline.pyx Fix bug in multi-task objective 2018-02-23 23:48:09 +01:00
scorer.py Fix scoring of tokenization for punct 2018-02-24 10:32:32 +01:00
strings.pxd Try to fix StringStore clean up (see #1506) 2017-11-11 03:11:27 +03:00
strings.pyx Use safer method to get string without hit 2017-11-14 22:58:46 +03:00
structs.pxd Make TokenC.sent_tart an int, to allow ternary value 2017-10-08 19:58:54 +02:00
symbols.pxd Fix inconsistencies in the symbols table 2018-02-18 13:51:31 +01:00
symbols.pyx Fix inconsistencies in the symbols table 2018-02-18 13:51:31 +01:00
tokenizer.pxd Disable tokenizer cache for special-cases. Fixes #1250 2017-10-24 16:08:05 +02:00
tokenizer.pyx Merge pull request #1611 from fsonntag/master 2017-11-29 23:11:23 +01:00
typedefs.pxd Work on changing StringStore to return hashes. 2017-05-28 12:36:27 +02:00
typedefs.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
util.py Add contributor agreement for emulbreh 2018-02-13 13:40:33 +01:00
vectors.pyx Ensure files opened in `from_disk` are closed 2018-02-13 20:49:43 +01:00
vocab.pxd Add Vocab.cfg attr, to hold stuff like oov probs 2017-10-30 16:08:50 +01:00
vocab.pyx Make Vocab.__contains__ work with ints. Fixes #1868 2018-01-23 23:26:47 +01:00