spaCy/spacy
Matthew Honnibal 8d0f1d98da * Draft dockstring for HuffmanCache 2015-07-13 12:01:18 +02:00
..
en * Ensure unseen words are given low log probability 2015-07-12 01:31:09 +02:00
munge * Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers. 2015-06-18 16:35:27 +02:00
ner
syntax * Improve efficiency of L and R features, correcting the non-linear-in-length problem. 2015-07-09 12:17:26 +02:00
__init__.pxd
__init__.py
_ml.pxd * Rename Tokens to Doc 2015-07-08 18:53:00 +02:00
_ml.pyx * Add set_scores method to Model 2015-06-02 18:37:10 +02:00
attrs.pxd
gold.pxd * Have oracle functions take a struct instead of a Python object 2015-06-02 20:01:06 +02:00
gold.pyx * Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity 2015-07-09 13:30:41 +02:00
lexeme.pxd * Remove has_sense method from Lexeme declaration 2015-07-08 19:41:20 +02:00
lexeme.pyx * Remove has_sense method from Lexeme 2015-07-08 19:28:29 +02:00
morphology.pxd
morphology.pyx
multi_words.py
orth.pxd
orth.pyx
parts_of_speech.pxd * Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity 2015-07-09 13:30:41 +02:00
parts_of_speech.pyx * Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity 2015-07-09 13:30:41 +02:00
scorer.py * Start scoring tokens 2015-06-28 06:21:38 +02:00
senses.pxd * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. 2015-07-01 20:12:13 +02:00
senses.pyx * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. 2015-07-01 20:12:13 +02:00
serialize.pyx * Draft dockstring for HuffmanCache 2015-07-13 12:01:18 +02:00
spans.pxd * Rename Tokens to Doc 2015-07-08 18:53:00 +02:00
spans.pyx * Rename span.right to span.rights 2015-07-11 22:15:04 +02:00
strings.pxd
strings.pyx * Add __len__ function to StringStore 2015-06-23 00:02:50 +02:00
structs.pxd * Remove the senses attr from LexemeC, to keep data compatibility 2015-07-08 19:24:44 +02:00
tokenizer.pxd * Rename Tokens to Doc 2015-07-08 18:53:00 +02:00
tokenizer.pyx * Rename Tokens to Doc 2015-07-08 18:53:00 +02:00
tokens.pxd * Rename Tokens to Doc 2015-07-08 18:53:00 +02:00
tokens.pyx * Allow slice indexing in Doc.__getitem__, returning a Span object 2015-07-09 15:15:32 +02:00
typedefs.pxd
typedefs.pyx
util.py
vocab.pxd * Add pos_tags attr to Vocab. 2015-07-08 12:36:38 +02:00
vocab.pyx * Add pos_tags attr to Vocab. 2015-07-08 12:36:38 +02:00