spaCy

Matthew Honnibal 8d0f1d98da * Draft dockstring for HuffmanCache		2015-07-13 12:01:18 +02:00
en	* Ensure unseen words are given low log probability	2015-07-12 01:31:09 +02:00
munge	* Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers.	2015-06-18 16:35:27 +02:00
ner	…
syntax	* Improve efficiency of L and R features, correcting the non-linear-in-length problem.	2015-07-09 12:17:26 +02:00
__init__.pxd	…
__init__.py	…
_ml.pxd	* Rename Tokens to Doc	2015-07-08 18:53:00 +02:00
_ml.pyx	* Add set_scores method to Model	2015-06-02 18:37:10 +02:00
attrs.pxd	…
gold.pxd	* Have oracle functions take a struct instead of a Python object	2015-06-02 20:01:06 +02:00
gold.pyx	* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity	2015-07-09 13:30:41 +02:00
lexeme.pxd	* Remove has_sense method from Lexeme declaration	2015-07-08 19:41:20 +02:00
lexeme.pyx	* Remove has_sense method from Lexeme	2015-07-08 19:28:29 +02:00
morphology.pxd	…
morphology.pyx	…
multi_words.py	…
orth.pxd	…
orth.pyx	…
parts_of_speech.pxd	* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity	2015-07-09 13:30:41 +02:00
parts_of_speech.pyx	* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity	2015-07-09 13:30:41 +02:00
scorer.py	* Start scoring tokens	2015-06-28 06:21:38 +02:00
senses.pxd	* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.	2015-07-01 20:12:13 +02:00
senses.pyx	* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.	2015-07-01 20:12:13 +02:00
serialize.pyx	* Draft dockstring for HuffmanCache	2015-07-13 12:01:18 +02:00
spans.pxd	* Rename Tokens to Doc	2015-07-08 18:53:00 +02:00
spans.pyx	* Rename span.right to span.rights	2015-07-11 22:15:04 +02:00
strings.pxd	…
strings.pyx	* Add __len__ function to StringStore	2015-06-23 00:02:50 +02:00
structs.pxd	* Remove the senses attr from LexemeC, to keep data compatibility	2015-07-08 19:24:44 +02:00
tokenizer.pxd	* Rename Tokens to Doc	2015-07-08 18:53:00 +02:00
tokenizer.pyx	* Rename Tokens to Doc	2015-07-08 18:53:00 +02:00
tokens.pxd	* Rename Tokens to Doc	2015-07-08 18:53:00 +02:00
tokens.pyx	* Allow slice indexing in Doc.__getitem__, returning a Span object	2015-07-09 15:15:32 +02:00
typedefs.pxd	…
typedefs.pyx	…
util.py	…
vocab.pxd	* Add pos_tags attr to Vocab.	2015-07-08 12:36:38 +02:00
vocab.pyx	* Add pos_tags attr to Vocab.	2015-07-08 12:36:38 +02:00