spaCy/spacy
Matthew Honnibal e770fade1e * Don't set dependency labels in set_parse, as this may be used by the Entity recogniser instead. Need to clean this method up... 2015-03-26 16:44:47 +01:00
..
en * Use values encoded by StringStore in POS tagging, rather than indices into a list of tags 2015-03-26 16:44:45 +01:00
ner * Resurrect old NER code. This version won't be the one that runs; we want to re-use the parser code. But for now this is a useful reference. 2015-03-26 16:44:41 +01:00
syntax * Add support for debug feature set. Just use unigrams for this. 2015-03-26 16:44:47 +01:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py * Basic punct tests updated and passing 2014-08-27 19:38:57 +02:00
_ml.pxd * Some minor clean-up after HastyModel 2014-12-31 19:46:04 +11:00
_ml.pyx * Make PyPy work 2015-01-05 17:54:38 +11:00
attrs.pxd * Add attrs.pxd 2015-01-26 22:22:09 +11:00
lexeme.pxd * Store the l2 norm of the word's vector 2015-02-07 08:42:16 -05:00
lexeme.pyx * Add a has_repvec property to Lexeme, and a check function to check flags 2015-02-07 08:42:44 -05:00
morphology.pxd * Tmp commit. Refactoring to create a Python Lexeme class. 2015-01-12 10:26:22 +11:00
morphology.pyx * Make PyPy work 2015-01-05 17:54:38 +11:00
orth.pxd * Make PyPy work 2015-01-05 17:54:38 +11:00
orth.pyx * Work on word vectors, and other stuff 2015-01-17 16:21:17 +11:00
parts_of_speech.pxd * Add parts-of-speech file 2015-01-25 22:00:39 +11:00
parts_of_speech.pyx * Add parts_of_speech.pyx 2015-01-25 16:32:26 +11:00
scorer.py * Adjust scorer to account for tokenization mistakes 2015-03-26 16:44:47 +01:00
strings.pxd * Tmp commit. Refactoring to create a Python Lexeme class. 2015-01-12 10:26:22 +11:00
strings.pyx * Add docstring to StringStore 2015-01-24 20:49:15 +11:00
structs.pxd * NER seems to be working, scoring 69 F. Need to add decision-history features --- currently only use current word, 2 words context. Need refactoring. 2015-03-26 16:44:44 +01:00
tokenizer.pxd * Work on word vectors, and other stuff 2015-01-17 16:21:17 +11:00
tokenizer.pyx * Load tag for specials.json token 2015-03-26 16:44:46 +01:00
tokens.pxd * Don't pass label_ids dict to Tokens, since we now use the StringStore to manage string-to-int mapping for labels 2015-03-26 16:44:45 +01:00
tokens.pyx * Don't set dependency labels in set_parse, as this may be used by the Entity recogniser instead. Need to clean this method up... 2015-03-26 16:44:47 +01:00
typedefs.pxd * Move POS tag definitions to parts_of_speech.pxd 2015-01-25 16:31:07 +11:00
typedefs.pyx * Move POS tag definitions to parts_of_speech.pxd 2015-01-25 16:31:07 +11:00
util.py * Make PyPy work 2015-01-05 17:54:38 +11:00
vocab.pxd * Tmp. Working on refactor. Compiles, must hook up lexical feats. 2015-01-14 00:03:48 +11:00
vocab.pyx * Fill L2 norm attribute on LexemeC struct 2015-02-07 08:44:42 -05:00