spaCy

History

Matthew Honnibal e770fade1e * Don't set dependency labels in set_parse, as this may be used by the Entity recogniser instead. Need to clean this method up...		2015-03-26 16:44:47 +01:00
..
en	* Use values encoded by StringStore in POS tagging, rather than indices into a list of tags	2015-03-26 16:44:45 +01:00
ner	* Resurrect old NER code. This version won't be the one that runs; we want to re-use the parser code. But for now this is a useful reference.	2015-03-26 16:44:41 +01:00
syntax	* Add support for debug feature set. Just use unigrams for this.	2015-03-26 16:44:47 +01:00
__init__.pxd	* Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags.	2014-10-24 02:23:42 +11:00
__init__.py	* Basic punct tests updated and passing	2014-08-27 19:38:57 +02:00
_ml.pxd	* Some minor clean-up after HastyModel	2014-12-31 19:46:04 +11:00
_ml.pyx	* Make PyPy work	2015-01-05 17:54:38 +11:00
attrs.pxd	* Add attrs.pxd	2015-01-26 22:22:09 +11:00
lexeme.pxd	* Store the l2 norm of the word's vector	2015-02-07 08:42:16 -05:00
lexeme.pyx	* Add a has_repvec property to Lexeme, and a check function to check flags	2015-02-07 08:42:44 -05:00
morphology.pxd	* Tmp commit. Refactoring to create a Python Lexeme class.	2015-01-12 10:26:22 +11:00
morphology.pyx	* Make PyPy work	2015-01-05 17:54:38 +11:00
orth.pxd	* Make PyPy work	2015-01-05 17:54:38 +11:00
orth.pyx	* Work on word vectors, and other stuff	2015-01-17 16:21:17 +11:00
parts_of_speech.pxd	* Add parts-of-speech file	2015-01-25 22:00:39 +11:00
parts_of_speech.pyx	* Add parts_of_speech.pyx	2015-01-25 16:32:26 +11:00
scorer.py	* Adjust scorer to account for tokenization mistakes	2015-03-26 16:44:47 +01:00
strings.pxd	* Tmp commit. Refactoring to create a Python Lexeme class.	2015-01-12 10:26:22 +11:00
strings.pyx	* Add docstring to StringStore	2015-01-24 20:49:15 +11:00
structs.pxd	* NER seems to be working, scoring 69 F. Need to add decision-history features --- currently only use current word, 2 words context. Need refactoring.	2015-03-26 16:44:44 +01:00
tokenizer.pxd	* Work on word vectors, and other stuff	2015-01-17 16:21:17 +11:00
tokenizer.pyx	* Load tag for specials.json token	2015-03-26 16:44:46 +01:00
tokens.pxd	* Don't pass label_ids dict to Tokens, since we now use the StringStore to manage string-to-int mapping for labels	2015-03-26 16:44:45 +01:00
tokens.pyx	* Don't set dependency labels in set_parse, as this may be used by the Entity recogniser instead. Need to clean this method up...	2015-03-26 16:44:47 +01:00
typedefs.pxd	* Move POS tag definitions to parts_of_speech.pxd	2015-01-25 16:31:07 +11:00
typedefs.pyx	* Move POS tag definitions to parts_of_speech.pxd	2015-01-25 16:31:07 +11:00
util.py	* Make PyPy work	2015-01-05 17:54:38 +11:00
vocab.pxd	* Tmp. Working on refactor. Compiles, must hook up lexical feats.	2015-01-14 00:03:48 +11:00
vocab.pyx	* Fill L2 norm attribute on LexemeC struct	2015-02-07 08:44:42 -05:00