spaCy

History

Matthew Honnibal f8de403483 * Work on pickling Vocab instances. The current implementation is not correct, but it may serve to see whether this approach is workable. Pickling is necessary to address Issue #125		2015-10-13 13:44:41 +11:00
..
de	* Add rule to ensure ordinals are preserved as single tokens	2015-09-22 12:26:05 +10:00
en	* Increment data version	2015-10-09 13:26:17 +02:00
fi	* More work on language-generic parsing	2015-08-28 02:02:33 +02:00
it	* Delete extra wordnets	2015-09-13 10:31:37 +10:00
munge	* Fix Python3 problem in align_raw	2015-07-28 16:06:53 +02:00
serialize	…
syntax	* Fix Issue #131 : Force whitespace characters to attach syntactically to previous token, and ensure they cannot serve as stand-alone 'sentence' units.	2015-10-10 15:53:30 +11:00
tokens	…
__init__.pxd	…
__init__.py	…
_ml.pxd	…
_ml.pyx	…
_nn.py	…
_nn.pyx	…
_theano.pxd	* Merge changes, and adjust Example to use memoryview	2015-06-28 11:36:11 +02:00
_theano.pyx	* Begin reorganizing neuralnet work	2015-06-30 14:26:32 +02:00
attrs.pxd	…
attrs.pyx	…
cfile.pxd	…
cfile.pyx	* Fix CFile for Python2	2015-07-25 22:55:53 +02:00
gold.pxd	…
gold.pyx	* Use io module insteads of deprecated codecs module	2015-10-10 14:13:01 +11:00
language.py	…
lemmatizer.py	…
lexeme.pxd	…
lexeme.pyx	…
matcher.pyx	* Refactor symbols, so that frequency rank can be derived from the orth id of a word.	2015-10-13 13:44:39 +11:00
morphology.pxd	…
morphology.pyx	…
multi_words.py	…
orth.pxd	…
orth.pyx	* Fix type declaration in asciied function	2015-10-09 13:46:57 +11:00
parts_of_speech.pxd	…
parts_of_speech.pyx	…
scorer.py	…
senses.pxd	* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.	2015-07-01 20:12:13 +02:00
senses.pyx	…
strings.pxd	…
strings.pyx	* Allow StringStore to be pickled, to start addressing Issue #125	2015-10-13 13:44:41 +11:00
structs.pxd	…
symbols.pxd	…
symbols.pyx	…
tagger.pxd	…
tagger.pyx	* More work on language independent parsing	2015-08-28 03:44:54 +02:00
tokenizer.pxd	…
tokenizer.pyx	…
typedefs.pxd	* Fix type declarations for attr_t. Remove unused id_t.	2015-07-18 22:39:57 +02:00
typedefs.pyx	…
util.py	Merge pull request #126 from tomtung/master	2015-10-10 14:14:57 +11:00
vocab.pxd	* Start trying to pickle Vocab	2015-10-13 13:44:41 +11:00
vocab.pyx	* Work on pickling Vocab instances. The current implementation is not correct, but it may serve to see whether this approach is workable. Pickling is necessary to address Issue #125	2015-10-13 13:44:41 +11:00