spaCy/spacy
Matthew Honnibal 38ca0c33f5 Merge branch 'neuralnet' into refactor
Mostly refactors parser, to use new thinc3.2 Example class.
Aim is to remove use of shared memory, so that we can parallelize
over documents easily.

Conflicts:
	setup.py
	spacy/syntax/parser.pxd
	spacy/syntax/parser.pyx
	spacy/syntax/stateclass.pyx
2015-07-14 14:13:47 +02:00
..
en * Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx 2015-07-13 20:20:58 +02:00
munge * Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers. 2015-06-18 16:35:27 +02:00
ner Remove trailing whitespace 2015-04-19 13:01:38 -07:00
syntax Merge branch 'neuralnet' into refactor 2015-07-14 14:13:47 +02:00
tokens * Extend count_by method 2015-07-14 03:20:09 +02:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py
_bu_nn.pyx * Merge changes, and adjust Example to use memoryview 2015-06-28 11:36:11 +02:00
_ml.pxd Merge branch 'neuralnet' into refactor 2015-07-14 14:13:47 +02:00
_ml.pyx * Use new Example class 2015-06-28 22:36:03 +02:00
_nn.py * Merge changes, and adjust Example to use memoryview 2015-06-28 11:36:11 +02:00
_nn.pyx * Merge changes, and adjust Example to use memoryview 2015-06-28 11:36:11 +02:00
_theano.pxd * Merge changes, and adjust Example to use memoryview 2015-06-28 11:36:11 +02:00
_theano.pyx * Begin reorganizing neuralnet work 2015-06-30 14:26:32 +02:00
attrs.pxd Remove trailing whitespace 2015-04-19 13:01:38 -07:00
gold.pxd * Have oracle functions take a struct instead of a Python object 2015-06-02 20:01:06 +02:00
gold.pyx * Fix space check in gold.pyx 2015-07-14 00:10:27 +02:00
lexeme.pxd * Remove has_sense method from Lexeme declaration 2015-07-08 19:41:20 +02:00
lexeme.pyx * Remove has_sense method from Lexeme 2015-07-08 19:28:29 +02:00
morphology.pxd * Tmp commit. Refactoring to create a Python Lexeme class. 2015-01-12 10:26:22 +11:00
morphology.pyx * Make PyPy work 2015-01-05 17:54:38 +11:00
multi_words.py * Fix Issue #50: Python 3 compatibility of v0.80 2015-04-13 05:59:43 +02:00
orth.pxd * Make PyPy work 2015-01-05 17:54:38 +11:00
orth.pyx * Work on word vectors, and other stuff 2015-01-17 16:21:17 +11:00
parts_of_speech.pxd * Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity 2015-07-09 13:30:41 +02:00
parts_of_speech.pyx * Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity 2015-07-09 13:30:41 +02:00
scorer.py * Start scoring tokens 2015-06-28 06:21:38 +02:00
senses.pxd * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. 2015-07-01 20:12:13 +02:00
senses.pyx * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. 2015-07-01 20:12:13 +02:00
serialize.pxd * Make .pxd file for huffman codec 2015-07-13 13:54:51 +02:00
serialize.pyx * Round-trip for serialization finally working. Needs a lot of optimization. 2015-07-13 18:39:38 +02:00
strings.pxd * Tmp commit. Refactoring to create a Python Lexeme class. 2015-01-12 10:26:22 +11:00
strings.pyx * Add __len__ function to StringStore 2015-06-23 00:02:50 +02:00
structs.pxd * Add TokenC.spacy attr 2015-07-13 19:48:07 +02:00
tokenizer.pxd * Refactor tokenizer, to set the 'spacy' field on TokenC instead of passing a string 2015-07-13 21:46:02 +02:00
tokenizer.pyx * Fix tokenizer 2015-07-14 00:10:51 +02:00
typedefs.pxd Remove trailing whitespace 2015-04-19 13:01:38 -07:00
typedefs.pyx * Move POS tag definitions to parts_of_speech.pxd 2015-01-25 16:31:07 +11:00
util.py Remove trailing whitespace 2015-04-19 13:01:38 -07:00
vocab.pxd * Add codec property to Vocab, to use the Huffman encoding 2015-07-13 13:55:14 +02:00
vocab.pyx * Add codec property to Vocab, to use the Huffman encoding 2015-07-13 13:55:14 +02:00