spaCy/spacy
Matthew Honnibal d11c1edf8c * Import slice_unicode from strings.pyx 2014-12-20 07:56:26 +11:00
..
ner * Work on beam parser 2014-11-20 19:49:33 +11:00
syntax * Clean up 'guess_cache' idea, which didnt work well enough 2014-12-20 03:49:11 +11:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py * Basic punct tests updated and passing 2014-08-27 19:38:57 +02:00
en.pxd * Start laying out parse tree iteration methods 2014-12-20 01:42:09 +11:00
en.pyx * Remove POS cache for now 2014-12-20 03:49:58 +11:00
index.pxd * Working DecisionMemory class 2014-12-20 01:43:26 +11:00
index.pyx * Working DecisionMemory class 2014-12-20 01:43:26 +11:00
lemmatizer.py * Work on morphological processing 2014-12-08 21:12:15 +11:00
lexeme.pxd * Refactor, move Lexeme struct to structs.pxd 2014-12-20 06:51:03 +11:00
lexeme.pyx * Rearrange POS stuff, so that language-specific stuff can live in language-specific modules 2014-12-07 23:52:41 +11:00
morphology.pxd * Export set_morph_from_dict function 2014-12-20 07:28:27 +11:00
morphology.pyx * Refactor morphology.pyx 2014-12-20 07:27:28 +11:00
orth.py * Improve efficiency of tagger, and improve morphological processing 2014-12-10 01:02:04 +11:00
pos_feats.pxd * Complete refactor of Tagger features, to use a generic list of context names. 2014-11-05 20:45:29 +11:00
pos_feats.pyx * Fix context vector, importing all features 2014-11-05 22:11:39 +11:00
pos_util.py * Work on morphological processing 2014-12-08 21:12:15 +11:00
strings.pxd * Move murmurhash to .pxd in strings file 2014-12-20 07:41:08 +11:00
strings.pyx * Move murmurhash to .pxd in strings file 2014-12-20 07:41:08 +11:00
structs.pxd * Move all struct definitions to structs.pxd, to avoid circular dependencies 2014-12-20 06:51:33 +11:00
tagger.pxd * Working morphology and lemmatisation. POS tagging quite fast. 2014-12-10 08:09:32 +11:00
tagger.pyx * Set const-correctness for tagger 2014-12-18 20:41:52 +11:00
tokenizer.pxd * Move tokenizer to its own file, and refactor 2014-12-20 07:29:16 +11:00
tokenizer.pyx * Move lang.pyx to tokenizer.pyx 2014-12-20 07:55:40 +11:00
tokens.pxd * Update tokens.pyx as part of reorg 2014-12-20 07:03:26 +11:00
tokens.pyx * Update tokens.pyx as part of reorg 2014-12-20 07:03:26 +11:00
typedefs.pxd * Working morphology and lemmatisation. POS tagging quite fast. 2014-12-10 08:09:32 +11:00
utf8string.pxd * Make the string-store serve const pointers to Utf8Str 2014-12-03 16:01:47 +11:00
utf8string.pyx * Make the string-store serve const pointers to Utf8Str 2014-12-03 16:01:47 +11:00
util.py * Make suffixes file use full-power regex, so that we can handle periods properly 2014-12-09 19:04:27 +11:00
vocab.pxd * Import slice_unicode from strings.pyx 2014-12-20 07:56:26 +11:00
vocab.pyx * Import slice_unicode from strings.pyx 2014-12-20 07:56:26 +11:00