Matthew Honnibal
|
bb0b00f819
|
* Repurporse the Tagger class as a generic Model, wrapping thinc's interface
|
2014-12-30 21:20:15 +11:00 |
Matthew Honnibal
|
fe2a5e0370
|
* Work on docstrings
|
2014-12-27 21:46:04 +11:00 |
Matthew Honnibal
|
6352e3e2a2
|
* Work on API reference
|
2014-12-27 18:45:47 +11:00 |
Matthew Honnibal
|
bb80937544
|
* Upd docstrings
|
2014-12-27 18:45:16 +11:00 |
Matthew Honnibal
|
91a5064b7f
|
* Upd tests
|
2014-12-26 14:26:27 +11:00 |
Matthew Honnibal
|
b8b65903fc
|
* Tmp
|
2014-12-24 17:42:00 +11:00 |
Matthew Honnibal
|
75a6930ad9
|
* Fix results table
|
2014-12-24 14:35:32 +11:00 |
Matthew Honnibal
|
a68ecc50fa
|
* Ignore cpp files within en dir
|
2014-12-23 15:19:01 +11:00 |
Matthew Honnibal
|
ab61673edd
|
* Fix api of array method
|
2014-12-23 15:18:48 +11:00 |
Matthew Honnibal
|
ed0ff63c09
|
* Compile attrs and parser in setup
|
2014-12-23 15:18:20 +11:00 |
Matthew Honnibal
|
9dda8b4500
|
* Play with examples in index.rst
|
2014-12-23 15:17:56 +11:00 |
Matthew Honnibal
|
7708d0e24a
|
* Move lemmatizer to en dir
|
2014-12-23 15:16:57 +11:00 |
Matthew Honnibal
|
98eb4c0426
|
* Fix path to parser model
|
2014-12-23 15:09:09 +11:00 |
Matthew Honnibal
|
b00bc01d8c
|
* All tests now passing for reorg
|
2014-12-23 13:18:59 +11:00 |
Matthew Honnibal
|
73f200436f
|
* Tests passing except for morphology/lemmatization stuff
|
2014-12-23 11:40:32 +11:00 |
Matthew Honnibal
|
cf8d26c3d2
|
* POS tagger training working after reorg
|
2014-12-22 08:54:47 +11:00 |
Matthew Honnibal
|
4c4aa2c5c9
|
* Work on train
|
2014-12-22 07:25:43 +11:00 |
Matthew Honnibal
|
4d4d2c0db4
|
* Upd test
|
2014-12-21 21:05:28 +11:00 |
Matthew Honnibal
|
d047dc0d0f
|
Upd lemmatizer test
|
2014-12-21 21:02:44 +11:00 |
Matthew Honnibal
|
b864f0e539
|
* Upd iteration test
|
2014-12-21 21:01:46 +11:00 |
Matthew Honnibal
|
61df50b598
|
* Add English-subclass POS tagger
|
2014-12-21 20:59:07 +11:00 |
Matthew Honnibal
|
c1ab134159
|
* Upd lemmas test
|
2014-12-21 20:58:21 +11:00 |
Matthew Honnibal
|
82bd57c76f
|
* Upd intern test
|
2014-12-21 20:44:21 +11:00 |
Matthew Honnibal
|
734d1da55c
|
* Upd emoticons test
|
2014-12-21 20:43:27 +11:00 |
Matthew Honnibal
|
199025609f
|
* Upd contractions test
|
2014-12-21 20:41:13 +11:00 |
Matthew Honnibal
|
0d9972f4b0
|
* Upd tokenizer test
|
2014-12-21 20:38:27 +11:00 |
Matthew Honnibal
|
69e3a07fa1
|
* More index.rst fiddling
|
2014-12-21 17:40:12 +11:00 |
Matthew Honnibal
|
9f3f07cab6
|
* Add attrs file for English
|
2014-12-21 11:29:11 +11:00 |
Matthew Honnibal
|
2a89d70429
|
* Add vocab.pyx to setup, and ensure we can import spacy.en.lang
|
2014-12-21 06:03:53 +11:00 |
Matthew Honnibal
|
b34a1325d3
|
* Everything compiling after reorg. About to start testing.
|
2014-12-21 05:42:23 +11:00 |
Matthew Honnibal
|
e1c1a4b868
|
* Tmp
|
2014-12-21 05:36:29 +11:00 |
Matthew Honnibal
|
d11c1edf8c
|
* Import slice_unicode from strings.pyx
|
2014-12-20 07:56:26 +11:00 |
Matthew Honnibal
|
be1bdcbd85
|
* Move lang.pyx to tokenizer.pyx
|
2014-12-20 07:55:40 +11:00 |
Matthew Honnibal
|
89a1cc1a48
|
* Move murmurhash to .pxd in strings file
|
2014-12-20 07:41:08 +11:00 |
Matthew Honnibal
|
d5a942c4a4
|
* Rename lang.pyx to tokenizer.pyx
|
2014-12-20 07:30:39 +11:00 |
Matthew Honnibal
|
a60ae261ae
|
* Move tokenizer to its own file, and refactor
|
2014-12-20 07:29:16 +11:00 |
Matthew Honnibal
|
867a4a000c
|
* Export set_morph_from_dict function
|
2014-12-20 07:28:27 +11:00 |
Matthew Honnibal
|
4e30195c6d
|
* Refactor morphology.pyx
|
2014-12-20 07:27:28 +11:00 |
Matthew Honnibal
|
4c6ce7ee84
|
* Update tokens.pyx as part of reorg
|
2014-12-20 07:03:26 +11:00 |
Matthew Honnibal
|
116f7f3bc1
|
* Rename Lexicon to Vocab, and move it to its own file
|
2014-12-20 06:54:03 +11:00 |
Matthew Honnibal
|
780cbd68b1
|
* Move all struct definitions to structs.pxd, to avoid circular dependencies
|
2014-12-20 06:51:33 +11:00 |
Matthew Honnibal
|
f6556d8e5d
|
* Refactor, move Lexeme struct to structs.pxd
|
2014-12-20 06:51:03 +11:00 |
Matthew Honnibal
|
7d48bba6c4
|
* Move StringStore class to its own file
|
2014-12-20 06:42:01 +11:00 |
Matthew Honnibal
|
e15b9da7db
|
* Pin preshed to a particular version
|
2014-12-20 04:01:32 +11:00 |
Matthew Honnibal
|
ed2fff6128
|
* Add tests
|
2014-12-20 03:51:25 +11:00 |
Matthew Honnibal
|
b066102d2d
|
* Remove POS cache for now
|
2014-12-20 03:49:58 +11:00 |
Matthew Honnibal
|
ff252dd535
|
* Clean up 'guess_cache' idea, which didnt work well enough
|
2014-12-20 03:49:11 +11:00 |
Matthew Honnibal
|
9d3ca13909
|
* Start work on parse-tree iteration classes
|
2014-12-20 03:48:10 +11:00 |
Matthew Honnibal
|
bed680c632
|
* Remove commented-out features
|
2014-12-20 03:47:32 +11:00 |
Matthew Honnibal
|
3d178c03ae
|
* Prune the features a bit
|
2014-12-20 02:46:14 +11:00 |