860 Commits

Author SHA1 Message Date
Matthew Honnibal 1a7a1c2771 * Fix Issue #16: tokens recurse when printing 2015-01-30 19:47:50 +11:00
Matthew Honnibal cb95ef6934 * Fix download script 2015-01-30 19:28:43 +11:00
Matthew Honnibal e578bd37bd * Fix download script 2015-01-30 18:59:31 +11:00
Matthew Honnibal df52014d12 * Fix download script 2015-01-30 18:36:24 +11:00
Matthew Honnibal f0bbffca8d * Fix the way numpy headers are installed during compilation from source 2015-01-30 18:14:45 +11:00
Matthew Honnibal 1ef0e04aa0 * Change travis.yml to build from source, instead of from PyPi. PyPi checking will be done in a different branch. 2015-01-30 18:06:02 +11:00
Matthew Honnibal 0f95712189 * Improve accuracy reporting during training 2015-01-30 18:05:06 +11:00
Matthew Honnibal b68f563c2f * Fix Issue #14: Improve parsing API 2015-01-30 18:04:41 +11:00
Matthew Honnibal 998b607f65 * Upd download script, having it download all data if there's no data/ directory, allowing easier compilation from source 2015-01-30 18:04:01 +11:00
Matthew Honnibal 0b53fd7daa * Add test for parse tree navigation 2015-01-30 18:02:58 +11:00
Matthew Honnibal ef2493a3bd * Upd gitignore 2015-01-30 16:49:44 +11:00
Matthew Honnibal d5d1578e44 * Add manifest file 2015-01-30 16:49:02 +11:00
Matthew Honnibal 0a1ec40f76 * Add draft work on features 2015-01-30 16:46:52 +11:00
Matthew Honnibal 7d432b7e39 * Add tests for vector-space model 2015-01-30 16:45:45 +11:00
Matthew Honnibal 48b98e3fb5 * Add test for tag names 2015-01-30 16:45:11 +11:00
Matthew Honnibal 613a195f92 * Add test for indices 2015-01-30 16:44:29 +11:00
Matthew Honnibal 03cc2ee08e * Add test for numpy array transport 2015-01-30 16:43:55 +11:00
Matthew Honnibal d20eeac156 * Start work on lexrank tutorial 2015-01-30 16:42:43 +11:00
Matthew Honnibal b3f9b199cf Merge branch 'punctparse' 2015-01-30 16:38:56 +11:00
Matthew Honnibal ca7577d8a9 * Allow parsers and taggers to be trained on text without gold pre-processing. 2015-01-30 16:36:24 +11:00
Matthew Honnibal 67d6e53a69 * Ensure parser and tagger function correctly when training from missing values, indicated by -1 2015-01-30 14:08:56 +11:00
Matthew Honnibal 4ff180db74 * Fix off-by-one error in commit 0a7fceb 2015-01-30 12:49:33 +11:00
Matthew Honnibal d0e08a5b57 * Upd index tests 2015-01-30 12:35:13 +11:00
Matthew Honnibal 0a7fcebdf7 * Fix Issue #12: Incorrect token.idx calculations for some punctuation, in the presence of token cache 2015-01-30 12:33:38 +11:00
Matthew Honnibal b38093237e * More debug prints 2015-01-30 11:15:54 +11:00
Matthew Honnibal 35a18250cc * Upd tests, avoiding unnecessary processing to make testing faster 2015-01-30 10:41:55 +11:00
Matthew Honnibal 5458f220f8 * Fix quickstart instructions 2015-01-30 10:31:25 +11:00
Matthew Honnibal 11ed65b93c * Work on alignment, for evaluation with non-gold preprocessing 2015-01-30 10:31:03 +11:00
Matthew Honnibal ebf7d2fab1 * Use non-joint sbd, for more simplicity and fewer classes 2015-01-29 06:22:03 +11:00
Matthew Honnibal d05c5bf141 * Remove comment 2015-01-29 05:19:27 +11:00
Matthew Honnibal b4348ce1c3 * Messily use unsegmented sentences to train the parser 2015-01-29 04:21:13 +11:00
Matthew Honnibal 320b045daa * Oracle now consistent over gold standard derivation 2015-01-29 03:41:58 +11:00
Matthew Honnibal f590382134 * Work on sbd 2015-01-29 03:18:29 +11:00
Matthew Honnibal 9e78d673d5 * Fix quickstart installation docs 2015-01-28 14:28:34 +11:00
Matthew Honnibal fe5f34c37c Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-01-28 14:01:00 +11:00
Matthew Honnibal 781dd712dc * Fix numpy commit problem 2015-01-28 14:00:20 +11:00
Matthew Honnibal b08c0ce54e * Fix numpy install problem 2015-01-28 13:58:33 +11:00
Matthew Honnibal 9171284d62 * Fix compile-from-source instructions 2015-01-28 12:27:44 +11:00
Matthew Honnibal 1884a7a0be * Attach comment with paper 2015-01-28 03:18:43 +11:00
Matthew Honnibal a2d6b195db * Add messy Break transitions, carefully following the scheme of Dd Zhang et al (2013) 2015-01-28 03:09:45 +11:00
Matthew Honnibal f9ee5d9934 * Build a python list of word strings, for debugging 2015-01-28 01:06:13 +11:00
Matthew Honnibal d819101571 * Improve error message on oracle failure 2015-01-28 00:58:03 +11:00
Matthew Honnibal 1d39562946 * Ws 2015-01-28 00:57:16 +11:00
honnibal 97d91ada3a Update README.md 2015-01-27 21:55:36 +11:00
honnibal 51ca3a285d Update README.md 2015-01-27 21:55:23 +11:00
Matthew Honnibal e6c3d3471f * Tweak documentation for Tokens, and hide constructor as __cinit__ 2015-01-27 18:57:52 +11:00
Matthew Honnibal e10b712d29 * Ammend warning 2015-01-27 18:56:18 +11:00
Matthew Honnibal a5a9417d92 * Fix typo from toby 2015-01-27 18:53:29 +11:00
Matthew Honnibal f5fab2d9d7 * Switch to using sphinx_rtd_theme locally, instead of via pip install, for better control. Specifically, need to support google analytics 2015-01-27 18:52:47 +11:00
Matthew Honnibal 7b6bf87c25 * Fix api docs 2015-01-27 17:58:29 +11:00