Matthew Honnibal | 9568ebed08 | * Fix off-by-one in head reading | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | 3d6b3fc6fb | * Restore shuffling, and remove print statements from train.py | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | e167355505 | * Use JSON docs for training and evaluation. Currently a bug that is costing 0.6 acc | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | 69840d8cc3 | * Tweak verbose output printing in scorer.py | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | e0ef6b6992 | * Fix alignment in prepare_treebank | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | 0605af6838 | * Fix head misalignment in read_conll, when periods are ignored | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | d2ac8d8007 | * Add ctnt field to State, in preparation for constituency parsing | 2015-05-12 20:27:56 +02:00
Matthew Honnibal | ab67693393 | * Add read_json_file to conll.pyx | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | aff9359a8d | * Update ner.pyx to expect brackets from gold_tuples | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | 0ad72a77ce | * Write JSON files, with both dependency and PSG parses | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | 5078a32213 | * Work on script to format training data as a JSON file. | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | d48218f4b2 | * Add left_edge and right_edge properties | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | bdb56497b5 | * Add test for right_edge and left_edge | 2015-05-12 20:27:55 +02:00
Matthew Honnibal | 53cf77e1c8 | * Bug fix: when non-monotonically correct a dependency, make sure to delete the old one from the child list | 2015-05-12 20:26:41 +02:00
Matthew Honnibal | a4e2af54f9 | * Add support for l/r edge to add_dep, and move inlined methods into _state.pyx where possible | 2015-05-12 20:26:41 +02:00
Matthew Honnibal | d634038eb6 | * Add l_edge and r_edge props in TokenC for tracking the parse-yield of the token | 2015-05-12 20:26:41 +02:00
Matthew Honnibal | ec42b06a8d | * Add download data warning to index.rst | 2015-05-12 03:40:58 +02:00
Matthew Honnibal | 194f080b7d | * Update index.rst with release blurb | 2015-05-12 03:36:57 +02:00
Matthew Honnibal | 3a1ab85a76 | * Tweak readme.md | 2015-05-12 03:32:53 +02:00
Matthew Honnibal | b9da330d8e | * Fix updates description for v0.84 | 2015-05-12 03:30:56 +02:00
Matthew Honnibal | 6944a512b7 | * Fix updates.rst | 2015-05-12 03:27:12 +02:00
Matthew Honnibal | 03ebf70a66 | * Inc version to 0.84 | 2015-05-12 02:38:51 +02:00
Matthew Honnibal | e73eaf2d05 | * Replace some assertions with proper errors | 2015-05-08 16:52:17 +02:00
Matthew Honnibal | fb8d50b3d5 | Merge branch 'master' of ssh://github.com/honnibal/spaCy | 2015-04-30 12:45:15 +02:00
Matthew Honnibal | 4489d87550 | * Add cluster=0 by default in init_model | 2015-04-29 14:23:13 +02:00
Matthew Honnibal | ed8e8c3bd0 | * Whitespace | 2015-04-29 14:22:47 +02:00
Matthew Honnibal | 378c2a6435 | * Fix POS model: make it use tag instead of pos in history features | 2015-04-29 00:02:53 +02:00
Matthew Honnibal | 763ef01575 | * Fix two bugs in feature calculation | 2015-04-28 23:25:09 +02:00
Matthew Honnibal | 918b820472 | * Add testing file for issues such as raised in #57 | 2015-04-28 20:46:29 +02:00
Matthew Honnibal | b3fd48c97b | * Fix missing root labels bug identified in Issue #57 | 2015-04-28 20:45:51 +02:00
Matthew Honnibal | fd71ed5361 | Merge pull request #55 from suchow/master: Misc. improvements in style and consistency | 2015-04-21 02:14:47 +10:00
Jordan Suchow | 3005c86682 | Don't track generated data files | 2015-04-19 13:25:42 -07:00
Jordan Suchow | 38ed265b7d | Tweak line spacing | 2015-04-19 13:01:38 -07:00
Jordan Suchow | 85603f5b6a | Add CLA for suchow | 2015-04-19 13:01:38 -07:00
Jordan Suchow | 1b79d947b9 | Minor copyediting | 2015-04-19 13:01:38 -07:00
Jordan Suchow | 7bddd15e27 | Use consistent sentence spacing within files | 2015-04-19 13:01:38 -07:00
Jordan Suchow | 3a8d9b37a6 | Remove trailing whitespace | 2015-04-19 13:01:38 -07:00
Jordan Suchow | 5f0f940a1f | Remove unused imports | 2015-04-19 01:05:22 -07:00
Matthew Honnibal | 693c5a1558 | * Exclude clusterings for words only seen 1 or 2 times, as their clusters are unreliable | 2015-04-17 04:44:52 +02:00
Matthew Honnibal | cc4e395927 | * Add some ad hoc regexes, for multi-word location prepositions | 2015-04-17 04:44:24 +02:00
Matthew Honnibal | f7ffd94e6a | * Add Token.conjuncts property | 2015-04-17 01:40:53 +02:00
Matthew Honnibal | 4757899370 | * Fix times test | 2015-04-16 04:50:40 +02:00
Matthew Honnibal | 684d0e5e85 | * Download updated data | 2015-04-16 04:29:15 +02:00
Matthew Honnibal | 716ba06711 | * Inc version | 2015-04-16 04:28:15 +02:00
Matthew Honnibal | 2ef170a991 | * Fix Issue #54: Error merging multi-word token when there's a mid-token match. | 2015-04-16 04:28:06 +02:00
Matthew Honnibal | 42617548af | * Disable merge_mwes by default | 2015-04-16 04:20:31 +02:00
Matthew Honnibal | 99dbf8a38c | * Fix error type in lookup_transition | 2015-04-16 01:36:22 +02:00
Matthew Honnibal | 77d0700caf | * Add on X way regexes | 2015-04-16 01:35:46 +02:00
Matthew Honnibal | adcad4f353 | * Clean up train.py | 2015-04-15 06:02:04 +02:00
Matthew Honnibal | 9f16848b60 | * Add (N0w, N1w) unigram pair to NER features, prompted by failure to detect 'this weekend' | 2015-04-15 06:01:18 +02:00