Matthew Honnibal
|
3b793cf4f7
|
* Tests passing for new Word object version
|
2014-08-24 18:13:53 +02:00 |
Matthew Honnibal
|
a22101404a
|
* Move en_ptb data
|
2014-08-22 04:28:51 +02:00 |
Matthew Honnibal
|
a2047fa5aa
|
* Add 's suffix to tokenization table
|
2014-08-18 23:21:37 +02:00 |
Matthew Honnibal
|
cc3971ce5c
|
* Fix error in tokenization rules
|
2014-07-07 05:09:34 +02:00 |
Matthew Honnibal
|
997551241f
|
* Upd ptb tokenization rules
|
2014-07-07 05:09:22 +02:00 |
Matthew Honnibal
|
df0458001d
|
* Begin work on full PTB-compatible English tokenization
|
2014-07-07 04:29:24 +02:00 |
Matthew Honnibal
|
d5bef02c72
|
* Reorganized, moving language-independent stuff to spacy. The functions in spacy ask for the dictionaries and split function on input, but the language-specific modules are curried versions that use the globals
|
2014-07-07 04:21:06 +02:00 |