spaCy

Jordan Suchow 5f0f940a1f Remove unused imports	2015-04-19 01:05:22 -07:00
..
_depr_group_by.py	* Refactor around Word objects, adapting tests. Tests passing, except for string views.	2014-08-23 19:55:06 +02:00
depr_test_ner.py	* Add WordNet lemmatizer	2014-12-08 01:39:13 +11:00
my_test.py	* Initial commit. Tests passing for punctuation handling. Need contractions, file transport, tokenize function, etc.	2014-07-05 20:51:42 +02:00
prag_sbd.py	* Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail.	2015-04-12 04:46:40 +02:00
sun.tokens	* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.	2014-07-07 01:15:59 +02:00
sun.txt	* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.	2014-07-07 01:15:59 +02:00
test_add_lemmas.py	* Add test for Issue 24	2015-02-08 18:30:46 -05:00
test_align.py	* Commit outstanding tests	2014-11-12 23:24:32 +11:00
test_array.py	* Fix Issue #43 : TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way.	2015-04-07 06:00:57 +02:00
test_asciify.py	* Upd asciify test, fixing type error	2015-01-06 01:09:44 +11:00
test_contractions.py	* Upd tests, avoiding unnecessary processing to make testing faster	2015-01-30 10:41:55 +11:00
test_detokenize.py	* Add detokenize method and test	2014-10-18 18:07:29 +11:00
test_docs.py	* Add some tests for the code in the index.html docstrings	2015-02-07 08:52:13 -05:00
test_emoticons.py	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
test_flag_features.py	* Fix unicode bugs in tests	2015-01-05 17:54:54 +11:00
test_indices.py	* Add test for indices	2015-01-30 16:44:29 +11:00
test_infix.py	* Tests passing except for morphology/lemmatization stuff	2014-12-23 11:40:32 +11:00
test_intern.py	* Fix unicode in test	2015-01-25 19:04:23 +11:00
test_is_punct.py	* Switch to new data model, tests passing	2014-10-10 08:11:31 +11:00
test_iter_lexicon.py	* Upd tests	2014-12-30 21:34:09 +11:00
test_lemmatizer.py	* Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme	2015-01-15 00:33:16 +11:00
test_lexeme_flags.py	* Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme	2015-01-15 00:33:16 +11:00
test_merge.py	* Fix Issue #54 : Error merging multi-word token when there's a mid-token match.	2015-04-16 04:28:06 +02:00
test_morph_exceptions.py	* Fix test_morph_exceptions	2015-03-26 16:44:46 +01:00
test_ner.py	* Add test for simple NER case	2015-04-13 21:33:54 +02:00
test_number.py	* Fix unicode bugs in tests	2015-01-05 17:54:54 +11:00
test_only_punct.py	* Upd tests	2014-12-30 21:34:09 +11:00
test_parse_navigate.py	* Extend parse tree navigation tests	2015-02-07 18:28:45 -05:00
test_post_punct.py	* Tests passing except for morphology/lemmatization stuff	2014-12-23 11:40:32 +11:00
test_pre_punct.py	* Don't load parser in test_pre_punct	2015-01-30 20:11:47 +11:00
test_sbd.py	* Upd sbd tests	2015-03-26 16:44:45 +01:00
test_shape.py	* Upd shape test	2014-11-07 04:42:54 +11:00
test_span.py	Remove unused imports	2015-04-19 01:05:22 -07:00
test_special_affix.py	* Upd tests, avoiding unnecessary processing to make testing faster	2015-01-30 10:41:55 +11:00
test_string_loading.py	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
test_subtree.py	* Add tests for new subtree method	2015-03-03 05:41:00 -05:00
test_surround_punct.py	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
test_tag_names.py	* Fix test_tag_names again	2015-02-01 16:25:03 +11:00
test_times.py	* Fix times test	2015-04-16 04:50:40 +02:00
test_token.py	* Merge train.py	2015-03-26 16:44:41 +01:00
test_token_api.py	* Add test for Issue #44	2015-04-07 06:05:18 +02:00
test_token_references.py	* Add test from NSchrading	2015-02-16 11:49:31 -05:00
test_tokenizer.py	* Upd tokenizer with i.e. tests	2015-02-18 06:37:04 -05:00
test_tokens_api.py	* Add tests for tokens api	2015-02-07 13:14:07 -05:00
test_tokens_from_list.py	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
test_urlish.py	* Fix unicode bugs in tests	2015-01-05 17:54:54 +11:00
test_vec.py	* Add tests for vector-space model	2015-01-30 16:45:45 +11:00
test_vocab.py	* Rename sic to orth	2015-01-23 02:08:25 +11:00
test_whitespace.py	* Upd tests for new meaning of 'string'	2015-01-24 07:22:30 +11:00
test_wiki_sun.py	Remove unused imports	2015-04-19 01:05:22 -07:00
tokenizer.sed	* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.	2014-07-07 01:15:59 +02:00

_depr_group_by.py

* Refactor around Word objects, adapting tests. Tests passing, except for string views.

2014-08-23 19:55:06 +02:00

depr_test_ner.py

* Add WordNet lemmatizer

2014-12-08 01:39:13 +11:00

my_test.py

* Initial commit. Tests passing for punctuation handling. Need contractions, file transport, tokenize function, etc.

2014-07-05 20:51:42 +02:00

prag_sbd.py

* Add pragmatic sentence boundary detection tests, from that Ruby gem. Not automatically run, as they can arbitrarily fail based on model changes. Currently 8/15 fail.

2015-04-12 04:46:40 +02:00

sun.tokens

* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.

2014-07-07 01:15:59 +02:00

sun.txt

* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.

2014-07-07 01:15:59 +02:00

test_add_lemmas.py

* Add test for Issue 24

2015-02-08 18:30:46 -05:00

test_align.py

* Commit outstanding tests

2014-11-12 23:24:32 +11:00

test_array.py

* Fix Issue #43 : TAG attr not supported. Also add DEP attr, while I'm at it. Need better way of ensuring future changes don't break in similar way.

2015-04-07 06:00:57 +02:00

test_asciify.py

* Upd asciify test, fixing type error

2015-01-06 01:09:44 +11:00

test_contractions.py

* Upd tests, avoiding unnecessary processing to make testing faster

2015-01-30 10:41:55 +11:00

test_detokenize.py

* Add detokenize method and test

2014-10-18 18:07:29 +11:00

test_docs.py

* Add some tests for the code in the index.html docstrings

2015-02-07 08:52:13 -05:00

test_emoticons.py

* Upd tests for new meaning of 'string'

2015-01-24 07:22:30 +11:00

test_flag_features.py

* Fix unicode bugs in tests

2015-01-05 17:54:54 +11:00

test_indices.py

* Add test for indices

2015-01-30 16:44:29 +11:00

test_infix.py

* Tests passing except for morphology/lemmatization stuff

2014-12-23 11:40:32 +11:00

test_intern.py

* Fix unicode in test

2015-01-25 19:04:23 +11:00

test_is_punct.py

* Switch to new data model, tests passing

2014-10-10 08:11:31 +11:00

test_iter_lexicon.py

* Upd tests

2014-12-30 21:34:09 +11:00

test_lemmatizer.py

* Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme

2015-01-15 00:33:16 +11:00

test_lexeme_flags.py

* Tests passing after refactor. API has obvious warts, particularly in Token and Lexeme

2015-01-15 00:33:16 +11:00

test_merge.py

* Fix Issue #54 : Error merging multi-word token when there's a mid-token match.

2015-04-16 04:28:06 +02:00

test_morph_exceptions.py

* Fix test_morph_exceptions

2015-03-26 16:44:46 +01:00

test_ner.py

* Add test for simple NER case

2015-04-13 21:33:54 +02:00

test_number.py

* Fix unicode bugs in tests

2015-01-05 17:54:54 +11:00

test_only_punct.py

* Upd tests

2014-12-30 21:34:09 +11:00

test_parse_navigate.py

* Extend parse tree navigation tests

2015-02-07 18:28:45 -05:00

test_post_punct.py

* Tests passing except for morphology/lemmatization stuff

2014-12-23 11:40:32 +11:00

test_pre_punct.py

* Don't load parser in test_pre_punct

2015-01-30 20:11:47 +11:00

test_sbd.py

* Upd sbd tests

2015-03-26 16:44:45 +01:00

test_shape.py

* Upd shape test

2014-11-07 04:42:54 +11:00

test_span.py

Remove unused imports

2015-04-19 01:05:22 -07:00

test_special_affix.py

* Upd tests, avoiding unnecessary processing to make testing faster

2015-01-30 10:41:55 +11:00

test_string_loading.py

* Upd tests for new meaning of 'string'

2015-01-24 07:22:30 +11:00

test_subtree.py

* Add tests for new subtree method

2015-03-03 05:41:00 -05:00

test_surround_punct.py

* Upd tests for new meaning of 'string'

2015-01-24 07:22:30 +11:00

test_tag_names.py

* Fix test_tag_names again

2015-02-01 16:25:03 +11:00

test_times.py

* Fix times test

2015-04-16 04:50:40 +02:00

test_token.py

* Merge train.py

2015-03-26 16:44:41 +01:00

test_token_api.py

* Add test for Issue #44

2015-04-07 06:05:18 +02:00

test_token_references.py

* Add test from NSchrading

2015-02-16 11:49:31 -05:00

test_tokenizer.py

* Upd tokenizer with i.e. tests

2015-02-18 06:37:04 -05:00

test_tokens_api.py

* Add tests for tokens api

2015-02-07 13:14:07 -05:00

test_tokens_from_list.py

* Upd tests for new meaning of 'string'

2015-01-24 07:22:30 +11:00

test_urlish.py

* Fix unicode bugs in tests

2015-01-05 17:54:54 +11:00

test_vec.py

* Add tests for vector-space model

2015-01-30 16:45:45 +11:00

test_vocab.py

* Rename sic to orth

2015-01-23 02:08:25 +11:00

test_whitespace.py

* Upd tests for new meaning of 'string'

2015-01-24 07:22:30 +11:00

test_wiki_sun.py

Remove unused imports

2015-04-19 01:05:22 -07:00

tokenizer.sed

* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes.

2014-07-07 01:15:59 +02:00