Canbey Bilgili
|
abe098b255
|
Adds Turkish Lemmatization
|
2017-12-01 17:04:32 +03:00 |
Vadim Mazaev
|
4ba7ddf651
|
Bugfixies
|
2017-11-30 12:29:38 +03:00 |
Vadim Mazaev
|
53e7c38637
|
Fixed tests depends on pymorphy2
|
2017-11-26 21:04:44 +03:00 |
Vadim Mazaev
|
cacd859dcd
|
Added tag map, fixed tests fails, added more exceptions
|
2017-11-26 20:54:48 +03:00 |
Vadim Mazaev
|
81314f8659
|
Fixed tokenizer: added char classes; added first lemmatizer and
tokenizer tests
|
2017-11-21 22:23:59 +03:00 |
ines
|
3af281a334
|
Update test model name
|
2017-11-01 23:02:00 +01:00 |
Jim O'Regan
|
34ca59691b
|
no idea what is wrong here
|
2017-10-31 14:50:13 +00:00 |
Jim O'Regan
|
41dd29e48e
|
merge
|
2017-10-31 14:07:45 +00:00 |
Ines Montani
|
facf77e541
|
Merge branch 'develop' into support-danish
|
2017-10-24 11:53:19 +02:00 |
ines
|
612224c10d
|
Port over changes from #1157
|
2017-10-14 13:11:39 +02:00 |
ines
|
9b3f8f9ec3
|
Fix formatting and add comment on languages
|
2017-10-14 13:11:18 +02:00 |
ines
|
61a503a611
|
Fix parser test
|
2017-10-07 00:38:51 +02:00 |
Wannaphong Phatthiyaphaibun
|
7b5263ffa4
|
fix thai test
|
2017-09-26 23:54:15 +07:00 |
Wannaphong Phatthiyaphaibun
|
5cba67146c
|
add thai in spacy2
|
2017-09-26 21:36:27 +07:00 |
Jim O'Regan
|
7de709483b
|
missed adding here
|
2017-09-11 10:51:21 +01:00 |
Jim O'Regan
|
b1b6123867
|
add ga_tokenizer
|
2017-09-11 10:31:41 +01:00 |
Matthew Honnibal
|
cb4839033c
|
Fix loader for EN tests
|
2017-09-04 15:19:18 +02:00 |
Jim Geovedi
|
713d7c0aa0
|
added indonesian lang test
|
2017-08-20 12:17:14 +07:00 |
mollerhoj
|
e840077601
|
Add some basic tests for Danish
|
2017-07-03 15:49:51 +02:00 |
ines
|
a0f4592f0a
|
Update tests
|
2017-06-05 02:26:13 +02:00 |
ines
|
3e105bcd36
|
Update tests
|
2017-06-05 02:09:27 +02:00 |
ines
|
078232932c
|
Fix tokenizer fixture scope
|
2017-06-05 01:06:34 +02:00 |
Matthew Honnibal
|
55d0621532
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 15:53:25 -05:00 |
Matthew Honnibal
|
5b9f116aca
|
Update tests
|
2017-06-04 15:53:17 -05:00 |
ines
|
f432bb4b48
|
Fix fixture scopes
|
2017-06-04 22:34:31 +02:00 |
ines
|
20a7003c0d
|
Update model fixtures and reorganise tests
|
2017-05-29 22:14:31 +02:00 |
ines
|
6e3937efc5
|
Check for arguments of model markers to specify models to test
Lets user set --models --en for only English models
|
2017-05-29 22:10:16 +02:00 |
ines
|
b462076d80
|
Merge load_lang_class and get_lang_class
|
2017-05-14 01:31:10 +02:00 |
ines
|
5858857a78
|
Update languages list in conftest
|
2017-05-13 15:37:54 +02:00 |
ines
|
bd57b611cc
|
Update conftest to lazy load languages
|
2017-05-09 00:02:21 +02:00 |
Gregory Howard
|
c0afcd22bb
|
Merge remote-tracking branch 'remotes/upstream/master'
|
2017-04-27 14:42:54 +02:00 |
Gregory Howard
|
8ff4682255
|
correcting tokenizer exception.
Adding tests for lemmatization
|
2017-04-27 11:52:14 +02:00 |
luvogels
|
d12a0b6431
|
Hooked up tokenizer tests
|
2017-04-26 23:21:41 +02:00 |
oeg
|
c693d40791
|
feature(model): Add support for creating the Spanish model, including rich tagset, configuration, and basich tests
|
2017-04-06 18:48:45 +02:00 |
Ines Montani
|
97cb4d5e3c
|
Merge branch 'master' into master
|
2017-03-25 10:03:47 +01:00 |
Iddo Berger
|
da135bd823
|
add hebrew tokenizer
|
2017-03-24 18:27:44 +03:00 |
Matthew Honnibal
|
a630726b13
|
Fix typo in tests
|
2017-03-16 20:50:36 -05:00 |
Matthew Honnibal
|
f98b30583f
|
Fix tests
|
2017-03-16 19:48:00 -05:00 |
Matthew Honnibal
|
db51abf685
|
Fix tests
|
2017-03-16 18:53:47 -05:00 |
Aniruddha Adhikary
|
696215a3fb
|
add tests for Bengali
|
2017-03-05 11:25:12 +06:00 |
ines
|
21f09d10d7
|
Revert "Revert "Merge pull request #818 from raphael0202/tokenizer_exceptions""
This reverts commit f02a2f9322 .
|
2017-02-10 13:17:05 +01:00 |
ines
|
f02a2f9322
|
Revert "Merge pull request #818 from raphael0202/tokenizer_exceptions"
This reverts commit b95afdf39c , reversing
changes made to b0ccf32378 .
|
2017-02-09 17:07:21 +01:00 |
Raphaël Bournhonesque
|
309da78bf0
|
Merge branch 'master' into tokenizer_exceptions
|
2017-02-09 16:32:12 +01:00 |
Michael Wallin
|
35100c8bdd
|
[issue 805] Add regression test and the required fixture
|
2017-02-04 16:21:34 +02:00 |
Michael Wallin
|
1a1952afa5
|
[finnish] Add initial tests for tokenizer
|
2017-02-04 13:54:10 +02:00 |
Raphaël Bournhonesque
|
85f951ca99
|
Add tokenizer exceptions for French
|
2017-02-02 08:36:16 +01:00 |
Raphaël Bournhonesque
|
1be9c0e724
|
Add fr tokenization unit tests
|
2017-01-24 10:57:37 +01:00 |
Ines Montani
|
4bb5b89ee4
|
Add text_file_b fixture using BytesIO
|
2017-01-13 02:23:50 +01:00 |
Ines Montani
|
09acfbca01
|
Add Lemmatizer fixture
|
2017-01-12 23:38:55 +01:00 |
Ines Montani
|
514bfa2597
|
Add path fixture for spaCy data path
|
2017-01-12 23:38:47 +01:00 |