ines | 531ee1373b | Rename "Language models" to "Languages" in API | 2017-05-12 15:38:56 +02:00
ines | c4d2c3cac7 | Update adding languages docs | 2017-05-12 15:38:17 +02:00
ines | c4857bc7db | Remove unused argument | 2017-05-12 15:37:54 +02:00
ines | c13b3fa052 | Add LEX_ATTRS | 2017-05-12 15:37:45 +02:00
ines | bca2ea9c72 | Update Portuguese lexical attributes | 2017-05-12 15:37:39 +02:00
ines | 2f870123bf | Fix formatting | 2017-05-12 15:37:20 +02:00
ines | ca65993d59 | Add basic Polish Language class | 2017-05-12 09:25:37 +02:00
ines | 48177c4f92 | Add missing tokenizer exceptions | 2017-05-12 09:25:24 +02:00
ines | bb8be3d194 | Add Danish language data | 2017-05-10 21:15:12 +02:00
Matthew Honnibal | 4efb391994 | Fix serializer | 2017-05-09 18:45:18 +02:00
Matthew Honnibal | b16ae75824 | Remove serializer hacks from pipeline classes | 2017-05-09 18:16:40 +02:00
Matthew Honnibal | 7253b4e649 | Remove old serialization tests | 2017-05-09 18:12:58 +02:00
Matthew Honnibal | f9327343ce | Start updating serializer test | 2017-05-09 18:12:03 +02:00
Matthew Honnibal | 1166b0c491 | Implement Doc.to_bytes and Doc.from_bytes methods | 2017-05-09 18:11:34 +02:00
Matthew Honnibal | 9e167b7bb6 | Strip serializer from code | 2017-05-09 17:28:50 +02:00
Matthew Honnibal | 825c6403d8 | Remove serializer | 2017-05-09 17:28:30 +02:00
Matthew Honnibal | b53f7dfdc3 | Remove spacy.serialize | 2017-05-09 17:22:06 +02:00
Matthew Honnibal | 62ecdea9f2 | Add binder class for document serialization | 2017-05-09 17:21:00 +02:00
ines | a0b00624bb | Make sure like_email returns bool | 2017-05-09 11:37:29 +02:00
ines | ea60932e1b | Fix formatting | 2017-05-09 11:08:14 +02:00
ines | 2c3bdd09b1 | Add English test for like_num | 2017-05-09 11:06:34 +02:00
ines | 22375eafb0 | Fix and merge attrs and lex_attrs tests | 2017-05-09 11:06:25 +02:00
ines | 02d0ac5cab | Remove redundant function and fix formatting | 2017-05-09 11:06:04 +02:00
ines | b5ca50607e | Reorganise entity rules | 2017-05-09 01:37:10 +02:00
ines | 564939391a | Remove spacy.orth | 2017-05-09 01:21:47 +02:00
ines | 12c3d5fbba | Fix formatting | 2017-05-09 01:15:28 +02:00
ines | 2829a024ef | Re-add basic like_num check to global lex_attrs | 2017-05-09 01:15:23 +02:00
ines | 88adeee548 | Add English lex_attrs overrides | 2017-05-09 01:09:52 +02:00
ines | 8f3fbbb147 | Fix typos | 2017-05-09 01:09:37 +02:00
ines | ea5fa46475 | Import LEX_ATTRS from lang.lex_attrs | 2017-05-09 00:58:10 +02:00
ines | 2216e5f326 | Reorganise lex_attrs and add dict | 2017-05-09 00:57:54 +02:00
ines | e666f14d20 | Add global lex_attrs | 2017-05-09 00:41:53 +02:00
ines | 41972c43fe | Use consistent regex imports | 2017-05-09 00:34:31 +02:00
ines | 7b83977020 | Remove unused munge package | 2017-05-09 00:16:16 +02:00
ines | c714841cc8 | Move language-specific tests to tests/lang | 2017-05-09 00:02:37 +02:00
ines | bd57b611cc | Update conftest to lazy load languages | 2017-05-09 00:02:21 +02:00
ines | 9f0fd5963f | Reorganise Hungarian punctuation rules | 2017-05-09 00:01:59 +02:00
ines | fc0d793360 | Reorganise Bengali punctuation rules | 2017-05-09 00:01:52 +02:00
ines | e895d1afd7 | Reorganise French punctuation rules | 2017-05-09 00:00:54 +02:00
ines | 014bda0ae3 | Reorganise global punctuation rules | 2017-05-09 00:00:46 +02:00
ines | a91278cb32 | Rename _URL_PATTERN to URL_PATTERN | 2017-05-09 00:00:00 +02:00
ines | 604f299cf6 | Add char classes to global language data | 2017-05-08 23:59:33 +02:00
ines | f6f5d78cb9 | Fix formatting | 2017-05-08 23:59:17 +02:00
ines | 6eb6306843 | Fix language data imports | 2017-05-08 23:58:31 +02:00
ines | 3c0f85de8e | Remove imports in /lang/__init__.py | 2017-05-08 23:58:07 +02:00
ines | 86d9c29f30 | Reorder util functions | 2017-05-08 23:51:15 +02:00
ines | 9a0d2fdef1 | Add load_lang_class() util function | 2017-05-08 23:50:45 +02:00
ines | 614aa09582 | Tidy up Bengali tokenizer exceptions | 2017-05-08 22:29:49 +02:00
ines | 73b577cb01 | Fix relative imports | 2017-05-08 22:29:04 +02:00
ines | ae99990f63 | Fix formatting | 2017-05-08 22:23:48 +02:00