Commit Graph

5978 Commits

Author SHA1 Message Date
Matthew Honnibal ed6c85fa3c Fix loading of text categories in GoldParse 2017-07-22 20:04:03 +02:00
Matthew Honnibal 6ffec9dfea Update _ml, for textcat model 2017-07-22 20:03:40 +02:00
Matthew Honnibal d6a5c2c85a Add test for NER 2017-07-22 01:48:58 +02:00
Matthew Honnibal 28244df4da Add test for beam parsing 2017-07-22 01:48:35 +02:00
Matthew Honnibal c86445bdfd Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-07-22 01:14:28 +02:00
Matthew Honnibal b3a749610e Fix name of TextCategorizer 2017-07-22 01:14:07 +02:00
Matthew Honnibal 2424493970 Remove unnecessary import of Mock 2017-07-22 01:13:54 +02:00
Matthew Honnibal baa3d81c35 Add text categorizer to Language 2017-07-22 01:13:36 +02:00
Matthew Honnibal a6a2159969 Add slot for text categories to Doc 2017-07-22 00:34:15 +02:00
Matthew Honnibal 374ab3ecfb Increment alpha version 2017-07-22 00:32:49 +02:00
Matthew Honnibal 289f23df51 Test beam parsing 2017-07-20 15:03:10 +02:00
Matthew Honnibal 3da1063b36 Add beam decoding to parser, to allow NER uncertainties 2017-07-20 15:02:55 +02:00
Matthew Honnibal 0ca5832427 Improve negative example handling in NER oracle 2017-07-20 00:18:49 +02:00
Matthew Honnibal a231b56d40 Add text-classification hook to pipeline 2017-07-20 00:18:15 +02:00
Matthew Honnibal 7ea50182a5 Add support for text-classification labels to GoldParse 2017-07-20 00:17:47 +02:00
Matthew Honnibal 727481377e Add text-classifier thinc models 2017-07-20 00:17:17 +02:00
Matthew Honnibal f014138c11 Fix parser tests 2017-07-20 00:16:52 +02:00
Ines Montani c91642efd5 Port over changes from #1168 2017-07-01 11:43:54 +02:00
Ines Montani e265e34e18 Merge pull request #1153 from jimregan/polish
add tokeniser exceptions for Polish
2017-06-27 14:48:00 +02:00
Jim Regan d81ceb0cd5 Merge branch 'develop' into polish 2017-06-26 22:42:27 +01:00
Jim O'Regan 2f84c73585 a start 2017-06-26 22:40:04 +01:00
Jim O'Regan 28d7f0a672 reference 2017-06-26 22:38:28 +01:00
Ines Montani 01c7c09c7f Merge pull request #1146 from jarle/doc-patch
Fix small typo in the new spaCy 101 guide
2017-06-26 10:41:18 +02:00
Jarle Mathiesen f20533ec0c fix small typo 2017-06-24 12:31:33 +02:00
Matthew Honnibal 91e52543ef Merge pull request #1118 from Gregory-Howard/patch-2
Update _tokenizer_exceptions_list (adding cities)
2017-06-20 11:16:07 +02:00
Matthew Honnibal 8ea785e01a Merge pull request #1119 from oroszgy/patch-3
Fixed conllu converter
2017-06-20 11:14:41 +02:00
Ines Montani f64e3efc76 Merge pull request #1128 from thinline72/patch-1
Changed the capital of Lithuania to Vilnius
2017-06-13 13:14:43 +02:00
Savva Kolbachev 800a8faff4 Changed the capital of Lithuania to Vilnius
Hi,
There is a typo about the capital of Lithuania.

Vilnius is the capital of Lithuania https://en.wikipedia.org/wiki/Vilnius
Ljubljana is the capital of Slovenia https://en.wikipedia.org/wiki/Ljubljana
2017-06-12 23:27:00 +03:00
Ines Montani 6eae9f943a Merge pull request #1125 from Tpt/french_noun_chunks
Adds function to extract french noun chunks
2017-06-12 21:25:33 +02:00
Ines Montani 57f64b9e1c Merge pull request #1124 from v3t3a/patch-3
docs - Fix url error for Displacy Ent visualizer
2017-06-12 21:20:32 +02:00
Ines Montani b2a28028cf Merge pull request #1115 from v3t3a/patch-2
docs - Add read() method when opening file (Lightning tour)
2017-06-12 21:19:25 +02:00
Ines Montani fe8d136ae0 Merge pull request #1114 from v3t3a/patch-1
docs - Update doc.jade (Just remove a duplicate 'doc =')
2017-06-12 21:19:02 +02:00
Tpt 7745b3ae04 Adds noun chunks to French syntax iterators 2017-06-12 15:29:58 +02:00
Tpt 57e8254f63 Adds function to extract french noun chunks 2017-06-12 15:20:49 +02:00
Vetea eae1f7b19c Fix url error for Displacy Ent visualizer 2017-06-12 14:30:02 +02:00
György Orosz 62dbf9025c Fixed conllu converter 2017-06-09 22:53:56 +02:00
Grégory Howard cd974b32b7 Update _tokenizer_exceptions_list (adding cities) 2017-06-09 17:58:18 +02:00
ines 49026a1346 Fix typos in example (see #1105) 2017-06-08 19:15:50 +02:00
Vetea cc3aee1189 Add read() method when opening file
Add read() method when opening the file, to avoid:
```TypeError: Argument 'string' has incorrect type (expected str, got _io.TextIOWrapper)```

Test with:
spaCy : v2.0.0 Alpha
python : 3.5.2+ (default, Sep 22 2016, 12:18:14)
2017-06-08 11:27:09 +02:00
Vetea 8e20cf6368 Update doc.jade
Just remove a duplicate 'doc ='
2017-06-08 10:35:58 +02:00
ines 34a2eecb17 Add simple "naughty strings" test (see #1107) 2017-06-06 17:43:51 +02:00
ines 6b799bac54 Fix formatting and details 2017-06-06 14:37:49 +02:00
ines 6c34b1a65b Update alpha thread link 2017-06-06 00:58:12 +02:00
ines 045574a936 Update package name and increment version 2017-06-05 20:41:30 +02:00
Matthew Honnibal 1f5874a927 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-06-05 20:20:00 +02:00
ines 03db56f48c Detect spaCy version and add package title
Package title allows customised package names (like spacy-nightly)
2017-06-05 20:11:02 +02:00
ines c921ba109a Fix robots and meta 2017-06-05 20:07:52 +02:00
Matthew Honnibal c0d90f52f7 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-06-05 19:20:13 +02:00
ines fd9ae0f0e0 Update v2 comparison table 2017-06-05 16:39:11 +02:00
ines cc9c5dc7a3 Fix noun chunks test 2017-06-05 16:39:04 +02:00