Commit Graph

13679 Commits

Author SHA1 Message Date
Matthew Honnibal b7e01d2024 Fix quickstart 2020-10-05 21:21:30 +02:00
Matthew Honnibal ff8b980775 Upd quickstart template 2020-10-05 21:19:41 +02:00
Matthew Honnibal 91d0fbb588 Fix test 2020-10-05 21:13:53 +02:00
Ines Montani 9ca283a899 Merge branch 'develop' into feature/project-spacy-version 2020-10-05 21:06:07 +02:00
Ines Montani 9aa07ad001 Update quickstarts [ci skip] 2020-10-05 21:05:41 +02:00
Ines Montani 706b7f6973 Update docs 2020-10-05 20:51:22 +02:00
Ines Montani 0135f6ed95 Enable commit check via env var 2020-10-05 20:51:15 +02:00
Matthew Honnibal 919790cb47 Upd MultiHashEmbed docs 2020-10-05 20:28:21 +02:00
Matthew Honnibal b392d48e76 Fix test 2020-10-05 20:17:07 +02:00
Ines Montani be99f1e4de
Remove output dirs before training (#6204)
* Remove output dirs before training

* Re-raise error if cleaning fails
2020-10-05 20:11:16 +02:00
Matthew Honnibal e50047f1c5 Check lengths match 2020-10-05 20:02:45 +02:00
Ines Montani 582701519e Remove __release__ flag 2020-10-05 20:00:49 +02:00
Ines Montani d58fb42707 Add spacy_version option and validation for project.yml 2020-10-05 20:00:42 +02:00
Matthew Honnibal db84d175c3 Fix test 2020-10-05 19:59:30 +02:00
Matthew Honnibal cdd2b79b6d Remove deprecated MultiHashEmbed 2020-10-05 19:58:18 +02:00
Matthew Honnibal 6dcc4a0ba6 Simplify MultiHashEmbed signature 2020-10-05 19:57:45 +02:00
Adriane Boyd d2806f11f2 Update to spacy-pkuseg==0.0.26 in Makefile 2020-10-05 18:08:32 +02:00
svlandeg 193e0d5a98 add docs for entity_ruler.initialize 2020-10-05 18:04:08 +02:00
svlandeg 3ac3447eee cleanup 2020-10-05 17:50:37 +02:00
svlandeg 9eb813a35d Merge remote-tracking branch 'upstream/develop' into fix/patterns-init 2020-10-05 17:49:44 +02:00
Adriane Boyd f102ef6b54 Read features.msgpack instead of features.pkl 2020-10-05 17:47:39 +02:00
svlandeg 4e3ace4b8c is_trainable method 2020-10-05 17:43:42 +02:00
Ines Montani 84fedcebab
Make args keyword-only [ci skip]
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-10-05 17:07:35 +02:00
Matthew Honnibal 71e73ed0a6 Merge branch 'develop' into feature/embed-features 2020-10-05 17:00:05 +02:00
Matthew Honnibal 3ee3649b52 Fix augment 2020-10-05 16:59:49 +02:00
Matthew Honnibal 22937d25a9 Merge branch 'develop' into feature/embed-features 2020-10-05 16:42:17 +02:00
Matthew Honnibal 8deed614e9 Fix augment 2020-10-05 16:41:45 +02:00
Matthew Honnibal 4ed3e037df Fix augment 2020-10-05 16:40:55 +02:00
Matthew Honnibal 9f1bc3f24c Fix augment 2020-10-05 16:40:23 +02:00
svlandeg dc06912c76 prevent loss keyerror for non-trainable components 2020-10-05 16:33:28 +02:00
Adriane Boyd 187234648c Revert back to "default" as default for pkuseg_user_dict 2020-10-05 16:24:28 +02:00
svlandeg 65abd77779 add finish_update to Pipe 2020-10-05 16:23:33 +02:00
Matthew Honnibal 90040aacec Fix merge 2020-10-05 16:12:01 +02:00
Matthew Honnibal 93a98e8c3e
Merge branch 'develop' into feature/embed-features 2020-10-05 15:51:31 +02:00
Matthew Honnibal eb9ba61517 Format 2020-10-05 15:29:49 +02:00
Matthew Honnibal 7d93575f35 spacy/tests/ 2020-10-05 15:28:12 +02:00
Matthew Honnibal f4ca9a39cb spacy/tests/ 2020-10-05 15:27:06 +02:00
Matthew Honnibal f2f1deca66 spacy/tests/ 2020-10-05 15:24:33 +02:00
Matthew Honnibal 8ec79ad3fa Allow configuration of MultiHashEmbed features
Update arguments to MultiHashEmbed layer so that the attributes can be
controlled. A kind of tricky scheme is used to allow optional
specification of the rows. I think it's an okay balance between
flexibility and convenience.
2020-10-05 15:22:00 +02:00
Ines Montani 7946fd84bb
Merge pull request #6200 from adrianeboyd/bugfix/vocab-disk-lookups-vectors
Always serialize lookups and vectors to disk
2020-10-05 15:15:25 +02:00
Ines Montani 8171e28b20 Remove logging [ci skip]
This would be fired on each example, which is wrong
2020-10-05 15:09:52 +02:00
svlandeg 251b3eb4e5 add initialize method for entity_ruler 2020-10-05 14:59:13 +02:00
Sofie Van Landeghem f4f49f5877
update blis (#6198)
* allow higher blis version

* fix typo

* bump to 3.0.0a34

* fix pins in other files
2020-10-05 14:58:56 +02:00
Adriane Boyd 5d19dfc9d3 Update Chinese tokenizer for spacy-pkuseg fork 2020-10-05 14:21:53 +02:00
Matthew Honnibal 6a9d14e35a Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2020-10-05 14:17:41 +02:00
Matthew Honnibal d2b9aafb8c Fix augmenter 2020-10-05 14:14:49 +02:00
Ines Montani 6260fa3c10
Merge pull request #6201 from svlandeg/fix/error_nr 2020-10-05 14:00:57 +02:00
Ines Montani 6958510bda Include spaCy version check in project CLI 2020-10-05 13:53:07 +02:00
Ines Montani 20f2a17a09 Merge test_misc and test_util 2020-10-05 13:45:57 +02:00
svlandeg fd2d48556c fix E902 and E903 numbering 2020-10-05 13:43:32 +02:00