spaCy/spacy/pipeline
Matthew Honnibal 8656a08777
Add beam_parser and beam_ner components for v3 (#6369)
* Get basic beam tests working

* Get basic beam tests working

* Compile _beam_utils

* Remove prints

* Test beam density

* Beam parser seems to train

* Draft beam NER

* Upd beam

* Add hypothesis as dev dependency

* Implement missing is-gold-parse method

* Implement early update

* Fix state hashing

* Fix test

* Fix test

* Default to non-beam in parser constructor

* Improve oracle for beam

* Start refactoring beam

* Update test

* Refactor beam

* Update nn

* Refactor beam and weight by cost

* Update ner beam settings

* Update test

* Add __init__.pxd

* Upd test

* Fix test

* Upd test

* Fix test

* Remove ring buffer history from StateC

* WIP change arc-eager transitions

* Add state tests

* Support ternary sent start values

* Fix arc eager

* Fix NER

* Pass oracle cut size for beam

* Fix ner test

* Fix beam

* Improve StateC.clone

* Improve StateClass.borrow

* Work directly with StateC, not StateClass

* Remove print statements

* Fix state copy

* Improve state class

* Refactor parser oracles

* Fix arc eager oracle

* Fix arc eager oracle

* Use a vector to implement the stack

* Refactor state data structure

* Fix alignment of sent start

* Add get_aligned_sent_starts method

* Add test for ae oracle when bad sentence starts

* Fix sentence segment handling

* Avoid Reduce that inserts illegal sentence

* Update preset SBD test

* Fix test

* Remove prints

* Fix sent starts in Example

* Improve python API of StateClass

* Tweak comments and debug output of arc eager

* Upd test

* Fix state test

* Fix state test
2020-12-13 09:08:32 +08:00
..
_parser_internals Add beam_parser and beam_ner components for v3 (#6369) 2020-12-13 09:08:32 +08:00
__init__.py TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
attributeruler.py Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
dep_parser.pyx Add beam_parser and beam_ner components for v3 (#6369) 2020-12-13 09:08:32 +08:00
entity_linker.py fix embed_size in Entity Linker architecture (#6343) 2020-11-04 22:20:13 +01:00
entityruler.py Merge branch 'master' into pr/6444 2020-12-09 11:09:40 +11:00
functions.py Refactor Docs.is_ flags (#6044) 2020-09-17 00:14:01 +02:00
lemmatizer.py Warn on empty POS for the rule-based lemmatizer 2020-12-04 11:46:15 +01:00
morphologizer.pyx remove labels from morphologizer constructor 2020-11-11 21:48:50 +01:00
multitask.pyx remove labels from constructor 2020-11-11 21:34:12 +01:00
ner.pyx Add beam_parser and beam_ner components for v3 (#6369) 2020-12-13 09:08:32 +08:00
pipe.pxd TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
pipe.pyx TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
sentencizer.pyx Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
senter.pyx Handle missing reference values in scorer (#6286) 2020-11-03 15:47:18 +01:00
tagger.pyx remove labels from constructor 2020-11-11 21:34:12 +01:00
textcat.py TextCat updates and fixes (#6263) 2020-10-18 14:50:41 +02:00
tok2vec.py Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
trainable_pipe.pxd Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
trainable_pipe.pyx always return losses 2020-10-14 15:00:49 +02:00
transition_parser.pxd TrainablePipe (#6213) 2020-10-08 21:33:49 +02:00
transition_parser.pyx Add beam_parser and beam_ner components for v3 (#6369) 2020-12-13 09:08:32 +08:00