spaCy

Daniël de Kok e5debc68e4 Tagger: use unnormalized probabilities for inference (#10197)	2022-03-15 14:15:31 +01:00

* Tagger: use unnormalized probabilities for inference

  Using unnormalized softmax avoids use of the relatively expensive exp function, which can significantly speed up non-transformer models (e.g. I got a speedup of 27% on a German tagging + parsing pipeline).

* Add spacy.Tagger.v2 with configurable normalization

  Normalization of probabilities is disabled by default to improve performance.

* Update documentation, models, and tests to spacy.Tagger.v2
* Move Tagger.v1 to spacy-legacy
* docs/architectures: run prettier
* Unnormalized softmax is now a Softmax_v2 option
* Require thinc 8.0.14 and spacy-legacy 3.0.9
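
The speedup described in the commit message above relies on softmax preserving the order of its inputs, so the highest raw score and the highest normalized probability pick the same tag. The following is a minimal pure-Python sketch of that property, not the thinc or spaCy implementation; the `softmax` and `argmax` helpers and the example scores are hypothetical and exist only for illustration.

```python
import math
from typing import List


def softmax(scores: List[float]) -> List[float]:
    """Return normalized probabilities; costs one exp call per score."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def argmax(values: List[float]) -> int:
    """Return the index of the largest value (first one on ties)."""
    return max(range(len(values)), key=values.__getitem__)


# Hypothetical raw scores for one token over four candidate tags.
scores = [1.2, -0.3, 3.4, 0.8]

# Softmax is monotonic, so skipping it does not change the predicted tag.
assert argmax(scores) == argmax(softmax(scores))
```

When only the predicted tag is needed, the normalization step can therefore be skipped; the raw scores are no longer probabilities, so anything that consumes them as such would presumably still need normalization enabled.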
_parser_internals	Make core projectivization functions cdef nogil (#10241)	2022-02-21 15:02:21 +01:00
legacy	Fix entity linker batching (#9669)	2022-03-04 09:17:36 +01:00
__init__.py	Add SpanCategorizer component (#6747)	2021-06-24 12:35:27 +02:00
attributeruler.py	Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master-v3.2-1	2021-10-26 11:53:50 +02:00
dep_parser.pyx	Document scorers in registry and components from #8766 (#8929)	2021-08-12 12:50:03 +02:00
entity_linker.py	Fix entity linker batching (#9669)	2022-03-04 09:17:36 +01:00
entityruler.py	Entity ruler remove pattern (#9685)	2021-12-06 15:32:49 +01:00
functions.py	Add doc_cleaner component (#9659)	2021-11-23 15:33:33 +01:00
lemmatizer.py	Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master-v3.2-1	2021-10-26 11:53:50 +02:00
morphologizer.pyx	Tagger: use unnormalized probabilities for inference (#10197)	2022-03-15 14:15:31 +01:00
multitask.pyx	Replace negative rows with 0 in StaticVectors (#7674)	2021-04-22 18:04:15 +10:00
ner.pyx	Document scorers in registry and components from #8766 (#8929)	2021-08-12 12:50:03 +02:00
pipe.pxd	TrainablePipe (#6213)	2020-10-08 21:33:49 +02:00
pipe.pyi	Add Pipe.hide_labels to omit labels from pipeline meta (#10175)	2022-02-05 17:59:24 +01:00
pipe.pyx	Add Pipe.hide_labels to omit labels from pipeline meta (#10175)	2022-02-05 17:59:24 +01:00
sentencizer.pyx	Add overwrite settings for more components (#9050)	2021-09-30 15:35:55 +02:00
senter.pyx	Tagger: use unnormalized probabilities for inference (#10197)	2022-03-15 14:15:31 +01:00
spancat.py	Save span candidates produced by spancat suggesters (#10413)	2022-03-14 16:46:58 +01:00
tagger.pyx	Tagger: use unnormalized probabilities for inference (#10197)	2022-03-15 14:15:31 +01:00
textcat.py	Bugfixes and test for rehearse (#10347)	2022-02-23 16:10:05 +01:00
textcat_multilabel.py	Fix Scorer.score_cats for missing labels (#9443)	2021-12-29 11:04:39 +01:00
tok2vec.py	Fix Tok2Vec for empty batches (#10324)	2022-02-21 10:22:36 +01:00
trainable_pipe.pxd	Refactor scoring methods to use registered functions (#8766)	2021-08-10 15:13:39 +02:00
trainable_pipe.pyx	Pass excludes when serializing vocab (#8824)	2021-08-03 14:42:44 +02:00
transition_parser.pxd	TrainablePipe (#6213)	2020-10-08 21:33:49 +02:00
transition_parser.pyx	Document scorers in registry and components from #8766 (#8929)	2021-08-12 12:50:03 +02:00