spaCy/spacy
Adriane Boyd 2263bc7b28
Update develop from master for v3.0.0rc5 (#6811)
* Fix `spacy.util.minibatch` when the size iterator is finished (#6745)

* Skip 0-length matches (#6759)

Add hack to prevent matcher from returning 0-length matches.

* support IS_SENT_START in PhraseMatcher (#6771)

* support IS_SENT_START in PhraseMatcher

* add unit test and friendlier error

* use IDS.get instead

* ensure span.text works for an empty span (#6772)

* Remove unicode_literals

Co-authored-by: Santiago Castro <bryant@montevideo.com.uy>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2021-01-26 14:52:45 +11:00
..
cli WIP: Various small training changes (#6818) 2021-01-26 14:51:52 +11:00
displacy Refactor Docs.is_ flags (#6044) 2020-09-17 00:14:01 +02:00
lang raise NotImplementedError when noun_chunks iterator is not implemented (#6711) 2021-01-17 19:56:05 +08:00
matcher Update develop from master for v3.0.0rc5 (#6811) 2021-01-26 14:52:45 +11:00
ml Avoid assuming encode.get_dim('nO') is set in tok2vec (#6800) 2021-01-24 14:37:33 +11:00
pipeline Revert "Set annotations in update" (#6810) 2021-01-25 22:18:45 +08:00
tests Update develop from master for v3.0.0rc5 (#6811) 2021-01-26 14:52:45 +11:00
tokens Update develop from master for v3.0.0rc5 (#6811) 2021-01-26 14:52:45 +11:00
training WIP: Various small training changes (#6818) 2021-01-26 14:51:52 +11:00
__init__.pxd
__init__.py require_cpu functionality (#6336) 2020-12-08 14:42:40 +08:00
__main__.py Tidy up 2020-06-22 00:45:40 +02:00
about.py Set annotations in update (#6767) 2021-01-20 11:49:25 +11:00
attrs.pxd
attrs.pyx
compat.py Use Literal type for nr_feature_tokens 2020-09-23 16:00:03 +02:00
default_config.cfg Add initialize.before_init and after_init callbacks 2021-01-12 13:07:44 +01:00
default_config_pretraining.cfg pretrain architectures (#6451) 2020-12-08 14:41:03 +08:00
errors.py WIP: Various small training changes (#6818) 2021-01-26 14:51:52 +11:00
glossary.py
kb.pxd Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
kb.pyx Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
language.py warn when frozen components break listener pattern (#6766) 2021-01-20 11:12:35 +11:00
lexeme.pxd Fix Lexeme.from_ptr 2020-08-10 16:43:37 +02:00
lexeme.pyx Update docs links in codebase 2020-09-04 12:58:50 +02:00
lookups.py Always serialize lookups and vectors to disk 2020-10-05 09:40:20 +02:00
morphology.pxd Add Lemmatizer and simplify related components (#5848) 2020-08-07 15:27:13 +02:00
morphology.pyx Prevent 0-length mem alloc (#6653) 2021-01-06 12:50:17 +11:00
parts_of_speech.pxd
parts_of_speech.pyx
pipe_analysis.py Tidy up and auto-format 2020-09-29 21:39:28 +02:00
schemas.py Add initialize.before_init and after_init callbacks 2021-01-12 13:07:44 +01:00
scorer.py WIP: Various small training changes (#6818) 2021-01-26 14:51:52 +11:00
strings.pxd Remove 'cleanup' of strings (#6007) 2020-09-01 16:12:15 +02:00
strings.pyx Update docs links in codebase 2020-09-04 12:58:50 +02:00
structs.pxd Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
symbols.pxd introduce token.has_head and refer to MISSING_DEP_ (WIP) 2021-01-12 17:17:06 +01:00
symbols.pyx introduce token.has_head and refer to MISSING_DEP_ (WIP) 2021-01-12 17:17:06 +01:00
tokenizer.pxd Simplify specials and cache checks (#6012) 2020-09-03 09:42:49 +02:00
tokenizer.pyx Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master-rc3 2021-01-14 11:49:58 +01:00
typedefs.pxd Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master 2020-11-25 11:49:34 +01:00
typedefs.pyx
util.py Fix error code 2021-01-18 11:43:45 +11:00
vectors.pyx Update docs links in codebase 2020-09-04 12:58:50 +02:00
vocab.pxd Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master 2020-11-25 11:49:34 +01:00
vocab.pyx Fix Doc.copy bugs (#6809) 2021-01-25 21:40:18 +08:00