spaCy/spacy
Adriane Boyd c4112a1da3
Require that all SpanGroup spans are from the current doc (#12569)
* Require that all SpanGroup spans are from the current doc

The restriction that only spans from the current doc can be added was
already implemented for all operations except `SpanGroup.__init__`.

Initialize copied spans for `SpanGroup.copy` with `Doc.char_span` in
order to validate the character offsets and to make it possible to copy
spans between documents with differing tokenization. Currently there is
no validation that the document texts are identical, but the span
character offsets must map to valid spans in the target doc, which
prevents you from ending up with completely invalid spans.
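
The offset check described above can be illustrated with a minimal,
library-free sketch. Everything in it is hypothetical: `ToyDoc`,
`copy_spans`, and the regex tokenizers are stand-ins and not spaCy's
implementation, and raising `ValueError` on a misaligned span is an
assumption, since the commit message does not say how a failed lookup is
reported. It only shows why resolving spans by character offset lets them
move between docs with different tokenization while rejecting offsets that
do not fall on token boundaries.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple

CharSpan = Tuple[int, int]


@dataclass(frozen=True)
class ToyDoc:
    """Hypothetical stand-in for a tokenized document (not spaCy's Doc)."""

    text: str
    token_pattern: str  # regex that defines what counts as a token

    def token_offsets(self) -> List[CharSpan]:
        # (start_char, end_char) for every token in the text
        return [m.span() for m in re.finditer(self.token_pattern, self.text)]

    def char_span(self, start: int, end: int) -> Optional[CharSpan]:
        # Accept the offsets only if they line up with token boundaries.
        offsets = self.token_offsets()
        starts = {s for s, _ in offsets}
        ends = {e for _, e in offsets}
        if start < end and start in starts and end in ends:
            return (start, end)
        return None


def copy_spans(spans: List[CharSpan], target: ToyDoc) -> List[CharSpan]:
    """Re-create spans in `target` by character offset, validating each one."""
    copied = []
    for start, end in spans:
        span = target.char_span(start, end)
        if span is None:
            # Assumption: treat a misaligned span as an error.
            raise ValueError(f"({start}, {end}) is not a valid span in the target doc")
        copied.append(span)
    return copied


text = "New York-based firm"
coarse = ToyDoc(text, r"\S+")        # New | York-based | firm
fine = ToyDoc(text, r"\w+|[^\w\s]")  # New | York | - | based | firm

# "York-based" is one token in `coarse` and three in `fine`; the offsets
# still align with token boundaries in `fine`, so the copy succeeds.
print(copy_spans([(4, 14)], fine))

# "New York" ends in the middle of the coarse token "York-based",
# so the same offsets are rejected when copying in the other direction.
try:
    copy_spans([(0, 8)], coarse)
except ValueError as err:
    print("rejected:", err)
```

As in the commit description, nothing in this sketch compares the two
texts; only the boundary alignment in the target doc is checked.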

* Undo change in test_beam_overfitting_IO
2023-06-01 19:19:17 +02:00
cli Add scorer option to return per-component scores (#12540) 2023-05-12 15:36:54 +02:00
displacy Allow passing a Span to displacy.parse_deps (#12477) 2023-03-31 09:44:01 +02:00
kb rely on is_empty property instead of __len__ (#12347) 2023-03-01 12:06:07 +01:00
lang Malay language support (#12602) 2023-05-17 12:45:21 +02:00
matcher perf(REL_OP): Replace some token.children with token.rights or token.lefts (#12528) 2023-04-17 13:16:34 +02:00
ml Spancat speed improvement (#12577) 2023-04-27 15:27:13 +02:00
pipeline Fix pickle for ngram suggester (#12486) 2023-03-31 13:43:51 +02:00
tests Require that all SpanGroup spans are from the current doc (#12569) 2023-06-01 19:19:17 +02:00
tokens Require that all SpanGroup spans are from the current doc (#12569) 2023-06-01 19:19:17 +02:00
training In initialize only calculate current vectors hash if needed (#12607) 2023-05-08 16:51:58 +02:00
__init__.pxd
__init__.py Simplify and clarify enable/disable behavior of spacy.load() (#11459) 2022-09-27 14:22:36 +02:00
__main__.py
about.py Set version to v3.6.0.dev0 2023-05-08 09:10:35 +02:00
attrs.pxd
attrs.pyx Intify IOB (#9738) 2022-01-20 13:19:38 +01:00
compat.py Custom component types in spacy.ty (#9469) 2021-10-21 15:31:06 +02:00
default_config.cfg Add `training.before_update` callback (#11739) 2022-11-23 17:54:58 +01:00
default_config_pretraining.cfg Add new parameter for saving every n epoch in pretraining (#8912) 2021-08-12 11:14:48 +02:00
errors.py Require that all SpanGroup spans are from the current doc (#12569) 2023-06-01 19:19:17 +02:00
glossary.py Add glossary entry for root (#10821) 2022-05-20 09:56:32 +02:00
language.py Add scorer option to return per-component scores (#12540) 2023-05-12 15:36:54 +02:00
lexeme.pxd
lexeme.pyi fix types (#12365) 2023-03-07 13:29:08 +01:00
lexeme.pyx fix types (#12365) 2023-03-07 13:29:08 +01:00
lookups.py Fix issues for Mypy 0.950 and Pydantic 1.9.0 (#10786) 2022-05-25 09:33:54 +02:00
morphology.pxd Clean up Morphology imports and definitions (#7441) 2021-04-26 16:54:23 +02:00
morphology.pyx Clean up Morphology imports and definitions (#7441) 2021-04-26 16:54:23 +02:00
parts_of_speech.pxd
parts_of_speech.pyx
pipe_analysis.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00
py.typed Add py.typed 2021-03-16 09:48:31 +01:00
schemas.py Auto-format code with black (#12100) 2023-01-13 10:12:10 +01:00
scorer.py Add scorer option to return per-component scores (#12540) 2023-05-12 15:36:54 +02:00
strings.pxd `StringStore`-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
strings.pyi Fix StringStore.__getitem__ return type depending on parameter types (#10741) 2022-05-03 17:57:07 +02:00
strings.pyx `StringStore`-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
structs.pxd
symbols.pxd
symbols.pyx
tokenizer.pxd Add tokenizer option to allow Matcher handling for all rules (#10452) 2022-03-24 13:21:32 +01:00
tokenizer.pyx Load exceptions last in Tokenizer.from_bytes (#12553) 2023-04-20 11:30:34 +02:00
ty.py Custom component types in spacy.ty (#9469) 2021-10-21 15:31:06 +02:00
typedefs.pxd
typedefs.pyx
util.py Add tests for projects to master (#12303) 2023-02-23 10:22:57 +01:00
vectors.pyx Add equality definition for vectors (#11806) 2022-11-16 09:44:42 +01:00
vocab.pxd Add support for floret vectors (#8909) 2021-10-27 14:08:31 +02:00
vocab.pyi Add vector deduplication (#10551) 2022-03-30 08:54:23 +02:00
vocab.pyx fix comparison of constants (#11834) 2022-11-21 08:12:03 +01:00