mirror of https://github.com/explosion/spaCy.git
Adjust formatting [ci skip]
This commit is contained in:
parent
02a44c5be2
commit
abd5c06374
|
@ -38,7 +38,7 @@ be shown.
|
||||||
| Name | Type | Description |
|
| Name | Type | Description |
|
||||||
| --------------------------------------- | --------------- | ------------------------------------------------------------------------------------------- |
|
| --------------------------------------- | --------------- | ------------------------------------------------------------------------------------------- |
|
||||||
| `vocab` | `Vocab` | The vocabulary object, which must be shared with the documents the matcher will operate on. |
|
| `vocab` | `Vocab` | The vocabulary object, which must be shared with the documents the matcher will operate on. |
|
||||||
| `max_length` | int | Deprecated argument - the `PhraseMatcher` does not have a phrase length limit anymore. |
|
| `max_length` | int | Deprecated argument - the `PhraseMatcher` does not have a phrase length limit anymore. |
|
||||||
| `attr` <Tag variant="new">2.1</Tag> | int / unicode | The token attribute to match on. Defaults to `ORTH`, i.e. the verbatim token text. |
|
| `attr` <Tag variant="new">2.1</Tag> | int / unicode | The token attribute to match on. Defaults to `ORTH`, i.e. the verbatim token text. |
|
||||||
| `validate` <Tag variant="new">2.1</Tag> | bool | Validate patterns added to the matcher. |
|
| `validate` <Tag variant="new">2.1</Tag> | bool | Validate patterns added to the matcher. |
|
||||||
| **RETURNS** | `PhraseMatcher` | The newly constructed object. |
|
| **RETURNS** | `PhraseMatcher` | The newly constructed object. |
|
||||||
|
@ -72,15 +72,16 @@ Find all token sequences matching the supplied patterns on the `Doc`.
|
||||||
|
|
||||||
<Infobox title="Note on retrieving the string representation of the match_id" variant="warning">
|
<Infobox title="Note on retrieving the string representation of the match_id" variant="warning">
|
||||||
|
|
||||||
Because spaCy stores all strings as integers, the match_id you get back will be an integer, too – but you can always get the string representation by looking it up in the vocabulary's StringStore, i.e. nlp.vocab.strings:
|
Because spaCy stores all strings as integers, the `match_id` you get back will
|
||||||
|
be an integer, too – but you can always get the string representation by looking
|
||||||
|
it up in the vocabulary's `StringStore`, i.e. `nlp.vocab.strings`:
|
||||||
|
|
||||||
```
|
```python
|
||||||
match_id_string = nlp.vocab.strings[match_id]
|
match_id_string = nlp.vocab.strings[match_id]
|
||||||
```
|
```
|
||||||
|
|
||||||
</Infobox>
|
</Infobox>
|
||||||
|
|
||||||
|
|
||||||
## PhraseMatcher.pipe {#pipe tag="method"}
|
## PhraseMatcher.pipe {#pipe tag="method"}
|
||||||
|
|
||||||
Match a stream of documents, yielding them in turn.
|
Match a stream of documents, yielding them in turn.
|
||||||
|
|
Loading…
Reference in New Issue