diff --git a/website/docs/api/phrasematcher.md b/website/docs/api/phrasematcher.md index 4119c8fc0..a72277420 100644 --- a/website/docs/api/phrasematcher.md +++ b/website/docs/api/phrasematcher.md @@ -38,7 +38,7 @@ be shown. | Name | Type | Description | | --------------------------------------- | --------------- | ------------------------------------------------------------------------------------------- | | `vocab` | `Vocab` | The vocabulary object, which must be shared with the documents the matcher will operate on. | -| `max_length` | int | Deprecated argument - the `PhraseMatcher` does not have a phrase length limit anymore. | +| `max_length` | int | Deprecated argument - the `PhraseMatcher` does not have a phrase length limit anymore. | | `attr` 2.1 | int / unicode | The token attribute to match on. Defaults to `ORTH`, i.e. the verbatim token text. | | `validate` 2.1 | bool | Validate patterns added to the matcher. | | **RETURNS** | `PhraseMatcher` | The newly constructed object. | @@ -72,15 +72,16 @@ Find all token sequences matching the supplied patterns on the `Doc`. -Because spaCy stores all strings as integers, the match_id you get back will be an integer, too – but you can always get the string representation by looking it up in the vocabulary's StringStore, i.e. nlp.vocab.strings: +Because spaCy stores all strings as integers, the `match_id` you get back will +be an integer, too – but you can always get the string representation by looking +it up in the vocabulary's `StringStore`, i.e. `nlp.vocab.strings`: -``` +```python match_id_string = nlp.vocab.strings[match_id] ``` - ## PhraseMatcher.pipe {#pipe tag="method"} Match a stream of documents, yielding them in turn.