---
title: Model Architectures
teaser: Pre-defined model architectures included with the core library
source: spacy/ml/models
menu:
- ['Tok2Vec', 'tok2vec']
- ['Transformers', 'transformers']
- ['Parser & NER', 'parser']
- ['Tagging', 'tagger']
- ['Text Classification', 'textcat']
- ['Entity Linking', 'entitylinker']
---
A **model architecture** is a function that wires up a
[`Model`](https://thinc.ai/docs/api-model) instance, which you can then use in a
pipeline component or as a layer of a larger network. The architectures below
are registered in the [`registry`](/api/top-level#registry) and can be
referenced in your training config via the `@architectures` key. You can also
define your own custom networks, as described in the usage guide on
[custom models](/usage/training#custom-models).
## Tok2Vec architectures {#tok2vec source="spacy/ml/models/tok2vec.py"}
### spacy.HashEmbedCNN.v1 {#HashEmbedCNN}
Build spaCy's standard `Tok2Vec` layer: tokens are embedded using hash
embedding tables (optionally with subword features and pretrained vectors) and
encoded with a convolutional network using maxout activations.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.HashEmbedCNN.v1"
> pretrained_vectors = null
> width = 96
> depth = 4
> embed_size = 2000
> window_size = 1
> maxout_pieces = 3
> subword_features = true
> dropout = null
> ```
| Name                 | Type  | Description                                                                              |
| -------------------- | ----- | ---------------------------------------------------------------------------------------- |
| `width`              | int   | The width of the vectors produced for each token, and the output width of the layer.    |
| `depth`              | int   | The number of convolutional layers.                                                      |
| `embed_size`         | int   | The number of rows in the hash embedding tables.                                         |
| `window_size`        | int   | The number of tokens on either side to concatenate during the convolutions.              |
| `maxout_pieces`      | int   | The number of pieces to use in the maxout non-linearity.                                 |
| `subword_features`   | bool  | Whether to also embed subword features, specifically the prefix, suffix and word shape.  |
| `dropout`            | float | The dropout rate.                                                                        |
| `pretrained_vectors` | bool  | Whether to also use static vectors.                                                      |
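
Architectures like this one are normally referenced from the training config,
but you can also look up the registered function and call it directly, e.g. to
inspect the resulting Thinc model. A minimal sketch, reusing the parameter
values from the example config above:

```python
from spacy.util import registry

# Look up the registered architecture function by name.
create_tok2vec = registry.architectures.get("spacy.HashEmbedCNN.v1")

# Calling it returns a Thinc Model that maps a batch of Docs to
# one array of token vectors per Doc.
tok2vec = create_tok2vec(
    width=96,
    depth=4,
    embed_size=2000,
    window_size=1,
    maxout_pieces=3,
    subword_features=True,
    dropout=None,
    pretrained_vectors=None,
)
print(tok2vec.name)
```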
### spacy.HashCharEmbedCNN.v1 {#HashCharEmbedCNN}
### spacy.HashCharEmbedBiLSTM.v1 {#HashCharEmbedBiLSTM}
## Transformer architectures {#transformers source="github.com/explosion/spacy-transformers/blob/master/spacy_transformers/architectures.py"}
The following architectures are provided by the package
[`spacy-transformers`](https://github.com/explosion/spacy-transformers). See the
[usage documentation](/usage/transformers) for how to integrate the
architectures into your training config.
### spacy-transformers.TransformerModel.v1 {#TransformerModel}
Load and wrap a transformer model from the
[HuggingFace `transformers`](https://huggingface.co/transformers) library.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy-transformers.TransformerModel.v1"
> name = "roberta-base"
> tokenizer_config = {"use_fast": true}
>
> [model.get_spans]
> @span_getters = "strided_spans.v1"
> window = 128
> stride = 96
> ```
| Name               | Type             | Description                                                                                                                                                                                                              |
| ------------------ | ---------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| `name`             | str              | Any model name that can be loaded by [`transformers.AutoModel`](https://huggingface.co/transformers/model_doc/auto.html#transformers.AutoModel).                                                                         |
| `get_spans`        | `Callable`       | Function that takes a batch of [`Doc`](/api/doc) objects and returns lists of [`Span`](/api/span) objects to be processed by the transformer. [See here](/api/transformer#span_getters) for built-in options and examples. |
| `tokenizer_config` | `Dict[str, Any]` | Tokenizer settings passed to [`transformers.AutoTokenizer`](https://huggingface.co/transformers/model_doc/auto.html#transformers.AutoTokenizer).                                                                         |
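
For example, the `strided_spans.v1` getter in the config above processes each
document in spans of `window = 128` tokens, moving forward `stride = 96` tokens
at a time. Setting the stride lower than the window makes the spans overlap, so
tokens near a span boundary still see context on both sides.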
### spacy-transformers.Tok2VecListener.v1 {#Tok2VecListener}
Create a listener layer that connects to a [`Transformer`](/api/transformer)
component earlier in the pipeline, so that downstream components can reuse its
output as their `Tok2Vec` layer instead of running a transformer of their own.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy-transformers.Tok2VecListener.v1"
> grad_factor = 1.0
>
> [model.pooling]
> @layers = "reduce_mean.v1"
> ```
| Name | Type | Description |
| ------------- | ------------------------- | ---------------------------------------------------------------------------------------------- |
| `grad_factor` | float | Factor for weighting the gradient if multiple components listen to the same transformer model. |
| `pooling` | `Model[Ragged, Floats2d]` | Pooling layer to determine how the vector for each spaCy token will be computed. |
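
Since a single spaCy token may be aligned to several wordpieces, the pooling
layer reduces each token's block of wordpiece vectors to a single vector. With
`reduce_mean.v1` as in the example above, each token's vector is the mean of
the vectors of its aligned wordpieces.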
## Parser & NER architectures {#parser source="spacy/ml/models/parser.py"}
### spacy.TransitionBasedParser.v1 {#TransitionBasedParser}
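Build a transition-based parser model, which can be used for dependency parsing
or named entity recognition. In transition-based parsing, predicting the
structure is mapped to a series of state transitions, and the model predicts
which transition to take next given the current state.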
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.TransitionBasedParser.v1"
> nr_feature_tokens = 6
> hidden_width = 64
> maxout_pieces = 2
>
> [model.tok2vec]
> # ...
> ```
| Name                | Type                                       | Description                                                                          |
| ------------------- | ------------------------------------------ | ------------------------------------------------------------------------------------ |
| `tok2vec`           | [`Model`](https://thinc.ai/docs/api-model) | Subnetwork to map tokens into vector representations.                                |
| `nr_feature_tokens` | int                                        | The number of tokens from the parser state used as features to predict the next transition. |
| `hidden_width`      | int                                        | The width of the hidden layer.                                                       |
| `maxout_pieces`     | int                                        | How many pieces to use in the state prediction layer's maxout non-linearity.         |
| `use_upper`         | bool                                       | Whether to use an additional hidden layer after the state vector.                    |
| `nO`                | int                                        | Output dimension, determined by the number of possible transition actions.           |
## Tagging architectures {#tagger source="spacy/ml/models/tagger.py"}
### spacy.Tagger.v1 {#Tagger}
Build a tagger model, using the provided token-to-vector component. The tagger
model simply adds a linear layer with softmax activation to predict scores
given the token vectors.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.Tagger.v1"
> nO = null
>
> [model.tok2vec]
> # ...
> ```
| Name      | Type                                       | Description                                                  |
| --------- | ------------------------------------------ | ------------------------------------------------------------ |
| `tok2vec` | [`Model`](https://thinc.ai/docs/api-model) | Subnetwork to map tokens into vector representations.        |
| `nO`      | int                                        | Output dimension, determined by the number of different tags. |
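
As a rough sketch of what this structure looks like in Thinc (an illustration
of the general shape, not the library's exact implementation):

```python
from thinc.api import Model, Softmax, chain, with_array

def tagger_sketch(tok2vec: Model, nO: int) -> Model:
    # Encode each Doc into per-token vectors, then map each token
    # vector to a probability distribution over the nO tags.
    return chain(tok2vec, with_array(Softmax(nO)))
```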
## Text classification architectures {#textcat source="spacy/ml/models/textcat.py"}
A text classification architecture needs to take a `Doc` as input, and produce a
score for each potential label class. Textcat challenges can be binary (e.g.
sentiment analysis) or involve multiple possible labels. In the multi-label
case, the labels can either be mutually exclusive (each example has exactly one
label), or multiple labels may be applicable at the same time.

As the properties of text classification problems can vary widely, we provide
several different built-in architectures. It is recommended to experiment with
different architectures and settings to determine what works best on your
specific data and challenge.
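
In Thinc terms, such an architecture is a model that maps a batch of
[`Doc`](/api/doc) objects to a score matrix with one row per document. A
minimal sketch of the general shape, using a `tok2vec` subnetwork, mean pooling
and a softmax output (an illustration, not one of the built-in architectures):

```python
from typing import List
from thinc.api import Model, Softmax, chain, list2ragged, reduce_mean
from thinc.types import Floats2d
from spacy.tokens import Doc

def textcat_sketch(tok2vec: Model, nO: int) -> Model[List[Doc], Floats2d]:
    # Embed each Doc into per-token vectors, average them into one
    # vector per Doc, and classify that vector into nO label scores.
    return chain(tok2vec, list2ragged(), reduce_mean(), Softmax(nO))
```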
### spacy.TextCatEnsemble.v1 {#TextCatEnsemble}
Stacked ensemble of a bag-of-words model and a neural network model. The neural
network has an internal CNN Tok2Vec layer and uses attention.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.TextCatEnsemble.v1"
> exclusive_classes = false
> pretrained_vectors = null
> width = 64
> embed_size = 2000
> conv_depth = 2
> window_size = 1
> ngram_size = 1
> dropout = null
> nO = null
> ```
| Name | Type | Description |
| -------------------- | ----- | ------------------------------------------------------------------------------------------------------------------------------------------- |
| `exclusive_classes` | bool | Whether or not categories are mutually exclusive. |
| `pretrained_vectors` | bool | Whether or not pretrained vectors will be used in addition to the feature vectors. |
| `width` | int | Output dimension of the feature encoding step. |
| `embed_size` | int | Input dimension of the feature encoding step. |
| `conv_depth` | int | Depth of the Tok2Vec layer. |
| `window_size`        | int   | The number of contextual vectors to [concatenate](https://thinc.ai/docs/api-layers#expand_window) from the left and from the right.          |
| `ngram_size`         | int   | Determines the maximum length of the n-grams in the BOW model. For instance, `ngram_size=3` would give unigram, bigram and trigram features. |
| `dropout` | float | The dropout rate. |
| `nO` | int | Output dimension, determined by the number of different labels. |
If the `nO` dimension is not set, the TextCategorizer component will set it when
`begin_training` is called.
### spacy.TextCatCNN.v1 {#TextCatCNN}
A neural network model where token vectors are calculated using a CNN. The
vectors are mean pooled and used as features in a feed-forward network. This
architecture is usually less accurate than the ensemble, but runs faster.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.TextCatCNN.v1"
> exclusive_classes = false
> nO = null
>
> [model.tok2vec]
> @architectures = "spacy.HashEmbedCNN.v1"
> pretrained_vectors = null
> width = 96
> depth = 4
> embed_size = 2000
> window_size = 1
> maxout_pieces = 3
> subword_features = true
> dropout = null
> ```
| Name | Type | Description |
| ------------------- | ------------------------------------------ | --------------------------------------------------------------- |
| `exclusive_classes` | bool | Whether or not categories are mutually exclusive. |
| `tok2vec`           | [`Model`](https://thinc.ai/docs/api-model) | The [`tok2vec`](#tok2vec) layer of the model.                    |
| `nO` | int | Output dimension, determined by the number of different labels. |
If the `nO` dimension is not set, the TextCategorizer component will set it when
`begin_training` is called.
### spacy.TextCatBOW.v1 {#TextCatBOW}
An n-gram "bag-of-words" model. This architecture should run much faster than the
others, but may not be as accurate, especially if texts are short.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.TextCatBOW.v1"
> exclusive_classes = false
> ngram_size = 1
> no_output_layer = false
> nO = null
> ```
| Name | Type | Description |
| ------------------- | ----- | ------------------------------------------------------------------------------------------------------------------------------------------- |
| `exclusive_classes` | bool | Whether or not categories are mutually exclusive. |
| `ngram_size`        | int   | Determines the maximum length of the n-grams in the BOW model. For instance, `ngram_size=3` would give unigram, bigram and trigram features. |
| `no_output_layer`   | bool  | Whether or not to add an output layer to the model (`Softmax` activation if `exclusive_classes=True`, else `Logistic`).                      |
| `nO` | int | Output dimension, determined by the number of different labels. |
If the `nO` dimension is not set, the TextCategorizer component will set it when
`begin_training` is called.
### spacy.TextCatLowData.v1 {#TextCatLowData}
## Entity linking architectures {#entitylinker source="spacy/ml/models/entity_linker.py"}
An `EntityLinker` component disambiguates textual mentions (tagged as named
entities) to unique identifiers, grounding the named entities into the "real
world". This requires three main components:

- A [`KnowledgeBase`](/api/kb) (KB) holding the unique identifiers, potential
  synonyms and prior probabilities.
- A candidate generation step to produce a set of likely identifiers, given a
  certain textual mention.
- A machine learning [`Model`](https://thinc.ai/docs/api-model) that picks the
  most plausible ID from the set of candidates.
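
A minimal sketch of how the first two pieces fit together, using a toy
knowledge base (the entity ID `"Q42"` and the numbers here are made up for
illustration):

```python
from spacy.kb import KnowledgeBase
from spacy.vocab import Vocab

vocab = Vocab()
kb = KnowledgeBase(vocab=vocab, entity_vector_length=64)

# Register an entity with a frequency and an encoding vector.
kb.add_entity(entity="Q42", freq=12, entity_vector=[0.0] * 64)

# Register an alias with prior probabilities over its candidate entities.
kb.add_alias(alias="Douglas Adams", entities=["Q42"], probabilities=[0.9])

# Candidate generation: retrieve plausible identifiers for a mention text.
candidates = kb.get_candidates("Douglas Adams")
```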
### spacy.EntityLinker.v1 {#EntityLinker}
The `EntityLinker` model architecture is a Thinc `Model` with a
[`Linear`](https://thinc.ai/docs/api-layers#linear) output layer.
> #### Example Config
>
> ```ini
> [model]
> @architectures = "spacy.EntityLinker.v1"
> nO = null
>
> [model.tok2vec]
> @architectures = "spacy.HashEmbedCNN.v1"
> pretrained_vectors = null
> width = 96
> depth = 2
> embed_size = 300
> window_size = 1
> maxout_pieces = 3
> subword_features = true
> dropout = null
>
> [kb_loader]
> @assets = "spacy.EmptyKB.v1"
> entity_vector_length = 64
>
> [get_candidates]
> @assets = "spacy.CandidateGenerator.v1"
> ```
| Name      | Type                                       | Description                                                                                |
| --------- | ------------------------------------------ | ------------------------------------------------------------------------------------------ |
| `tok2vec` | [`Model`](https://thinc.ai/docs/api-model) | The [`tok2vec`](#tok2vec) layer of the model.                                              |
| `nO`      | int                                        | Output dimension, determined by the length of the vectors encoding each entity in the KB. |
If the `nO` dimension is not set, the `EntityLinker` component will set it when
`begin_training` is called.
### spacy.EmptyKB.v1 {#EmptyKB}
A function that creates a default, empty `KnowledgeBase` from a
[`Vocab`](/api/vocab) instance.
| Name                   | Type | Description                                                                 |
| ---------------------- | ---- | --------------------------------------------------------------------------- |
| `entity_vector_length` | int  | The length of the vectors encoding each entity in the KB. Defaults to `64`. |
### spacy.CandidateGenerator.v1 {#CandidateGenerator}
A function that takes as input a [`KnowledgeBase`](/api/kb) and a
[`Span`](/api/span) object denoting a named entity, and returns a list of
plausible [`Candidate` objects](/api/kb/#candidate_init).
The default `CandidateGenerator` simply uses the text of a mention to find its
potential aliases in the `KnowledgeBase`. Note that this function is
case-sensitive.
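
For example, with the toy knowledge base from the sketch earlier in this
section, only the exact alias string that was registered will match:

```python
# "Douglas Adams" was registered as an alias, so it yields candidates...
print(len(kb.get_candidates("Douglas Adams")))  # 1
# ...while a differently-cased mention finds no aliases in the KB.
print(len(kb.get_candidates("douglas adams")))  # 0
```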