spaCy/website/docs/usage/_benchmarks-models.md

44 lines
2.5 KiB
Markdown
Raw Normal View History

2020-09-12 15:05:10 +00:00
import { Help } from 'components/typography'; import Link from 'components/link'
<!-- TODO: update, add project template -->
<figure>
2020-09-23 20:02:31 +00:00
| System | Parser | Tagger | NER | WPS<br />CPU <Help>words per second on CPU, higher is better</Help> | WPS<br/>GPU <Help>words per second on GPU, higher is better</Help> |
| ---------------------------------------------------------- | -----: | -----: | ---: | ------------------------------------------------------------------: | -----------------------------------------------------------------: |
| [`en_core_web_trf`](/models/en#en_core_web_trf) (spaCy v3) | | | | | 6k |
| [`en_core_web_lg`](/models/en#en_core_web_lg) (spaCy v3) | | | | | |
| `en_core_web_lg` (spaCy v2) | 91.9 | 97.2 | 85.9 | 10k | |
2020-09-12 15:05:10 +00:00
<figcaption class="caption">
**Accuracy and speed on the
2020-09-23 20:02:31 +00:00
[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) corpus.**
2020-09-12 15:05:10 +00:00
</figcaption>
</figure>
<figure>
2020-09-23 20:02:31 +00:00
| Named Entity Recognition Model | OntoNotes | CoNLL '03 |
| ------------------------------------------------------------------------------ | --------: | --------- |
| spaCy RoBERTa (2020) |
| spaCy CNN (2020) | |
| spaCy CNN (2017) | 86.4 |
| [Stanza](https://stanfordnlp.github.io/stanza/) (StanfordNLP)<sup>1</sup> | 88.8 |
| <Link to="https://github.com/flairNLP/flair" hideIcon>Flair</Link><sup>2</sup> | 89.7 |
2020-09-12 15:05:10 +00:00
<figcaption class="caption">
2020-09-23 20:02:31 +00:00
**Named entity recognition accuracy** on the
[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) and
[CoNLL-2003](https://www.aclweb.org/anthology/W03-0419.pdf) corpora. See
[NLP-progress](http://nlpprogress.com/english/named_entity_recognition.html) for
more results. **1. ** [Qi et al. (2020)](https://arxiv.org/pdf/2003.07082.pdf).
**2. ** [Akbik et al. (2018)](https://www.aclweb.org/anthology/C18-1139/)
2020-09-12 15:05:10 +00:00
</figcaption>
</figure>