mirror of https://github.com/explosion/spaCy.git
Remove unfinished sentence
parent 26a90f011b
commit 36e3532475
@@ -622,13 +622,13 @@ categorizer is to use the [`spacy train`](/api/cli#train) command-line utility.
 In order to use this, you'll need training and evaluation data in the
 [JSON format](/api/annotation#json-input) spaCy expects for training.
 
-You can now train the model using a corpus for your language annotated with If
-your data is in one of the supported formats, the easiest solution might be to
-use the [`spacy convert`](/api/cli#convert) command-line utility. This supports
-several popular formats, including the IOB format for named entity recognition,
-the JSONL format produced by our annotation tool [Prodigy](https://prodi.gy),
-and the [CoNLL-U](http://universaldependencies.org/docs/format.html) format used
-by the [Universal Dependencies](http://universaldependencies.org/) corpus.
+If your data is in one of the supported formats, the easiest solution might be
+to use the [`spacy convert`](/api/cli#convert) command-line utility. This
+supports several popular formats, including the IOB format for named entity
+recognition, the JSONL format produced by our annotation tool
+[Prodigy](https://prodi.gy), and the
+[CoNLL-U](http://universaldependencies.org/docs/format.html) format used by the
+[Universal Dependencies](http://universaldependencies.org/) corpus.
 
 One thing to keep in mind is that spaCy expects to train its models from **whole
 documents**, not just single sentences. If your corpus only contains single