From 68b85ea950492e4f83d9b1552806ab4a9631236e Mon Sep 17 00:00:00 2001 From: Sofie Van Landeghem Date: Fri, 26 Jan 2024 12:10:05 +0100 Subject: [PATCH] Clarify data_path loading for apply CLI command (#13272) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit * attempt to clarify additional annotations on .spacy file * suggestion by Daniƫl * pipeline instead of pipe --- website/docs/api/cli.mdx | 3 +++ 1 file changed, 3 insertions(+) diff --git a/website/docs/api/cli.mdx b/website/docs/api/cli.mdx index 51cae960b..db91e1062 100644 --- a/website/docs/api/cli.mdx +++ b/website/docs/api/cli.mdx @@ -1296,6 +1296,9 @@ input formats are: When a directory is provided it is traversed recursively to collect all files. +When loading a .spacy file, any potential annotations stored on the `Doc` that are not overwritten by the pipeline will be preserved. +If you want to evaluate the pipeline on raw text only, make sure that the .spacy file does not contain any annotations. + ```bash $ python -m spacy apply [model] [data-path] [output-file] [--code] [--text-key] [--force-overwrite] [--gpu-id] [--batch-size] [--n-process] ```