Update transformers page
parent 8e5f99ee25
commit be07567ac6
@@ -29,34 +29,16 @@

We recommend an NVIDIA GPU with at least 10GB of memory in order to work with
transformer models. The exact requirements will depend on the transformer
model you choose and whether you're training the pipeline or simply running
it. Training a transformer-based model without a GPU will be too slow for most
practical purposes. You'll also need to make sure your GPU drivers are up to
date and v9+ of the CUDA runtime is installed.
Once you have CUDA installed, you'll need to install two pip packages, `cupy`
and `spacy-transformers`. [CuPy](https://docs.cupy.dev/en/stable/install.html)
is just like `numpy`, but for GPU. The best way to install it is to choose a
wheel that matches the version of CUDA you're using. You may also need to set
the `CUDA_PATH` environment variable if your CUDA runtime is installed in a
non-standard location. Putting it all together, if you had installed CUDA 10.2
in `/opt/nvidia/cuda`, you would run:
```
export CUDA_PATH="/opt/nvidia/cuda"
pip install cupy-cuda102
pip install spacy-transformers
```
Provisioning a new machine will require about 5GB of data to be downloaded in
total: 3GB for the CUDA runtime, 800MB for PyTorch, 400MB for CuPy, 500MB for
the transformer weights, and about 200MB for spaCy and its various
requirements.
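
Once everything is installed, a quick sanity check is to ask spaCy whether it
can allocate on the GPU. This is a minimal sketch, assuming spaCy itself is
already installed alongside the packages above:

```
import spacy

# Returns True if cupy and the CUDA runtime are set up correctly and a
# GPU is visible; False means spaCy will fall back to the CPU.
print(spacy.prefer_gpu())
```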

## Runtime usage {#runtime}

Transformer models can be used as **drop-in replacements** for other types of
neural networks, so your spaCy pipeline can include them in a way that's
completely invisible to the user.
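For example, loading a transformer-based pipeline looks exactly like loading
any other trained pipeline. The snippet below uses the publicly available
`en_core_web_trf` package purely as an illustration:

```
import spacy

# Requires the pipeline package to be installed first, e.g.:
#   python -m spacy download en_core_web_trf
nlp = spacy.load("en_core_web_trf")
doc = nlp("Apple is looking at buying U.K. startup for $1 billion.")
print([(ent.text, ent.label_) for ent in doc.ents])
```
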
@@ -306,8 +292,6 @@

…averages the wordpiece rows. We could instead use `reduce_last`,
[`reduce_max`](https://thinc.ai/docs/api-layers#reduce_max), or a custom
function you write yourself.
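
As a sketch of what that looks like in the config, the pooling layer is
swapped by pointing the `pooling` section at a different registered layer.
The component path below is an assumption for illustration (a listener under
a `ner` component); adjust it to your own pipeline:

```
from thinc.api import Config

# Illustrative config fragment: pool wordpiece rows with reduce_max
# instead of the default reduce_mean.
cfg = Config().from_str("""
[components.ner.model.tok2vec.pooling]
@layers = "reduce_max.v1"
""")
```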
You can have multiple components all listening to the same transformer model,
and all passing gradients back to it. By default, all of the gradients will be
**equally weighted**. You can control this with the `grad_factor` setting,
which lets you reweight the gradients from the different listeners.
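
Concretely, `grad_factor` is set on each listener. The section names below
are assumptions matching the sketch above, not taken from this page:

```
from thinc.api import Config

# Illustrative listener config: grad_factor scales the gradient this
# component passes back to the shared transformer (0.0 would disable it).
cfg = Config().from_str("""
[components.ner.model.tok2vec]
@architectures = "spacy-transformers.TransformerListener.v1"
grad_factor = 0.5

[components.ner.model.tok2vec.pooling]
@layers = "reduce_mean.v1"
""")
```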