lightning/docs/source-pytorch/accelerators/gpu_basic.rst

:orphan:

.. _gpu_basic:

GPU training (Basic)
====================
**Audience:** Users looking to save money and run large models faster using single or multiple

----

What is a GPU?
--------------
A Graphics Processing Unit (GPU), is a specialized hardware accelerator designed to speed up mathematical computations used in gaming and deep learning.

----

Train on 1 GPU
--------------

Make sure you're running on a machine with at least one GPU. There's no need to specify any NVIDIA flags
as Lightning will do it for you.

.. testcode::
    :skipif: torch.cuda.device_count() < 1

    trainer = Trainer(accelerator="gpu", devices=1)

----------------


.. _multi_gpu:

Train on multiple GPUs
----------------------

To use multiple GPUs, set the number of devices in the Trainer or the index of the GPUs.

.. code::

    trainer = Trainer(accelerator="gpu", devices=4)

Choosing GPU devices
^^^^^^^^^^^^^^^^^^^^

You can select the GPU devices using ranges, a list of indices or a string containing
a comma separated list of GPU ids:

.. testsetup::

    k = 1

.. testcode::
    :skipif: torch.cuda.device_count() < 2

    # DEFAULT (int) specifies how many GPUs to use per node
    Trainer(accelerator="gpu", devices=k)

    # Above is equivalent to
    Trainer(accelerator="gpu", devices=list(range(k)))

    # Specify which GPUs to use (don't use when running on cluster)
    Trainer(accelerator="gpu", devices=[0, 1])

    # Equivalent using a string
    Trainer(accelerator="gpu", devices="0, 1")

    # To use all available GPUs put -1 or '-1'
    # equivalent to list(range(torch.cuda.device_count()))
    Trainer(accelerator="gpu", devices=-1)

The table below lists examples of possible input formats and how they are interpreted by Lightning.

+------------------+-----------+---------------------+---------------------------------+
| `devices`        | Type      | Parsed              | Meaning                         |
+==================+===========+=====================+=================================+
| 3                | int       | [0, 1, 2]           | first 3 GPUs                    |
+------------------+-----------+---------------------+---------------------------------+
| -1               | int       | [0, 1, 2, ...]      | all available GPUs              |
+------------------+-----------+---------------------+---------------------------------+
| [0]              | list      | [0]                 | GPU 0                           |
+------------------+-----------+---------------------+---------------------------------+
| [1, 3]           | list      | [1, 3]              | GPUs 1 and 3                    |
+------------------+-----------+---------------------+---------------------------------+
| "3"              | str       | [0, 1, 2]           | first 3 GPUs                    |
+------------------+-----------+---------------------+---------------------------------+
| "1, 3"           | str       | [1, 3]              | GPUs 1 and 3                    |
+------------------+-----------+---------------------+---------------------------------+
| "-1"             | str       | [0, 1, 2, ...]      | all available GPUs              |
+------------------+-----------+---------------------+---------------------------------+

.. note::

    When specifying number of ``devices`` as an integer ``devices=k``, setting the trainer flag
    ``auto_select_gpus=True`` will automatically help you find ``k`` GPUs that are not
    occupied by other processes. This is especially useful when GPUs are configured
    to be in "exclusive mode", such that only one process at a time can access them.
    For more details see the :doc:`trainer guide <../common/trainer>`.
docs refactor 3/n (#12795) * updated titles + css * updated titles + css * levels structure * levels structure * levels structure * adding level indexes * finished intro guide layout * finished intro guide layout * general titles * general titles * added movie * added movie * finished 15 mins * levels * added core levels * added core levels * fixed api reference on the left * gpu guides * gpu guides * gpu guides * gpu guides * precision * hpu guide * added ipu * added ipu * added ipu * added ckpt docs * finished basic logging * intermediate * intermediate * intermediate * fixed * fixed margins * fixed margins * fixed margins * fixed margins * fixed margins * fixed margins * fixed margins * fixed margins * fixed margins * added logger stuff * added logger stuff * added logger stuff * added logger stuff * added logger stuff * ic * added inconsolata * added inconsolata * added inconsolata * added inconsolata * added inconsolata * added inconsolata * added inconsolata * updated menu * added basic cloud docs * added basic cloud docs * added basic cloud docs * added basic cloud docs * ic * ic * ic * ic * ic * ic * ic * ic * ic * ic * ic * ic * added demos folder * added demos folder * added demos folder * added demos folder * added demos folder * added demos folder * twocolumns directive * twocols * twocols * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * registry * cleaning up * cleaning up * cleaning up * cleaning up * cleaning up * cleaning up * cleaning up * cleaning up * cleaning up * updated titles + css * levels structure * adding level indexes * finished intro guide layout * general titles * added movie * finished 15 mins * levels * added core levels * fixed api reference on the left * gpu guides * precision * hpu guide * added ipu * added ckpt docs * finished basic logging * intermediate * fixed margins * added logger stuff * ic * added inconsolata * updated menu * added basic cloud docs * ic * added demos folder * twocolumns directive * registry * cleaning up * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * deconflict * deconflict * deconflict * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add testsetup sections wherever needed; fix errors in building docs * pre-commit fixes * Fix duplicate label * minor nit with pre-commit * Fix labels * More changes... * require * debug & cli * prec & model & visu * fix references * fix references * fix refs * fix refs - model_parallel * fix references * prune testsetup with global * refs in index * Fix duplicate label errors * Update orphan docs * Update orphan docs * Update orphan docs * fix links * Fix genindex and search index * fix refs * fix refs * Fix index rst related issues * fix refs * inc to rst * Fix links ref * fix more references * fix refs * deconflict * errors * errors * errors * fix refs * fix refs * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix warnings * Fix LightningCLI errors * Fix LightningCLI errors * Fix LightningCLI errors * Fix LightningCLI errors * fix doc build * Duplicate Label fix (docs) (#12800) Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com> * ignore typing in demo folder * Ignore demos for mypy Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Kushashwa Ravi Shrimali <kushashwaravishrimali@gmail.com> Co-authored-by: Jirka <jirka.borovec@seznam.cz> Co-authored-by: rohitgr7 <rohitgr1998@gmail.com> Co-authored-by: Kaushik B <kaushikbokka@gmail.com> Co-authored-by: otaj <ota@grid.ai> 2022-04-19 18:15:47 +00:00			`:orphan:`

			`.. _gpu_basic:`

			`GPU training (Basic)`
			`====================`
			`Audience: Users looking to save money and run large models faster using single or multiple`

			`----`

			`What is a GPU?`
			`--------------`
			`A Graphics Processing Unit (GPU), is a specialized hardware accelerator designed to speed up mathematical computations used in gaming and deep learning.`

			`----`

			`Train on 1 GPU`
			`--------------`

			`Make sure you're running on a machine with at least one GPU. There's no need to specify any NVIDIA flags`
			`as Lightning will do it for you.`

			`.. testcode::`
			`:skipif: torch.cuda.device_count() < 1`

			`trainer = Trainer(accelerator="gpu", devices=1)`

			`----------------`


			`.. _multi_gpu:`

			`Train on multiple GPUs`
			`----------------------`

			`To use multiple GPUs, set the number of devices in the Trainer or the index of the GPUs.`

			`.. code::`

			`trainer = Trainer(accelerator="gpu", devices=4)`

			`Choosing GPU devices`
			`^^^^^^^^^^^^^^^^^^^^`

			`You can select the GPU devices using ranges, a list of indices or a string containing`
			`a comma separated list of GPU ids:`

			`.. testsetup::`

			`k = 1`

			`.. testcode::`
			`:skipif: torch.cuda.device_count() < 2`

			`# DEFAULT (int) specifies how many GPUs to use per node`
			`Trainer(accelerator="gpu", devices=k)`

			`# Above is equivalent to`
			`Trainer(accelerator="gpu", devices=list(range(k)))`

			`# Specify which GPUs to use (don't use when running on cluster)`
			`Trainer(accelerator="gpu", devices=[0, 1])`

			`# Equivalent using a string`
			`Trainer(accelerator="gpu", devices="0, 1")`

			`# To use all available GPUs put -1 or '-1'`
			`# equivalent to list(range(torch.cuda.device_count()))`
			`Trainer(accelerator="gpu", devices=-1)`

			`The table below lists examples of possible input formats and how they are interpreted by Lightning.`

			`+------------------+-----------+---------------------+---------------------------------+`
			\| `devices` \| Type \| Parsed \| Meaning \|
			`+==================+===========+=====================+=================================+`
			`\| 3 \| int \| [0, 1, 2] \| first 3 GPUs \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| -1 \| int \| [0, 1, 2, ...] \| all available GPUs \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| [0] \| list \| [0] \| GPU 0 \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| [1, 3] \| list \| [1, 3] \| GPUs 1 and 3 \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| "3" \| str \| [0, 1, 2] \| first 3 GPUs \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| "1, 3" \| str \| [1, 3] \| GPUs 1 and 3 \|`
			`+------------------+-----------+---------------------+---------------------------------+`
			`\| "-1" \| str \| [0, 1, 2, ...] \| all available GPUs \|`
			`+------------------+-----------+---------------------+---------------------------------+`

			`.. note::`

			When specifying number of ``devices`` as an integer ``devices=k``, setting the trainer flag
			``auto_select_gpus=True`` will automatically help you find ``k`` GPUs that are not
			`occupied by other processes. This is especially useful when GPUs are configured`
			`to be in "exclusive mode", such that only one process at a time can access them.`
			For more details see the :doc:`trainer guide <../common/trainer>`.