lightning/pl_examples/basic_examples/README.md

61 lines
1.5 KiB
Markdown
Raw Normal View History

## Basic Examples
Use these examples to test how lightning works.
2019-10-05 18:13:55 +00:00
#### MNIST
Trains MNIST where the model is defined inside the `LightningModule`.
2019-10-05 18:13:55 +00:00
```bash
# cpu
python simple_image_classifier.py
2019-10-05 18:13:55 +00:00
# gpus (any number)
python simple_image_classifier.py --trainer.gpus 2
2019-10-05 18:13:55 +00:00
# dataparallel
python simple_image_classifier.py --trainer.gpus 2 --trainer.accelerator 'dp'
```
2019-10-05 18:13:55 +00:00
---
#### MNIST with DALI
The MNIST example above using [NVIDIA DALI](https://developer.nvidia.com/DALI).
Requires NVIDIA DALI to be installed based on your CUDA version, see [here](https://docs.nvidia.com/deeplearning/dali/user-guide/docs/installation.html).
```bash
python dali_image_classifier.py
```
---
#### Image classifier
Generic image classifier with an arbitrary backbone (ie: a simple system)
2019-10-05 18:13:55 +00:00
```bash
# cpu
python backbone_image_classifier.py
2019-10-05 18:13:55 +00:00
# gpus (any number)
python backbone_image_classifier.py --trainer.gpus 2
2019-10-05 18:13:55 +00:00
# dataparallel
python backbone_image_classifier.py --trainer.gpus 2 --trainer.accelerator 'dp'
2019-10-05 18:13:55 +00:00
```
---
#### Autoencoder
Showing the power of a system... arbitrarily complex training loops
2019-10-05 18:13:55 +00:00
```bash
# cpu
python autoencoder.py
# gpus (any number)
python autoencoder.py --trainer.gpus 2
# dataparallel
python autoencoder.py --trainer.gpus 2 --trainer.accelerator 'dp'
```
---
# Multi-node example
This demo launches a job using 2 GPUs on 2 different nodes (4 GPUs total).
To run this demo do the following:
1. Log into the jumphost node of your SLURM-managed cluster.
2. Create a conda environment with Lightning and a GPU PyTorch version.
3. Choose a script to submit