Basic Examples
Use these examples to test how Lightning works.
MNIST
Trains on MNIST, where the model is defined inside the LightningModule.
# cpu
python simple_image_classifier.py
# gpus (any number)
python simple_image_classifier.py --trainer.gpus 2
# dataparallel
python simple_image_classifier.py --trainer.gpus 2 --trainer.accelerator 'dp'
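The dotted flags above group options under a `trainer` namespace. As a rough illustration only, the sketch below shows how such flags could map onto a nested settings dictionary using the standard library. It is not how Lightning itself parses them; the function name `parse_dotted_args` is hypothetical, and only the two options used in the commands above are registered.

```python
import argparse
from typing import Any, Dict, List, Optional


def parse_dotted_args(argv: Optional[List[str]] = None) -> Dict[str, Dict[str, Any]]:
    """Parse flags such as ``--trainer.gpus 2`` into a nested dict.

    Hypothetical helper for illustration; not part of Lightning.
    """
    parser = argparse.ArgumentParser()
    # Assumed subset of options, taken from the example commands only.
    parser.add_argument("--trainer.gpus", type=int, default=None)
    parser.add_argument("--trainer.accelerator", type=str, default=None)
    namespace = parser.parse_args(argv)

    nested: Dict[str, Dict[str, Any]] = {}
    for dotted_key, value in vars(namespace).items():
        # "trainer.gpus" -> group "trainer", option "gpus"
        group, _, option = dotted_key.partition(".")
        nested.setdefault(group, {})[option] = value
    return nested


if __name__ == "__main__":
    # Mirrors: --trainer.gpus 2 --trainer.accelerator 'dp'
    print(parse_dotted_args(["--trainer.gpus", "2", "--trainer.accelerator", "dp"]))
```

Passing an explicit list keeps the helper easy to try in isolation; with no list, argparse reads the flags from the command line.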
MNIST with DALI
The MNIST example above, using NVIDIA DALI. Requires an NVIDIA DALI installation that matches your CUDA version; see the NVIDIA DALI installation guide.
python dali_image_classifier.py
Image classifier
Generic image classifier with an arbitrary backbone (i.e., a simple system)
# cpu
python backbone_image_classifier.py
# gpus (any number)
python backbone_image_classifier.py --trainer.gpus 2
# dataparallel
python backbone_image_classifier.py --trainer.gpus 2 --trainer.accelerator 'dp'
Autoencoder
Shows the power of a system: arbitrarily complex training loops
# cpu
python autoencoder.py
# gpus (any number)
python autoencoder.py --trainer.gpus 2
# dataparallel
python autoencoder.py --trainer.gpus 2 --trainer.accelerator 'dp'
Multi-node example
This demo launches a job using 2 GPUs on each of 2 different nodes (4 GPUs total). To run this demo, do the following:
- Log into the jumphost node of your SLURM-managed cluster.
- Create a conda environment with Lightning and a GPU PyTorch version.
- Choose a script to submit