lightning/tests
Yi Wang 366fb39d2e
Support post-localSGD in Lightning DDP plugin (#8967)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-08-26 08:24:49 +01:00
accelerators Add support for CPU AMP autocast (#9084) 2021-08-25 12:18:00 +00:00
base Replace `yapf` with `black` (#7783) 2021-07-26 13:37:35 +02:00
callbacks [2 / 3] improvements to saving and loading callback state (#7187) 2021-08-24 17:35:19 +00:00
checkpointing 3/n inter batch parallelism (#9052) 2021-08-24 18:45:54 +00:00
core Add `ShardedTensor` support in `LightningModule` (#8944) 2021-08-23 19:59:38 +00:00
deprecated_api Deprecate `prepare_data_per_node` flag on Trainer and set it as a property for DataHooks (#8958) 2021-08-23 12:43:45 +00:00
helpers feat: Add Rich Progress Bar (#8929) 2021-08-24 02:40:36 +00:00
loggers sanitize arrays when logging as hyperparameters in TensorBoardLogger (#9031) 2021-08-24 13:02:06 +02:00
loops 3/n inter batch parallelism (#9052) 2021-08-24 18:45:54 +00:00
models Add support for CPU AMP autocast (#9084) 2021-08-25 12:18:00 +00:00
overrides Replace `yapf` with `black` (#7783) 2021-07-26 13:37:35 +02:00
plugins Support post-localSGD in Lightning DDP plugin (#8967) 2021-08-26 08:24:49 +01:00
profiler Fix profiler test on Windows minimal (#8556) 2021-07-26 13:25:24 +00:00
trainer Rename test file from log_dir to test_log_dir (#9105) 2021-08-25 12:48:06 +00:00
tuner Integrate `total_batch_idx` with progress tracking (#8598) 2021-08-14 14:08:34 +02:00
utilities Add validate logic for precision (#9080) 2021-08-24 20:00:09 +00:00
README.md CI: add mdformat (#8673) 2021-08-03 18:19:09 +00:00
__init__.py Replace `yapf` with `black` (#7783) 2021-07-26 13:37:35 +02:00
conftest.py Add `ShardedTensor` support in `LightningModule` (#8944) 2021-08-23 19:59:38 +00:00
mnode_tests.txt Mnodes (#5020) 2021-02-04 20:55:40 +01:00
special_tests.sh Torch Elastic DDP DeadLock bug fix (#8655) 2021-08-02 21:48:43 +02:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc.). This covers most combinations of important settings. To pass, a test expects the model to reach a reasonable test accuracy.

Running tests

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require a GPU, run the above command on a GPU machine. The machine must have at least 2 GPUs to run the distributed tests.
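Before starting a long run, it can help to confirm that the machine actually exposes at least 2 GPUs. The sketch below is not part of the Lightning test suite. It assumes NVIDIA GPUs and that the `nvidia-smi` CLI is on the `PATH`; the helper names `count_nvidia_gpus` and `can_run_distributed_tests` are hypothetical and used only for illustration.

```python
import shutil
import subprocess


def count_nvidia_gpus() -> int:
    """Return the number of GPUs listed by `nvidia-smi -L`, or 0 if it cannot be queried."""
    # Assumption: GPUs are NVIDIA devices and `nvidia-smi` is installed.
    if shutil.which("nvidia-smi") is None:
        return 0
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"],
            capture_output=True,
            text=True,
            timeout=30,
            check=False,
        )
    except (OSError, subprocess.SubprocessError):
        return 0
    if result.returncode != 0:
        return 0
    # `nvidia-smi -L` prints one line per GPU, each starting with "GPU <index>:".
    return sum(1 for line in result.stdout.splitlines() if line.startswith("GPU "))


def can_run_distributed_tests(min_gpus: int = 2) -> bool:
    """Hypothetical check mirroring the 2-GPU requirement stated above."""
    return count_nvidia_gpus() >= min_gpus


if __name__ == "__main__":
    print(f"GPUs found: {count_nvidia_gpus()}")
    print(f"Enough GPUs for distributed tests: {can_run_distributed_tests()}")
```

If the check reports fewer than 2 GPUs, the README's requirement for distributed tests is not met on that machine.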

Note that this setup will not run tests that require specific packages to be installed, such as Horovod, FairScale, NVIDIA/apex, NVIDIA/DALI, etc. You can rely on our CI to make sure all these tests pass.
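As a rough illustration of how a test can be skipped when an optional package is missing, here is a minimal sketch using only the standard library. It is not the mechanism the Lightning test suite itself uses. The helper `package_available` and the test class `HorovodSmokeTest` are hypothetical names introduced for this example.

```python
import importlib.util
import unittest


def package_available(name: str) -> bool:
    """Return True if a top-level package can be found without importing it."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec can raise for broken or partially initialised modules.
        return False


# Hypothetical test: skipped unless the optional `horovod` package is installed.
@unittest.skipUnless(package_available("horovod"), "requires Horovod")
class HorovodSmokeTest(unittest.TestCase):
    def test_import(self):
        import horovod  # noqa: F401


if __name__ == "__main__":
    unittest.main()
```

On a machine without Horovod, the test above is reported as skipped rather than failed, which matches the behaviour described for the default setup.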

Running coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# export results
coverage xml

Building test image

You can build the image yourself. Note that the build takes a long time, so be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.9 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.9 .

To build other versions, select a different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.9 bash
docker image rm pytorch_lightning:devel-torch1.9