History

Yi Wang 366fb39d2e Support post-localSGD in Lightning DDP plugin (#8967 ) Co-authored-by: ananthsub <ananth.subramaniam@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>		2021-08-26 08:24:49 +01:00
..
accelerators	Add support for CPU AMP autocast (#9084 )	2021-08-25 12:18:00 +00:00
base	Replace `yapf` with `black` (#7783 )	2021-07-26 13:37:35 +02:00
callbacks	[2 / 3] improvements to saving and loading callback state (#7187 )	2021-08-24 17:35:19 +00:00
checkpointing	3/n inter batch parallelism (#9052 )	2021-08-24 18:45:54 +00:00
core	Add `ShardedTensor` support in `LightningModule` (#8944 )	2021-08-23 19:59:38 +00:00
deprecated_api	Deprecate `prepare_data_per_node` flag on Trainer and set it as a property for DataHooks (#8958 )	2021-08-23 12:43:45 +00:00
helpers	feat: Add Rich Progress Bar (#8929 )	2021-08-24 02:40:36 +00:00
loggers	sanitize arrays when logging as hyperparameters in TensorBoardLogger (#9031 )	2021-08-24 13:02:06 +02:00
loops	3/n inter batch parallelism (#9052 )	2021-08-24 18:45:54 +00:00
models	Add support for CPU AMP autocast (#9084 )	2021-08-25 12:18:00 +00:00
overrides	Replace `yapf` with `black` (#7783 )	2021-07-26 13:37:35 +02:00
plugins	Support post-localSGD in Lightning DDP plugin (#8967 )	2021-08-26 08:24:49 +01:00
profiler	Fix profiler test on Windows minimal (#8556 )	2021-07-26 13:25:24 +00:00
trainer	Rename test file from log_dir to test_log_dir (#9105 )	2021-08-25 12:48:06 +00:00
tuner	Integrate `total_batch_idx` with progress tracking (#8598 )	2021-08-14 14:08:34 +02:00
utilities	Add validate logic for precision (#9080 )	2021-08-24 20:00:09 +00:00
README.md	CI: add mdformat (#8673 )	2021-08-03 18:19:09 +00:00
__init__.py	Replace `yapf` with `black` (#7783 )	2021-07-26 13:37:35 +02:00
conftest.py	Add `ShardedTensor` support in `LightningModule` (#8944 )	2021-08-23 19:59:38 +00:00
mnode_tests.txt	Mnodes (#5020 )	2021-02-04 20:55:40 +01:00
special_tests.sh	Torch Elastic DDP DeadLock bug fix (#8655 )	2021-08-02 21:48:43 +02:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have at least 2 GPUs to run distributed tests.

Note that this setup will not run tests that require specific packages installed such as Horovod, FairScale, NVIDIA/apex, NVIDIA/DALI, etc. You can rely on our CI to make sure all these tests pass.

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.9 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.9 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.9 bash
docker image rm pytorch_lightning:devel-torch1.9