History

Sean Naren 8a9ee403be Add Windows Support for DeepSpeed (#8488 ) * Modify deepspeed distributed to support windows * Add weak test * Cleanups * Capture more in tests * Add comment * Cleaner asserts		2021-07-20 13:55:52 +00:00
..
accelerators	Add support for devices flag to Trainer (#8440 )	2021-07-20 04:33:12 +00:00
base	…
callbacks	Fix: handle logical CUDA device IDs for GPUStatsMonitor if `CUDA_VISIBLE_DEVICES` set (#8260 )	2021-07-19 11:42:43 +00:00
checkpointing	Remove `check_checkpoint_callback` (#7724 )	2021-07-19 11:29:00 +00:00
core	Add `ModelCheckpoint(save_on_train_epoch_end)` (#8389 )	2021-07-13 14:47:59 +00:00
deprecated_api	move deprecation test to correct 1.6 test file (#8446 )	2021-07-19 10:46:03 +00:00
helpers	Use literal syntax instead of function calls to create data structure (#8406 )	2021-07-14 10:32:13 +00:00
loggers	Remove unnecessary comprehension (#8405 )	2021-07-19 08:30:24 +00:00
loops	Do not reset Loops total counters (#8475 )	2021-07-19 18:22:47 +02:00
metrics	Add missing logging tests (#8195 )	2021-06-29 22:52:50 +00:00
models	Set minimum PyTorch version to 1.6 (#8288 )	2021-07-13 17:12:49 +00:00
overrides	Remove unnecessary use of comprehension (#8149 )	2021-06-27 10:00:02 +01:00
plugins	Add Windows Support for DeepSpeed (#8488 )	2021-07-20 13:55:52 +00:00
profiler	Use `default_root_dir` as the `log_dir` with `LoggerCollection`s (#8187 )	2021-07-19 22:12:12 +00:00
trainer	Use `default_root_dir` as the `log_dir` with `LoggerCollection`s (#8187 )	2021-07-19 22:12:12 +00:00
tuner	Remove unnecessary comprehension (#8405 )	2021-07-19 08:30:24 +00:00
utilities	Remove unnecessary use of comprehension (#8471 )	2021-07-19 15:43:28 +02:00
README.md	Set minimum PyTorch version to 1.6 (#8288 )	2021-07-13 17:12:49 +00:00
__init__.py	…
conftest.py	Support `DDPPlugin` to be used on CPU (#6208 )	2021-07-02 12:00:24 +01:00
mnode_tests.txt	…
special_tests.sh	support launching Lightning ddp with traditional command (#7480 )	2021-07-14 11:25:36 +00:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have at least 2 GPUs to run distributed tests.

Note that this setup will not run tests that require specific packages installed such as Horovod, FairScale, NVIDIA/apex, NVIDIA/DALI, etc. You can rely on our CI to make sure all these tests pass.

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.9 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.9 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.9 bash
docker image rm pytorch_lightning:devel-torch1.9