lightning/tests
Sean Naren 8a9ee403be
Add Windows Support for DeepSpeed (#8488)
* Modify deepspeed distributed to support windows

* Add weak test

* Cleanups

* Capture more in tests

* Add comment

* Cleaner asserts
2021-07-20 13:55:52 +00:00
..
accelerators Add support for devices flag to Trainer (#8440) 2021-07-20 04:33:12 +00:00
base
callbacks Fix: handle logical CUDA device IDs for GPUStatsMonitor if `CUDA_VISIBLE_DEVICES` set (#8260) 2021-07-19 11:42:43 +00:00
checkpointing Remove `check_checkpoint_callback` (#7724) 2021-07-19 11:29:00 +00:00
core Add `ModelCheckpoint(save_on_train_epoch_end)` (#8389) 2021-07-13 14:47:59 +00:00
deprecated_api move deprecation test to correct 1.6 test file (#8446) 2021-07-19 10:46:03 +00:00
helpers Use literal syntax instead of function calls to create data structure (#8406) 2021-07-14 10:32:13 +00:00
loggers Remove unnecessary comprehension (#8405) 2021-07-19 08:30:24 +00:00
loops Do not reset Loops total counters (#8475) 2021-07-19 18:22:47 +02:00
metrics Add missing logging tests (#8195) 2021-06-29 22:52:50 +00:00
models Set minimum PyTorch version to 1.6 (#8288) 2021-07-13 17:12:49 +00:00
overrides Remove unnecessary use of comprehension (#8149) 2021-06-27 10:00:02 +01:00
plugins Add Windows Support for DeepSpeed (#8488) 2021-07-20 13:55:52 +00:00
profiler Use `default_root_dir` as the `log_dir` with `LoggerCollection`s (#8187) 2021-07-19 22:12:12 +00:00
trainer Use `default_root_dir` as the `log_dir` with `LoggerCollection`s (#8187) 2021-07-19 22:12:12 +00:00
tuner Remove unnecessary comprehension (#8405) 2021-07-19 08:30:24 +00:00
utilities Remove unnecessary use of comprehension (#8471) 2021-07-19 15:43:28 +02:00
README.md Set minimum PyTorch version to 1.6 (#8288) 2021-07-13 17:12:49 +00:00
__init__.py
conftest.py Support `DDPPlugin` to be used on CPU (#6208) 2021-07-02 12:00:24 +01:00
mnode_tests.txt
special_tests.sh support launching Lightning ddp with traditional command (#7480) 2021-07-14 11:25:36 +00:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have at least 2 GPUs to run distributed tests.

Note that this setup will not run tests that require specific packages installed such as Horovod, FairScale, NVIDIA/apex, NVIDIA/DALI, etc. You can rely on our CI to make sure all these tests pass.

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.9 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.9 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.9 bash
docker image rm pytorch_lightning:devel-torch1.9