lightning/tests
Jirka Borovec 448be60701
update GPU to PT 1.5 (#2779)
* update gpu PT 1.6

* fix docker

* use PT 1.5

* Update tests/install_AMP.sh

Co-authored-by: Nathan Raw <nxr9266@g.rit.edu>

Co-authored-by: Nathan Raw <nxr9266@g.rit.edu>
2020-08-02 08:14:53 -04:00
..
base Call DataModule hooks implicitly in trainer (#2755) 2020-08-01 20:17:57 -04:00
callbacks pytorch 1.6 (#2745) 2020-07-31 11:18:32 +02:00
core Call DataModule hooks implicitly in trainer (#2755) 2020-08-01 20:17:57 -04:00
loggers Add missing methods to logger collection (#2723) 2020-07-29 23:53:02 +02:00
metrics metrics: add SSIM (#2671) 2020-07-23 12:13:52 -04:00
models update CI testing with pip upgrade (#2380) 2020-07-31 14:50:06 -04:00
trainer Fix shuffle for distributed sampler (#2789) 2020-08-01 23:22:57 -04:00
utilities Bugfix/torchtext include lengths (#2689) 2020-07-31 07:53:08 -04:00
README.md test dockers & add AMP in pt-1.6 (#1584) 2020-07-31 08:23:13 -04:00
__init__.py default test logger (#1478) 2020-04-21 20:33:10 -04:00
collect_env_details.py cleaning (#2030) 2020-06-04 11:25:07 -04:00
conftest.py repair CI for Win (#2358) 2020-06-26 21:38:25 -04:00
install_AMP.sh update GPU to PT 1.5 (#2779) 2020-08-02 08:14:53 -04:00
test_deprecated.py remove deprecated in v0.9 (#2760) 2020-07-30 23:19:28 +02:00
test_profiler.py RC & Docs/changelog (#1776) 2020-05-11 21:57:53 -04:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

The automatic travis tests ONLY run CPU-based tests. Although these cover most of the use cases, run on a 2-GPU machine to validate the full test-suite.

To run all tests do the following:

Install Open MPI or another MPI implementation. Learn how to install Open MPI on this page.

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install AMP support
bash tests/install_AMP.sh

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have:

  1. At least 2 GPUs.
  2. NVIDIA-apex installed.
  3. Horovod with NCCL support: HOROVOD_GPU_ALLREDUCE=NCCL HOROVOD_GPU_BROADCAST=NCCL pip install horovod

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.4 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.4 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.4 bash
docker image rm pytorch_lightning:devel-torch1.4