History

Adrian Wälchli bb7d188318 Fix ModelCheckpoint race condition in file existence check (#5155 ) Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Nicki Skafte <skaftenicki@gmail.com>		2021-02-05 21:40:39 +01:00
..
accelerators	flake8 + yapf	2021-02-04 20:55:58 +01:00
base	[feat] Add PyTorch Profiler. (#5560 )	2021-01-26 06:48:54 -05:00
callbacks	[BugOnFeat] Resolve bug with Finetuning (#5744 )	2021-02-04 18:36:54 +00:00
checkpointing	Fix ModelCheckpoint race condition in file existence check (#5155 )	2021-02-05 21:40:39 +01:00
core	update tests with new auto_opt api (#5466 )	2021-02-03 19:39:28 +01:00
deprecated_api	Refactor LightningDataParallel (#5670 )	2021-01-31 06:08:16 -05:00
loggers	Ignore `step` param in Neptune logger's log_metric method (#5510 )	2021-02-04 20:55:41 +01:00
metrics	Fix `num_classes` arg in F1 metric (#5663 )	2021-02-05 21:40:30 +01:00
models	Fix ModelCheckpoint race condition in file existence check (#5155 )	2021-02-05 21:40:39 +01:00
overrides	Refactor LightningDataParallel (#5670 )	2021-01-31 06:08:16 -05:00
plugins	update tests with new auto_opt api (#5466 )	2021-02-03 19:39:28 +01:00
trainer	move progress bar test to correct test folder (#5667 )	2021-02-05 21:40:29 +01:00
tuner	refactor - check F841 (#5202 )	2020-12-21 11:10:55 +05:30
utilities	Unify attribute finding logic, fix not using dataloader when hparams present (#4559 )	2021-02-04 20:55:41 +01:00
README.md	Fix pre-commit trailing-whitespace and end-of-file-fixer hooks. (#5387 )	2021-01-26 14:27:56 +01:00
__init__.py	tests for legacy checkpoints (#5223 )	2021-01-26 14:27:56 +01:00
collect_env_details.py	add copyright to tests (#5143 )	2021-01-05 09:57:37 +01:00
conftest.py	update isort config (#5335 )	2021-01-06 12:49:23 +01:00
mnode_tests.txt	Mnodes (#5020 )	2021-02-04 20:55:40 +01:00
special_tests.sh	[Feat-BugFix] Resolve custom DataLoader (#5745 )	2021-02-05 09:03:18 +00:00
test_profiler.py	update isort config (#5335 )	2021-01-06 12:49:23 +01:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

The automatic travis tests ONLY run CPU-based tests. Although these cover most of the use cases, run on a 2-GPU machine to validate the full test-suite.

To run all tests do the following:

Install Open MPI or another MPI implementation. Learn how to install Open MPI on this page.

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install AMP support
bash requirements/install_AMP.sh

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have:

At least 2 GPUs.
NVIDIA-apex installed.
Horovod with NCCL support: HOROVOD_GPU_OPERATIONS=NCCL pip install horovod

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.4 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.4 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.4 bash
docker image rm pytorch_lightning:devel-torch1.4