lightning/tests
Maxim Ostroukhov c208ac68c8
Added experiment_id to NeptuneLogger (#3462)
* 1) Added experiment_id to NeptuneLogger initialization input arguments.
2) Now function _create_or_get_experiment() overrides "experiment_name", "params", "properties", "tags".

* Added test case for existing experiment.

* Revert "Added test case for existing experiment."

This reverts commit 9f3ba2e37b.

* Added test case for existing experiment.

* Fix merging issue.

* Moved experiment_id assignment directly to the part with experiment initialization.

* Update pytorch_lightning/loggers/neptune.py

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2020-11-16 23:50:23 +05:30
..
backends ref: unify slurm and TE under backendPlugin 3/n (#4581) 2020-11-08 15:32:37 -05:00
base [feat] Logging refactor 2/n - train (#4495) 2020-11-05 22:27:04 +00:00
callbacks isolate PL debugger in tests (#4643) 2020-11-14 11:22:56 +00:00
checkpointing isolate PL debugger in tests (#4643) 2020-11-14 11:22:56 +00:00
core Add dirpath and filename parameter in ModelCheckpoint (#4213) 2020-10-23 09:59:12 +05:30
loggers Added experiment_id to NeptuneLogger (#3462) 2020-11-16 23:50:23 +05:30
metrics [metrics] change default behaviour of state dict (#4685) 2020-11-16 12:33:45 +00:00
models allow decorate model init with saving hparams (#4662) 2020-11-16 11:02:26 +01:00
plugins Switch to PyTorch 1.6 in Drone CI (#4393) 2020-11-03 18:01:51 +00:00
trainer [HOTFIX] Logging for evaluation (#4684) 2020-11-15 10:41:33 -05:00
tuner fix: `nb` is set total number of devices, when nb is -1. (#4209) 2020-10-29 10:50:37 +01:00
utilities Find parameters which are specified in the LightningDataModule, only (#4347) 2020-11-10 14:01:20 +01:00
README.md Horovod: fixed early stopping and added metrics aggregation (#3775) 2020-11-05 12:52:02 -05:00
__init__.py changelogs clean (#3082) 2020-08-20 22:58:53 +00:00
collect_env_details.py fix tensorboard version (#3132) 2020-09-15 23:48:48 +02:00
conftest.py repair CI for Win (#2358) 2020-06-26 21:38:25 -04:00
test_deprecated.py update docs on checkpoint_callback Trainer argument (#4461) 2020-11-02 06:18:20 +01:00
test_profiler.py RC & Docs/changelog (#1776) 2020-05-11 21:57:53 -04:00

README.md

PyTorch-Lightning Tests

Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc...). This provides testing for most combinations of important settings. The tests expect the model to perform to a reasonable degree of testing accuracy to pass.

Running tests

The automatic travis tests ONLY run CPU-based tests. Although these cover most of the use cases, run on a 2-GPU machine to validate the full test-suite.

To run all tests do the following:

Install Open MPI or another MPI implementation. Learn how to install Open MPI on this page.

git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning

# install AMP support
bash requirements/install_AMP.sh

# install dev deps
pip install -r requirements/devel.txt

# run tests
py.test -v

To test models that require GPU make sure to run the above command on a GPU machine. The GPU machine must have:

  1. At least 2 GPUs.
  2. NVIDIA-apex installed.
  3. Horovod with NCCL support: HOROVOD_GPU_OPERATIONS=NCCL pip install horovod

Running Coverage

Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.

cd pytorch-lightning

# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v

# print coverage stats
coverage report -m

# exporting results
coverage xml

Building test image

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch_lightning:devel-torch1.4 -f dockers/cuda-extras/Dockerfile --build-arg TORCH_VERSION=1.4 .

To build other versions, select different Dockerfile.

docker image list
docker run --rm -it pytorch_lightning:devel-torch1.4 bash
docker image rm pytorch_lightning:devel-torch1.4