PyTorch-Lightning Tests
Most PL tests train a full MNIST model under various trainer conditions (ddp, ddp2+amp, etc.). This covers most combinations of important settings. To pass, a test expects the trained model to reach a reasonable test accuracy.
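The pass criterion described above can be sketched as a simple threshold check. This is only an illustration: the helper name `assert_min_accuracy` and the default threshold of 0.5 are hypothetical and are not taken from the test suite itself.

```python
def assert_min_accuracy(accuracy: float, min_acc: float = 0.5) -> None:
    """Fail the test if the measured accuracy is below the threshold.

    Both the function name and the default threshold are assumptions
    made for this sketch, not values used by the actual tests.
    """
    assert accuracy >= min_acc, (
        f"model reached test accuracy {accuracy:.3f}, expected at least {min_acc:.3f}"
    )


# A model scoring 0.92 on the test set clears the hypothetical 0.5 threshold.
assert_min_accuracy(0.92)
```

A check of this kind keeps a test sensitive to regressions in training behavior without pinning it to an exact accuracy value.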
Running tests
The automatic Travis CI builds ONLY run CPU-based tests. Although these cover most use cases, run the suite on a machine with at least 2 GPUs to validate the full test suite.
To run all tests, do the following:
First, install Open MPI or another MPI implementation (see the Open MPI documentation for installation instructions).
git clone https://github.com/PyTorchLightning/pytorch-lightning
cd pytorch-lightning
# install AMP support
bash tests/install_AMP.sh
# install dev deps
pip install -r requirements/devel.txt
# run tests
py.test -v
To test models that require a GPU, make sure to run the above command on a GPU machine. The GPU machine must have:
- At least 2 GPUs.
- NVIDIA-apex installed.
- Horovod with NCCL support:
HOROVOD_GPU_ALLREDUCE=NCCL HOROVOD_GPU_BROADCAST=NCCL pip install horovod
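Before starting a full GPU run, it can help to verify the checklist above programmatically. The following is a minimal sketch, not part of the repository: the function names are hypothetical, and it only checks that the `apex` and `horovod` packages are importable. It does not verify that Horovod was built with NCCL support.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if a top-level module `name` can be found in this environment."""
    return importlib.util.find_spec(name) is not None


def missing_requirements(gpu_count: int, is_available=module_available) -> list:
    """List unmet items from the GPU checklist (hypothetical helper)."""
    problems = []
    if gpu_count < 2:
        problems.append(f"need at least 2 GPUs, found {gpu_count}")
    for module in ("apex", "horovod"):
        if not is_available(module):
            problems.append(f"{module} is not installed")
    return problems


def detect_gpu_count() -> int:
    """Count CUDA devices via PyTorch; report 0 if PyTorch is not installed."""
    if not module_available("torch"):
        return 0
    import torch

    return torch.cuda.device_count()


if __name__ == "__main__":
    # Print one line per unmet requirement; no output means the checks passed.
    for problem in missing_requirements(detect_gpu_count()):
        print(problem)
```

Passing the availability check in as a parameter keeps `missing_requirements` easy to exercise on a machine without GPUs.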
Running Coverage
Make sure to run coverage on a GPU machine with at least 2 GPUs and NVIDIA apex installed.
cd pytorch-lightning
# generate coverage (coverage is also installed as part of dev dependencies under requirements/devel.txt)
coverage run --source pytorch_lightning -m py.test pytorch_lightning tests examples -v
# print coverage stats
coverage report -m
# exporting results
coverage xml
Building test image
You can build the test image yourself. Note that the build takes a long time.
git clone <git-repository>
docker image build -t pytorch_lightning:devel-pt_1_4 -f tests/Dockerfile --build-arg TORCH_VERSION=1.4 .
To build other versions, select a different Dockerfile.
docker image list
docker run --rm -it pytorch_lightning:devel-pt_1_4 bash
docker image rm pytorch_lightning:devel-pt_1_4