* add MNIST DALI example, update README.md
* Fix PEP8 warnings
* reformatted using black
* add mnist_dali to test_examples.py
* Add documentation as docstrings
* add nvidia-pyindex and nvidia-dali-cuda100
* replace nvidia-pyindex with --extra-index-url
* mark mnist_dali test as Linux and GPU only
* adjust CUDA docker and examples.txt, fix import error in test_examples.py
* adjust the GPU check
* Exit when DALI is not available
* remove requirements-examples.txt and DALI pip install
* Refactored example, moved to new logging api, added runtime check for test and dali script
* Patch to reflect the mnist example module
* add req.
* Apply suggestions from code review
* Removed requirement as it breaks CPU install, added note in README to install DALI
* add DALI to Drone
* test examples
* Apply suggestions from code review
* imports
* ABC
* cuda
* cuda
* pip DALI
* Move build into init function
Co-authored-by: SeanNaren <sean@grid.ai>
Co-authored-by: Jirka Borovec <jirka@pytorchlightning.ai>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Sean Naren <sean.narenthiran@gmail.com>
* switch to 1.6
* readme
* 1.7
* back to normal [ci skip]
* horovodrun --verbose
* try with apex
* add apex test
* change base
* description
* test with 1.7
* back to 1.6
* no gradient_clip_val
* re-add gradient_clip_val
* no amp
* temp skip torch.cuda.amp + horovod test
* Apply suggestion from code review
Co-authored-by: Sean Naren <sean.narenthiran@gmail.com>
* Fix formatting
* ddp
* Moved extended model outside of function to prevent pickling issue for drone
* typo
* resolve bug
* extract automatic_automization
Co-authored-by: Sean Naren <sean.narenthiran@gmail.com>
Co-authored-by: SeanNaren <sean@grid.ai>
Co-authored-by: chaton <thomas@grid.ai>
* upgrade PT version
* update docker
* docker
* try 1.5
* badge
* fix typo: dor -> for (#3918)
* prune
* prune
* env
* echo
* try
* notes
* env
* env
* env
* notes
* docker
* prune
* maintainer
* CI
* update
* just 1.5
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* docker
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* CI
* push
* try
* prune
* CI
* CI
* CI
* CI
Co-authored-by: Klyukin Valeriy <mr.clyukin@gmail.com>
Co-authored-by: Jeff Yang <ydcjeff@outlook.com>
* activated color in all pytest runs
* Update .drone.yml
Co-authored-by: Jeff Yang <ydcjeff@outlook.com>
Co-authored-by: Jeff Yang <ydcjeff@outlook.com>
* Add Github Action to run TPU tests.
* Trigger new Github Actions run.
* Clean up more comments.
* Use different fixed version of ml-testing-accelerators and update config to match.
* use cluster in us-central1-a
* Run 'gcloud logging read' directly without 'echo' to preserve newlines.
* cat coverage.xml on the TPU VM side and upload xml on the Github Action side
* Use new commit on ml-testing-accelerators so command runs fully.
* Preserve newlines in the xml and use if: always() temporarily to upload codecov
* Use pytorch_lightning for coverage instead of pytorch-lightning
* Remove the debug cat of coverage xml
* Apply suggestions from code review
* jsonnet rename
* name
* add codecov flags
* add codecov flags
* codecov
* codecov
* revert codecov
* Clean up after apt-get and remove old TODOs.
* More codefactor cleanups.
* drone
* drone
* disable codecov
* cleaning
* docker py versions
* docker py 3.7
* readme
* bash
* docker
* freeze conda
* py3.6
* Stop using apt-get clean.
* Dont rm pytorch-lightning
* Update docker/tpu/Dockerfile
* Longer timeout in the Github Action to wait for GKE to finish.
* job1
* job2
* job3
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Jirka <jirka@pytorchlightning.ai>
* adding test
* adding test
* added base parity model
* added base parity model
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* added parity test
* move parity to benchmark
* formatting
* fixed gradient acc sched
* move parity to benchmark
* formatting
* fixed gradient acc sched
* skip for CPU
* call last
Co-authored-by: J. Borovec <jirka.borovec@seznam.cz>