Continuous Integration and Delivery
Unit and Integration Testing
workflow name | workflow file | action | accelerator* |
---|---|---|---|
Test PyTorch full | .github/workflows/ci-tests-pytorch.yml | Run all tests except for accelerator-specific, standalone and slow tests. | CPU |
pytorch-lightning (IPUs) | .azure-pipelines/ipu-tests.yml | Run only IPU-specific tests. | IPU |
pytorch-lightning (HPUs) | .azure-pipelines/hpu-tests.yml | Run only HPU-specific tests. | HPU |
pytorch-lightning (GPUs) | .azure-pipelines/gpu-tests-pytorch.yml | Run all CPU and GPU-specific tests, standalone, and examples. Each standalone test needs to be run in separate processes to avoid unwanted interactions between test cases. | GPU |
PyTorchLightning.Benchmark | .azure-pipelines/gpu-benchmark.yml | Run speed/memory benchmarks for parity with pure PyTorch. | GPU |
test-on-tpus | .github/workflows/tpu-tests.yml | Run only TPU-specific tests. | TPU |
*Accelerators used in CI
- GPU: 2 x NVIDIA Tesla V100
- TPU: Google GKE TPUv3
- IPU: Colossus MK1 IPU
- HPU: Intel Habana Gaudi SYS-420GH-TNGR which has 8 Gaudi accelerators
To check which versions of Python or PyTorch are used for testing in our CI, see the corresponding workflow files or the checkgroup config file at .github/checkgroup.yml.
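The GPU pipeline above runs each standalone test in its own process to avoid interactions between test cases. The following is a minimal sketch of that idea, not the actual CI script: the helper name `run_isolated` and the demo snippets are hypothetical, and only the standard-library `subprocess` module is used.

```python
import subprocess
import sys


def run_isolated(snippets):
    """Run each Python snippet in its own interpreter and collect exit codes.

    `snippets` maps a (hypothetical) test name to a string of Python code.
    """
    results = {}
    for name, code in snippets.items():
        # A fresh interpreter per test: globals, environment changes, and
        # other process-level state cannot leak from one test to the next.
        proc = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True,
            text=True,
        )
        results[name] = proc.returncode
    return results


if __name__ == "__main__":
    demo = {
        # Mutates process-level state that would otherwise persist.
        "sets_state": "import os; os.environ['DEMO_ISOLATION_FLAG'] = '1'",
        # Exits non-zero if it can see the state set by the previous test.
        "expects_clean_state": (
            "import os, sys; sys.exit(int('DEMO_ISOLATION_FLAG' in os.environ))"
        ),
    }
    print(run_isolated(demo))
```

The second snippet only succeeds because the first one ran in a different process; run in a single interpreter, the environment change would be visible to it.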
Documentation
workflow file | action |
---|---|
.github/workflows/docs-checks.yml | Run doctest. |
Code Quality
workflow file | action |
---|---|
.codecov.yml | Measure test coverage with codecov.io |
.github/workflows/code-checks.yml | Check Python typing with MyPy. |
.github/workflows/ci-schema.yml | Validate the syntax of workflow files. |
Others
workflow file | action |
---|---|
.github/workflows/cicd-pytorch-dockers.yml | Build Docker images used for testing in CI. If run on the nightly schedule, push them to Docker Hub. |
.github/workflows/ci-pkg-install.yml | Test if pytorch-lightning is successfully installed using pip. |
.github/workflows/events-recurrent.yml | Terminate TPU jobs that live more than one hour to avoid possible resource exhaustion due to hangs. |
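The one-hour cutoff for TPU jobs described above comes down to a simple age check. The sketch below illustrates only that selection logic; the job records, the `find_stale_jobs` name, and the data shape are assumptions made for illustration and do not reflect how the workflow actually queries or terminates jobs.

```python
from datetime import datetime, timedelta, timezone

# Cutoff taken from the table above: jobs older than one hour are terminated.
MAX_AGE = timedelta(hours=1)


def find_stale_jobs(jobs, now, max_age=MAX_AGE):
    """Return the names of jobs that started more than `max_age` before `now`.

    `jobs` maps a (hypothetical) job name to its timezone-aware start time.
    """
    return sorted(name for name, started in jobs.items() if now - started > max_age)


if __name__ == "__main__":
    now = datetime(2022, 12, 1, 12, 0, tzinfo=timezone.utc)
    jobs = {
        "tpu-job-a": now - timedelta(minutes=30),  # still within the limit
        "tpu-job-b": now - timedelta(hours=2),     # likely hung
    }
    print(find_stale_jobs(jobs, now))
```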
Deployment
workflow file | action |
---|---|
.github/workflows/release-pypi.yml | Publish a release to PyPI. |
.github/workflows/release-docker.yml | Build Docker images from dockers/*/Dockerfile and publish them on hub.docker.com. |
.github/workflows/legacy-checkpoints.yml | On request, generate legacy checkpoints and upload them to AWS S3. |
.github/workflows/events-nightly.yml | Publish the package to TestPyPI. Publish Docker images on hub.docker.com. |
Bots
workflow file | action |
---|---|
.github/mergify.yml | Label PRs as conflicts or ready, and request reviews if needed. |
.github/stale.yml | Close inactive issues/PRs some time after adding the "won't fix" label to them. |
.github/workflows/probot-auto-cc.yml, .github/lightning-probot.yml | Notify interested maintainers depending on the labels added to an issue. We use lightning-probot, forked from PyTorch’s probot. |
.github/workflows/probot-check-group.yml, .github/checkgroup.yml | Check whether the relevant jobs ran successfully, based on the files changed in the PR. |
.pre-commit-config.yaml | pre-commit.ci runs a set of linters and formatters, such as black, flake8 and isort. When formatting is applied, the bot pushes a commit with its change. This configuration is also used for running pre-commit locally. |
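The check-group idea in the table above, deciding which jobs must pass from the files a PR changes, can be sketched as follows. This is an illustration under stated assumptions: the group layout (`paths`, `checks`), the check names, and both function names are hypothetical and are not the schema of .github/checkgroup.yml.

```python
from fnmatch import fnmatchcase


def required_checks(changed_files, groups):
    """Collect the checks of every group whose path patterns match a changed file."""
    required = set()
    for group in groups:
        matched = any(
            fnmatchcase(path, pattern)
            for path in changed_files
            for pattern in group["paths"]
        )
        if matched:
            required.update(group["checks"])
    return required


def all_passed(required, statuses):
    """True only if every required check reported success."""
    return all(statuses.get(check) == "success" for check in required)


if __name__ == "__main__":
    # Hypothetical groups; note that fnmatch's "*" also matches "/" in paths.
    groups = [
        {"paths": ["src/pytorch_lightning/*"], "checks": ["pl-cpu", "pl-gpu"]},
        {"paths": ["docs/*"], "checks": ["docs-build"]},
    ]
    needed = required_checks(["docs/source/index.rst"], groups)
    print(needed, all_passed(needed, {"docs-build": "success"}))
```

A missing status counts as not passed, so a required job that never ran blocks the PR just like one that failed.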