Commit Graph

7542 Commits

Author SHA1 Message Date
Dmitry Frolov 693116f3b5
[CLI] Fix status message on cluster creation (#14477)
* Fix message on BYOC cluster creation

Co-authored-by: thomas chaton <thomas@grid.ai>
2022-09-04 14:03:38 -04:00
Carlos Mocholí ce0bde702a
Remove deprecated `test_tube` dependency (#14513) 2022-09-04 11:29:56 +00:00
Carlos Mocholí fed1d8dc6a
Pin protobuf (#14512) 2022-09-03 14:38:30 +00:00
Laverne Henderson b19b15546b
Updated basic debugging (#14488)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-02 14:23:29 +05:30
Adrian Wälchli 291dc1b615
Standalone Lite CI setup (#14451)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-09-01 22:13:12 +00:00
Carlos Mocholí e0c2c3e677
Clean up fairscale imports (#14476) 2022-09-01 18:08:40 +02:00
Benjamin Krala cce55b6cd3
Precise description of reload_dataloaders_every_n_epochs (#14245) 2022-09-01 10:06:28 -04:00
Adrian Wälchli 28e18881a9
Mark stage argument in hooks as required (#14064)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-09-01 15:47:40 +02:00
Adam Bobowski 764b348249
[App] Handling s3 rate limiting in framework (#14411)
bump of fsspec and s3fs to version supporting retry on "SlowDown" response
2022-09-01 11:12:04 +00:00
Rohit Gupta 4a81b9b99e
Update changelog after v1.7.4 release (#14479) 2022-09-01 11:08:50 +00:00
Mansy 1bcb5c301d
[App][CLI] Fix lightning cli --version (#14433)
* [App][CLI] Fix lightning cli --version
2022-09-01 07:58:09 +00:00
Sherin Thomas c1a7254b7a
Dependency pinning (#14463)
* deps pinned
* Apply suggestions from code review

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-31 21:24:28 +00:00
Rohit Gupta e90ac769d6
Reset dataloaders on failure in tuner (#14372) 2022-08-31 21:00:18 +00:00
thomas chaton a7c01c4754
Introduce lightning connect (#14452)
* update

* update

* update

* update

* Review of content

* Formatting updates

* Fomatting updates

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Updates based on new commits

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update

* update

* update

* update

* Introduce lightning connect (#14183)


Co-authored-by: Luca Antiga <luca.antiga@gmail.com>

Co-authored-by: Felonious-Spellfire <felonious.spellfire@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
2022-08-31 15:59:03 -04:00
Jirka Borovec 314d951806
CI: Azure clear workspace (#14460) 2022-08-31 21:11:06 +02:00
Carlos Mocholí 10adcd5d12
Remove the unused pyDeprecate dependency (#14472)
* Remove the unused pyDeprecate dependency
* CHANGELOG
2022-09-01 03:30:18 +09:00
Carlos Mocholí 2e3d85af84
Remove deprecated rank zero utilities (#14471)
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-31 18:29:11 +00:00
Anner 626827c872
update rng state save/load test to also run on cuda gpu (#14396)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-31 16:36:35 +00:00
Akihiro Nitta f2cacf4b10
CI: Reuse check schema (#14469)
* rm _check-shema.yml
* Reuse devtools' check schema
2022-08-31 17:37:49 +02:00
Carlos Mocholí a1dd718781
Remove deprecated support for passing the warning category positionally (#14470) 2022-08-31 17:34:56 +02:00
Sherin Thomas ebc650bd95
E2E fix for custom base image (#14468)
* new custom base image

* image tag
2022-08-31 09:27:21 -04:00
Carlos Mocholí 00aefa82b7
Cleanup TPU CI script error management (#14389) 2022-08-31 11:38:54 +00:00
Dmitry Frolov 9372981263
[CLI] Cluster logs CLI improvements: new log labels + test coverage increasing (#14459)
* Cluster logs improvements

* Unit tests added

* Labels for processing deletion errors
2022-08-30 22:22:01 -04:00
Jirka Borovec 74538f797c
CI: skip examples with draft (#14453) 2022-08-30 23:20:06 +00:00
Adam Bobowski e8121610a4
[App] fix panel requirements (#14450)
* update base requirements

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* try main

* Apply suggestions from code review

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* extract into separate function

* drop

* up

* up

* optional

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* .

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-30 16:13:43 -04:00
Jirka Borovec 74304db6f8
CI: update TPU docker (#14448) 2022-08-31 00:47:38 +05:30
Jirka Borovec 6188593abd
ci: drop group probot (#14456) 2022-08-30 13:38:51 -04:00
Jirka Borovec 36aefdab90
CI: set probot timeout (#14455) 2022-08-30 13:03:32 -04:00
William Falcon 255b46941e
Update README.md 2022-08-30 12:39:15 -04:00
Jirka Borovec 208bf6faa8
prepare space for fused docs (#14160)
* copy app conf

* ci + req.

* script symlink

* wip

* keep only App

* add also PL

* lightning

* artifact
2022-08-30 09:25:05 -04:00
Jirka Borovec 7987a1b453
update notebooks (#14340) 2022-08-30 12:12:48 +00:00
Jirka Borovec ce2c193662
CI: add e2e cron job (#14402)
* add e2e cron job
* trigger
* workflow_dispatch

Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-08-30 12:33:58 +02:00
Carlos Mocholí 291267c3bf
Unify rank zero messaging utilities (#14116) 2022-08-30 09:51:30 +00:00
Marc Skov Madsen 18e2a8eecd
PanelFrontend and Panel Web UI Intermediate docs (#13531)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Marc Skov Madsen <masma@orsted.com>
Co-authored-by: Laverne Henderson <laverne.henderson@coupa.com>
Co-authored-by: Felonious-Spellfire <felonious.spellfire@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-08-30 00:33:21 +00:00
Jirka Borovec 2374465b01
Azure: local id for e2e (#14432) 2022-08-29 20:48:40 +00:00
ananthsub d0d1818d50
Update `has_len_all_ranks` to use `Strategy.root_device` (#12144)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-29 20:23:34 +00:00
Jirka Borovec 80b1987ad9
Update GHA job names (#14400)
* update CJ job names
* groups
* filter
* Apply suggestions from code review

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-08-29 20:03:37 +02:00
Mansy 66ca093332
Fix cloud e2e, artifacts and cleanup (#14392) 2022-08-29 15:16:22 +02:00
Carlos Mocholí f202e84f4b
Remove the legacy `get_deprecated_arg_names` (#14415) 2022-08-29 14:53:57 +02:00
dependabot[bot] 2b7cd58e06
Bump tj-actions/changed-files from 28 to 29.0.1 (#14430)
Bumps [tj-actions/changed-files](https://github.com/tj-actions/changed-files) from 28 to 29.0.1.
- [Release notes](https://github.com/tj-actions/changed-files/releases)
- [Changelog](https://github.com/tj-actions/changed-files/blob/main/HISTORY.md)
- [Commits](https://github.com/tj-actions/changed-files/compare/v28...v29.0.1)

---
updated-dependencies:
- dependency-name: tj-actions/changed-files
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-29 17:08:37 +09:00
Krishna Kalyan 1a3fe39571
Removed deprecated `Trainer.num_processes` property in favour of `Trainer.num_devices` (#14423)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-28 23:59:24 +02:00
Krishna Kalyan 5cbe1f48d2
Removed the deprecated `Trainer.data_parallel_device_ids` function in favour of `Trainer.device_ids` (#14422)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-28 18:07:00 +00:00
Krishna Kalyan cea9a72d9d
Removed the deprecated the `trainer.lr_schedulers` (#14408)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-28 18:06:09 +00:00
otaj 1e04951206
Remove deprecated `TrainerCallbackHookMixin` (#14401)
* remove deprecated callback hook

* changelog
2022-08-28 10:56:37 +00:00
JongMok Lee 03f2f32445
Fix mypy errors in `pytorch_lightning/strategies/sharded.py` (#14184)
Co-authored-by: otaj <ota@lightning.ai>
2022-08-27 22:07:36 +00:00
Rohit Gupta af688dee69
Update changelog after v1.7.3 release (#14398) 2022-08-27 19:44:54 +00:00
Rohit Gupta f3574176e2
Change `trainer.should_stop` to not stop in between an epoch and run until `min_steps/min_epochs` only (#13890) 2022-08-27 12:12:24 +00:00
Mansy 0e30e4a5a0
[App][CI] Fix psutil requirement CI (#14413) 2022-08-26 23:55:22 +00:00
Adrian Wälchli 250c06e406
Remove deprecated HPC model hooks (#14315)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-26 20:59:32 +00:00
dependabot[bot] 714137d4d6
Bump ravsamhq/notify-slack-action from 1 to 2 (#14290)
Bumps [ravsamhq/notify-slack-action](https://github.com/ravsamhq/notify-slack-action) from 1 to 2.
- [Release notes](https://github.com/ravsamhq/notify-slack-action/releases)
- [Commits](https://github.com/ravsamhq/notify-slack-action/compare/v1...v2)

---
updated-dependencies:
- dependency-name: ravsamhq/notify-slack-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-26 20:41:36 +00:00