Commit Graph

8925 Commits

Author SHA1 Message Date
Carlos Mocholí 83b88996cd
Move `_TrainingEpochLoop` (#16801) 2023-02-20 13:33:01 +01:00
Carlos Mocholí 365bf10936
Resolve FitLoop setter TODOs (#16803) 2023-02-20 13:32:36 +01:00
Carlos Mocholí 781768d2b2
Remove `Trainer(multiple_trainloader_mode)` in favor of `CombinedLoader(mode)` (#16800) 2023-02-20 13:32:06 +01:00
Adrian Wälchli 81b7c30291
Make DDP subprocess the default launcher for multi-device (#16780) 2023-02-20 11:20:50 +00:00
Mauricio Villegas 3a0519143a
Fix bug in lightning_cli_advanced_3.rst (#16792) 2023-02-20 12:12:17 +01:00
Sebastian Raschka a4f4b5372a
Add missing docs quote (#16797)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-02-20 11:09:55 +01:00
Adrian Wälchli 2844e9e246
Fix XLAEnvironment detection on TPU pod (#16806) 2023-02-20 11:01:06 +01:00
Justus Schock c7962a1619
Add back external colossalai test (#16817) 2023-02-20 09:46:40 +00:00
dependabot[bot] 60004eb468
Bump Lightning-AI/utilities from 0.4.1 to 0.6.0 (#16812)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-20 08:17:52 +01:00
Yurij Mikhalevich 6950a07eaa
[App] fix `lightning open` command & better redirects (#16794)
* fix(app): URLs, create run on app run

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix tests

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-17 18:24:23 +00:00
Justus Schock 0fee28409b
Introduce new precision layout in PL (#16783) 2023-02-17 17:58:14 +01:00
Carlos Mocholí ec4f592ecf
Sequential `CombinedLoader` to flatten the eval and predict loops (#16726) 2023-02-17 17:37:11 +01:00
Adrian Wälchli ccd2a481d0
Update changelog after 1.9.2 release (#16777)
changelog

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-02-17 08:52:03 -05:00
Ethan Harris 7f92d5c9d4
[App] Refactor plugins to be a standalone `LightningPlugin` (#16765) 2023-02-17 11:01:38 +00:00
Justus Schock ac5fa03385
Introduce new precision layout in fabric (#16767) 2023-02-17 10:41:18 +00:00
Ethan Harris 3a354acc61
[App] Reserve APP_SERVER_PORT in cloud port allocation (#16782)
Co-authored-by: thomas chaton <thomas@grid.ai>
2023-02-17 09:33:17 +00:00
Noha Alon 1a6331f88f
fix warning so the user has a clear next step (#16751) 2023-02-17 09:26:44 +02:00
Adrian Wälchli 91e692c767
Rename the TPUSpawnStrategy to XLAStrategy (#16781)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-17 02:06:24 +00:00
Ethan Harris 6e359dcc86
[App] Fix idle timeout e2e (#16786) 2023-02-17 01:52:46 +00:00
Adrian Wälchli 57c1138525
Trigger colossalai integration test in CI (#16789) 2023-02-16 23:45:09 +00:00
Carlos Mocholí d27881e388
Fix `set_epoch` not getting called for prediction dataloaders (#16785) 2023-02-16 23:08:13 +01:00
Carlos Mocholí 746c734e6a
`SequentialMode` and `dataloader_iter` improvements (#16784) 2023-02-16 23:07:46 +01:00
Adrian Wälchli ad698f049b
Update Colossal AI docs and integration (#16778) 2023-02-16 16:14:24 +00:00
Carlos Mocholí cc22ddc716
Remove duplicate no_grad context managers (#16773) 2023-02-16 14:25:56 +00:00
Carlos Mocholí 51d44f57dd
Prefetch if it's not a sized iterable (#16776) 2023-02-16 14:10:16 +00:00
Carlos Mocholí c9452df005
Run XLA's dataloader validation per dataloader (#16775) 2023-02-16 15:09:23 +01:00
Carlos Mocholí 57f2f1c0b4
Fix RunningStage properties for sanity checking (#16774) 2023-02-16 15:08:51 +01:00
Carlos Mocholí f6cad324c0
Use the local batch_idx to update the progress bar (#16760) 2023-02-16 15:08:25 +01:00
Justus Schock 47c69cd8eb
Remove DP (#16748)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-02-16 05:12:08 +00:00
Carlos Mocholí 52c981267e
Make the trainer a required loop argument (#16771) 2023-02-15 21:40:34 -05:00
Lightning Forever 41dd0d1f85
Remove the QuantizationAwareTraining callback (#16750) 2023-02-15 17:29:49 -05:00
Adrian Wälchli 3b7f186a05
Update colossalai version in Dockerfile (#16766)
update docker
2023-02-15 14:20:13 -05:00
Jirka Borovec 0628ec83c4
ci/hotfix: if cache/wheels missing (#16769) 2023-02-15 18:23:23 +00:00
Carlos Mocholí 89ef9e750a
Remove dead code in the loops (#16754) 2023-02-15 17:04:06 +01:00
Carlos Mocholí 318c9c5625
Remove the `Trainer.prediction_writer_callbacks` property (#16759) 2023-02-15 17:03:08 +01:00
Adrian Wälchli 83f4c83582
Replace ColossalAIStrategy with external implementation (#16757) 2023-02-15 15:11:52 +00:00
Adrian Wälchli 565d3bb8c6
CI: Update colossalai version (#16747)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-02-14 14:07:29 -05:00
Carlos Mocholí fbbbbf6ad7
Make dataloader_idx optional for batch start/end hooks (#16753) 2023-02-14 18:31:09 +01:00
Jirka Borovec 2b6c65c863
ci: cleaning caches (#16752) 2023-02-14 18:30:12 +01:00
thomas chaton 104290efa5
[App] Add rm one level below project level (#16740)
Co-authored-by: Ethan Harris <ethanwharris@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: thomas <thomas@thomass-MacBook-Pro.local>
2023-02-14 14:12:51 +00:00
Adrian Wälchli c4074419b5
Remove the BaguaStrategy (#16746)
* remove bagua

* remove

* remove docker file entry
2023-02-14 08:58:58 -05:00
Adrian Wälchli 39020887d2
Remove Trainer's `track_grad_norm` argument (#16745) 2023-02-14 12:38:17 +00:00
thomas chaton 7e8400d277
[App] Add support for private data (#16738)
Co-authored-by: thomas <thomas@thomass-MacBook-Pro.local>
2023-02-14 12:34:44 +00:00
Jirka Borovec 32e71377a8
ci: fix dependabot (#16749) 2023-02-14 12:05:26 +00:00
Carlos Mocholí 53f44e174a
Remove to-device functionality from fetchers (#16731) 2023-02-14 12:06:21 +01:00
Carlos Mocholí 6c037a479f
"sequential" mode for `CombinedLoader` (#16743) 2023-02-14 06:51:48 +01:00
Carlos Mocholí a342410e25
Remove the unused `utilities.parsing.flatten_dict` (#16744) 2023-02-14 06:50:11 +01:00
Carlos Mocholí 513d2e2e12
Group trainer call methods as functions (#16702) 2023-02-14 00:41:51 +00:00
Adrian Wälchli 99cb2cd056
Remove argparse utils (#16708)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-02-13 20:44:30 +00:00
Adrian Wälchli 67c09e3092
Separate the Gradient Accumulation Scheduler from Trainer (#16729)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-13 20:15:38 +00:00