Commit Graph

9430 Commits

Author SHA1 Message Date
Mauricio Villegas e38c71b828
Fix `LightningCLI` not saving correctly seed_everything for `run=True` (#18056) 2023-07-12 11:53:39 +02:00
Jirka Borovec b16c35d673
drop AWS action (#18050) 2023-07-11 15:45:53 +02:00
Nicki Skafte Detlefsen 4fc6b560a7
Fix compatability with pydantic 2.0+ (#18030)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-07-10 23:51:39 +02:00
Adrian Wälchli a97c559d92
Make model test more robust (#18043)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 20:36:18 +00:00
Adrian Wälchli 69d7cfe5d8
Enable `self.device` access in setup hook (#18021) 2023-07-10 16:49:47 +02:00
Carlos Mocholí ad74f8623f
Don't reapply activation checkpointing (#18006) 2023-07-10 13:24:09 +00:00
Justus Schock 7ca49f2cb7
Requirements update (#18014)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 13:00:20 +00:00
Jirka Borovec 1a8baf61de
drop environment.yml (#18040) 2023-07-10 13:52:14 +02:00
Adrian Wälchli 6d888b5ce0
Fix param_group -> param_groups typo (#18020) 2023-07-09 19:13:33 +00:00
Adrian Wälchli acc70d0ae5
Support all half-precision modes in FSDP precision plugin (#17807)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-07-09 18:40:46 +00:00
Giorgio Strano f95275005a
Add option to change "=" symbol in ModelCheckpoint filenames (#17999)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-07-09 18:37:52 +00:00
Kilian Lieret 9780dfddc0
Fix doc for creating custom progress bar (#18024) 2023-07-09 18:35:06 +00:00
Jirka Borovec 913fa99f1b
pin `pydantic <2.0` (#18022)
pin pydantic <2.0
2023-07-08 16:04:38 +02:00
Carlos Mocholí 9a2bb85d82
Drop `torchdistx` support (#17995) 2023-07-08 02:15:05 +00:00
Leng Yue 734a3253cd
Support PyTorch Lightning's FSDP optimizer states saving and loading (#17819)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-07-07 17:55:59 +00:00
Jirka Borovec 1b43aacadd
Update name in pyproject.toml (#18010) 2023-07-07 17:08:59 +00:00
Justus Schock 821ea004b0
Requirements update (#17998)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-07 17:36:44 +02:00
Carlos Mocholí 9c775c99fa
Add `trainer.print` (#17980)
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-07-07 13:31:44 +00:00
Adrian Wälchli ed86a02896
Fix broken links in README (#18017)
* fix readme links

* fix

* x
2023-07-07 14:59:42 +02:00
Ethan Harris 2c3dfc0fb7
[App] Content for plugins (#17243)
Co-authored-by: Yurij Mikhalevich <yurij@grid.ai>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
2023-07-07 11:05:58 +01:00
Minh-Long Luu (刘明龙) c8656f1a27
docs: `fabric_methods.rst` using deprecated parameters (#18015) 2023-07-07 11:32:09 +02:00
Luca Antiga bb4751729e
Add exponential backoff to HTTPQueue put (#18013) 2023-07-07 11:11:09 +02:00
Jirka Borovec 2d5964d7e1
update lightning short desc (#18011) 2023-07-07 08:52:43 +01:00
Adrian Wälchli f4240ca42c
Avoid info message when loading 0 entry point callbacks (#17990) 2023-07-06 12:46:24 +00:00
Adrian Wälchli b14ddd9c49
Fix state dict loading for ddp/dp in Fabric (#17997)
* fix state dict loading for ddp/dp

* test

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* changelog

* update test

* move params to same device before equality test

* test strategy

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-06 13:47:17 +02:00
Adam J. Stewart 017262e5e0
Better type hints for `LightningModule` steps (#17842)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-05 07:14:43 +00:00
Adrian Wälchli 3f4790bd27
Validate selected device indices in `DeepSpeedStrategy` (#17952) 2023-07-04 18:58:38 +00:00
Carlos Mocholí 4532ef7b6e
Update `trainer.fit` docstring (#17981) 2023-07-04 15:53:15 +00:00
Jirka Borovec e49a5b6662
tests: flaky `test_servable_module_validator` (#17985) 2023-07-04 10:04:16 -04:00
Adrian Wälchli 036bd74e47
Disable video controls in Fabric teaser animations (#17984) 2023-07-04 13:27:54 +02:00
Adrian Wälchli c5fae6426e
Show CUDA matmul precision info only ever once (#17960) 2023-07-04 03:47:27 -04:00
github-actions[bot] aafa4db836
Adding test for legacy checkpoint created with 2.0.4 (#17901)
Co-authored-by: Borda <Borda@users.noreply.github.com>
2023-07-03 22:34:00 +00:00
dependabot[bot] 014b8a7bd0
Update fastapi requirement from <0.98.0 to <0.100.0 in /requirements (#17970)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-07-03 21:59:15 +00:00
dependabot[bot] efc9852354
Update numpy requirement from <1.24.4,>=1.17.2 to >=1.17.2,<1.25.1 in /requirements (#17860)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-03 21:51:40 +00:00
Jirka Borovec d249248d1e
ci: mark job as canceled if not TPU allocation fails (#17978)
* ci: mark job as canceled if not TPU allocation fails

* try

* env

* env

* env

* env

* env

* env

* 0

* ${{ github.run_id }}

* token

* cleaning

* if
2023-07-03 22:39:12 +02:00
dependabot[bot] ca3df1ba30
Update redis requirement from <=4.5.5,>=4.0.1 to >=4.0.1,<=4.6.0 in /requirements (#17964)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-07-03 18:45:44 +00:00
Jirka Borovec 00a0f2e8f4
docs: fetch external sources (#17941)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-03 20:16:45 +02:00
Jirka Borovec 37c244f94b
bump Lit-Utils to 0.9 (#17955) 2023-07-03 17:49:00 +00:00
dependabot[bot] 3892500fb3
Update jsonargparse[signatures] requirement from <4.22.0,>=4.18.0 to >=4.18.0,<4.23.0 in /requirements (#17968)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-03 16:49:45 +00:00
dependabot[bot] 2e70b75f63
Update croniter requirement from <1.4.0,>=1.3.0 to >=1.3.0,<1.5.0 in /requirements (#17923)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-07-03 16:30:57 +00:00
Adrian Wälchli 199dc8fe35
Change default init_module behavior for FSDP (#17905) 2023-07-03 17:39:58 +02:00
Jirka Borovec 4265c11e8c
docker: CUDA with runtime (#17977) 2023-07-03 17:39:09 +02:00
dependabot[bot] 8de56eaac7
Update websockets requirement from <=10.4 to <=11.0.3 in /requirements (#17965)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-07-03 15:13:07 +00:00
dependabot[bot] c578342ad0
Update panel requirement from <=1.0.2,>=1.0.0 to >=1.0.0,<=1.1.1 in /requirements (#17967)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-03 15:06:28 +00:00
dependabot[bot] f8bec421e7
Update docker requirement from <6.1.3,>=5.0.0 to >=5.0.0,<6.1.4 in /requirements (#17969)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-07-03 15:02:52 +00:00
dependabot[bot] 0cc964a266
Bump Lightning-AI/utilities from 0.8.0 to 0.9.0 (#17971)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-03 16:37:16 +02:00
Adrian Wälchli c03dd38c6c
Refactor more Fabric tests that use the old .run() method (#17930)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-03 16:26:58 +02:00
Callum Hay 7278fa5007
docs: add example note for gradient accumulation in manual optimization mode (#17975)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-03 10:16:04 -04:00
rjarun8 1484116478
Fix: added map_location as an argument for the cls load_from_checkpoint (#17950)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-07-03 12:45:26 +00:00
jere357 28007bd076
Consistent early exit logic for validation loop in Fabric Trainer example #17962 2023-07-03 08:26:50 -04:00