Carlos Mocholí
|
53ceb156c4
|
Integrate lightning_utilities==0.4.2 (#15817)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
|
2022-12-13 13:13:51 +00:00 |
Adrian Wälchli
|
86568521fd
|
FSDP (native) support for LightningLite (#14967)
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
|
2022-11-21 13:58:37 +00:00 |
Carlos Mocholí
|
6ba00af1e0
|
Drop PyTorch 1.9 support (#15347)
* Drop 1.9
* Everything else
* READMEs
* Missed some
* IPU skips
* Remove exception type
* Add back
|
2022-11-10 08:59:13 -05:00 |
Adrian Wälchli
|
9c20cad40e
|
Fix srun detection causing permission error on non-SLURM platforms (#15485)
* improve srun detection
* changelog
* try catch is obsolete
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2022-11-03 03:14:15 +01:00 |
Adrian Wälchli
|
576757fd79
|
Validate SRUN variables when launching in SLURM (#15011)
|
2022-10-19 21:42:11 +00:00 |
Carlos Mocholí
|
24c26f7db2
|
Standardize Lite's filenames (#15058)
|
2022-10-19 14:09:41 +02:00 |
Carlos Mocholí
|
7ef87464dd
|
Refactor XLA and TPU checks across codebase (#14550)
|
2022-10-04 22:54:14 +00:00 |
Adrian Wälchli
|
d7af8ce2a5
|
Simplify root node resolution for SLURM environment (#14912)
Co-authored-by: Seppo Enarvi <seppo.git@marjaniemi.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
|
2022-09-30 15:40:43 +00:00 |
Adrian Wälchli
|
619e76f22d
|
Remove silent behavior when `num_slurm_tasks` does not correspond to number of processes in Trainer (#14300)
* simplify logic
* remove hpc
* update
* add changelog
* more tests
* update test
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
|
2022-09-16 11:00:09 +00:00 |
Adrian Wälchli
|
024e7b8204
|
Standalone Lite: Cluster Environments (#14509)
|
2022-09-12 12:20:08 +02:00 |