Commit Graph

28 Commits

Author SHA1 Message Date
Adrian Wälchli 00909ba3ff
Raise environment variable collision errors only when Fabric CLI is used (#17679)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-05-22 19:12:26 -04:00
Jirka Borovec 384c203532
ruff: PT some more fixes (#17569) 2023-05-05 08:25:15 +02:00
Jirka Borovec f55d10f5ee
ruff: autofix PT (#17541)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-05-04 11:50:39 -04:00
Carlos Mocholí 6ec9a6bd9e
[TPU] Rename classes to use XLA instead of TPU (#17383) 2023-04-28 12:36:22 -04:00
Adrian Wälchli 614dcdf502
True half-precision support in Fabric (#17287)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-04-27 12:37:33 +00:00
Adrian Wälchli 4d17b5fe77
Improved model initialization API for Fabric (#17462)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-04-26 11:25:33 -04:00
Jirka Borovec df97141781
add & apply flake8-simplify (#17386) 2023-04-24 21:57:08 +00:00
Carlos Mocholí 8dac251273
[TPU] Fix PjRT tests (#17408) 2023-04-19 16:39:00 +02:00
Adrian Wälchli 50662eb078
Fixes around `Strategy.set_world_ranks` (#16966)
* don't call set_world_ranks in xla strategy

* update

* fabric and other strategies

* CHANGELOG

* Typos

* Reuse test

---------

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-04-13 17:45:42 +02:00
Carlos Mocholí 0489f2efed
[TPU] v4 support (#17227) 2023-04-11 22:24:11 +00:00
Jirka Borovec 760612fb8a
update list of fist party packages (#16859)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-03 16:55:48 +00:00
Carlos Mocholí 888686e72b
Fix tests on single-GPU machine (#16911) 2023-03-03 01:33:45 +01:00
Carlos Mocholí d486f94dd2
Fabric: auto default (#16842) 2023-02-23 13:45:27 +00:00
Adrian Wälchli 0e4ca7c286
Set accelerator through CLI only if set explicitly (#16818) 2023-02-20 13:45:06 +00:00
Adrian Wälchli 81b7c30291
Make DDP subprocess the default launcher for multi-device (#16780) 2023-02-20 11:20:50 +00:00
Adrian Wälchli 2844e9e246
Fix XLAEnvironment detection on TPU pod (#16806) 2023-02-20 11:01:06 +01:00
Justus Schock ac5fa03385
Introduce new precision layout in fabric (#16767) 2023-02-17 10:41:18 +00:00
Adrian Wälchli c4c4793d56
Fix strategy type validation in connectors (#16693) 2023-02-10 10:50:56 +00:00
Jirka Borovec 770b792925
copyright Lightning AI team (#16647)
* copyright Lightning AI team

* more...
2023-02-06 15:26:51 +01:00
Jirka Borovec 377210d85d
tests: switch imports for fabric (#16592) 2023-02-01 20:34:38 +00:00
Carlos Mocholí ef2a6088ff
Drop support for PyTorch 1.10 (#16492)
* Drop support for PyTorch 1.10

* CHANGELOG

* READMEs

* mypy

* ls

* New poplar version

* Fixed tests

* links

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* skip azure badges

* Table

* Matching dockerfiles

* Drop unnecessary channels and packages

* Push nightly

* Undo unrelated changes

* Revert "Push nightly"

This reverts commit 9618f737c4.

---------

Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-01 14:09:12 -05:00
Carlos Mocholí d78cf99176
Remove the "native" suffix from the codebase (#16490) 2023-01-25 14:09:09 +00:00
Adrian Wälchli 9faa25f86f
Test that connector defaults match the ones in Trainer/Fabric (#16463) 2023-01-23 05:09:45 -05:00
Nikhil Shenoy 81914c7167
LightningFabric: Error handling for accelerator="mps" and ddp strategy pairing (#16455)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-01-22 17:57:24 +00:00
Adrian Wälchli 285cc53738
Make subprocess launcher the default in Lite (#16388) 2023-01-17 10:16:33 +00:00
Adrian Wälchli 0a2ee68ea0
Fix configuration validation error message in Lite CLI (#16334) 2023-01-12 15:09:28 +00:00
Carlos Mocholí 428844d01d
Fabric: drop FairScale's sharded implementation (#16329)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-01-11 17:08:18 +00:00
Carlos Mocholí 15ef52bc73
Rename LightningLite to Fabric (#16244)
* Rename LightningLite to Fabric

* Fix introspection test

* Fix deprecated Lite tests

* Undo accidental Horovod removal

* Fixes
2023-01-04 10:57:18 -05:00