Commit Graph

1931 Commits

Author SHA1 Message Date
Nicki Skafte Detlefsen 2b05e2f113
Fix problems in trainer docs (#17561) 2023-05-04 16:18:30 +02:00
Jakub Kaczmarzyk ebbd1169d8
add note in LightningCLI docs that `--optimizer` must be given for `--lr_scheduler` to work (#17552) 2023-05-04 13:35:05 +02:00
Jirka Borovec e0f335809d
Adding tests for legacy checkpoints - 1.8.x (#17374)
* Adding tests for legacy checkpoints

* 2.0

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix simple

* utils

* import

* pl

* num_features=24

* num_features=24

* length=6000

* other

* rm

* dry run

* rm

* prune

* import

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-05-04 12:33:46 +02:00
Carlos Mocholí 6ec9a6bd9e
[TPU] Rename classes to use XLA instead of TPU (#17383) 2023-04-28 12:36:22 -04:00
Jirka Borovec db9f095b0b
Replace IPU with external implementation (#17075)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-27 16:09:51 +00:00
Adrian Wälchli 614dcdf502
True half-precision support in Fabric (#17287)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-04-27 12:37:33 +00:00
pre-commit-ci[bot] 91cb4b9b87
[pre-commit.ci] pre-commit suggestions (#17271)
* [pre-commit.ci] pre-commit suggestions

updates:
- [github.com/PyCQA/docformatter: v1.4 → v1.6.0](https://github.com/PyCQA/docformatter/compare/v1.4...v1.6.0)
- [github.com/psf/black: 22.12.0 → 23.3.0](https://github.com/psf/black/compare/22.12.0...23.3.0)
- [github.com/charliermarsh/ruff-pre-commit: v0.0.237 → v0.0.260](https://github.com/charliermarsh/ruff-pre-commit/compare/v0.0.237...v0.0.260)

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update

* apply

* fixing

* docs/lines

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-04-26 21:37:41 +02:00
Adrian Wälchli 4d17b5fe77
Improved model initialization API for Fabric (#17462)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-04-26 11:25:33 -04:00
Adrian Wälchli d48ec08d76
Update gradient clipping docs in Fabric (#17470) 2023-04-25 20:20:05 +00:00
Jirka Borovec a5c43d3b2b
docs: fix past versions location (#17432) 2023-04-24 14:07:22 +00:00
Adrian Wälchli 20303e4736
Add Fabric.launch to Fabric methods section (#17437)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-24 10:28:52 +02:00
Adrian Wälchli 92b379e90a
Document access to Fabric attributes inside LightningModule (#17440) 2023-04-22 09:10:06 +02:00
Carlos Mocholí 16377339cb
Remove reference to outdated Kaggle tutorial (#17390) 2023-04-17 19:12:44 +02:00
Carlos Mocholí 13905a3464
Support all `CombinedLoader` modes during evaluation (#17163) 2023-04-16 20:01:52 +02:00
Carlos Mocholí 97a61868fb
Sync module states during non-fit (#17370)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-04-15 02:35:51 +00:00
Carlos Mocholí 1aa23267ab
Various Fabric documentation updates (#17236) 2023-04-11 23:05:57 +00:00
Carlos Mocholí 0489f2efed
[TPU] v4 support (#17227) 2023-04-11 22:24:11 +00:00
Adrian Wälchli 0c02c44c6d
Simplified setup of optimizers in FSDP (#17309)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-11 20:13:41 +00:00
Jirka Borovec 86dbe38913
fix missing tutorials (#17311) 2023-04-11 15:18:18 -04:00
Carlos Mocholí ca27a147db
Update CODEOWNERS (#17322) 2023-04-11 08:45:24 -04:00
Adrian Wälchli ef7da5c445
Minimal Transformer Example (#17282) 2023-04-06 18:32:23 +00:00
Adrian Wälchli 8482e675a5
Fix broken links in README (#17292)
* update links

* one more
2023-04-06 13:50:24 -04:00
edenlightning c2d9ec4e1b
Update docs index (#17246)
* update docs

* update docs index

* Delete version.info

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update glossary

* typo

* restructure

* rm newline

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-05 12:06:40 -04:00
Jirka Borovec 5c5ce60e55
fixing master (#17268)
* fixing master

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-03 15:23:32 -04:00
edenlightning d5ca30aaf1
Add link to benchmarks docs (#17239)
* Update benchmarks.rst

* small fix
2023-04-02 19:54:59 +02:00
Eden Afek 65eb3e7c45 revert docs 2023-03-30 20:14:54 -04:00
Eden Afek 14cf89c788 revert docs 2023-03-30 20:13:42 -04:00
Eden Afek e84e135de8 revert docs 2023-03-30 20:13:07 -04:00
Eden Afek d8cae765c4 Update 2 2023-03-30 18:59:41 -04:00
Eden Afek 3232097fa2 Update 2 2023-03-30 18:57:11 -04:00
Eden Afek b73fb292e2 Update docs structure 2023-03-30 18:46:25 -04:00
Yuxuan Lu 0360a6b42f
Document how to use TensorBoardLogger with fsspec (#16320) 2023-03-27 08:26:42 +00:00
Iain Carmichael 3458258905
Fix `*_batch_end` typos (#17188)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-25 07:21:19 +01:00
Gerson Kroiz 7ec9a97b94
fix typo in docs migration 1_6_regular (#17186) 2023-03-24 20:00:40 -04:00
Yi Heng Lim 27c1faaf8d
Update Custom Callback Docs (#17161)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-03-24 12:42:15 +00:00
Jirka Borovec e7ef8db57e
Replace HPU with external implementation (#17067)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-24 13:30:53 +01:00
Maurice Deserno 572f99b857
Updated conda install commands in docs. (#17162) 2023-03-24 09:57:10 +01:00
Jirka Borovec 32a6d2ff62
docs: update links to 1.6 1.5 1.4 (#17181) 2023-03-24 01:45:31 -04:00
bafangchaoshi a705b21de1
Support BaguaStrategy with external implementation (#17029)
* Support BaguaStrategy with external implementation

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add an introduction of BaguaStrategy

* update the introduction of BaguaStrategy

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

* link

* Fix document formatting issues

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update lightning-bagua version

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update lightning-bagua version

* Apply suggestions from code review

---------

Co-authored-by: Yafen Fang <fangyaf@spaceml1.ethz.ch>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-03-21 12:31:53 -07:00
Kshiteej K 04a2f3309d
update for neptune 1.0 (#16761)
Co-authored-by: Sabine <sabine.nyholm@neptune.ai>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-21 19:00:07 +00:00
Lightning Forever c886317c0c
Restructure Fabric docs (2/n) (#17126)
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-03-20 18:11:26 -04:00
Jirka Borovec a77e33666b
docs: migration guide to the latest [2/n] (#17103)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-20 16:27:10 +00:00
Jirka Borovec 69615f3195
docs order contributors alphabetically (#17102) 2023-03-20 16:24:35 +00:00
Jirka Borovec 5b5f4100cd
docs: build docs for specific tags (#17055)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-03-20 16:24:06 +00:00
Carlos Mocholí e5eab4e0a3
Reference the compatibility matrix (#17091) 2023-03-17 14:51:39 +01:00
Carlos Mocholí 1835cc0bb2
Reference trainer properties in docs (#16969) 2023-03-17 07:28:43 -04:00
Soroush Saryazdi 8074219f02
Typo fix in upgrade from 1.9.x to 2.0 docs: `use_distributed_sample`-> `use_distributed_sampler` (#17113)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-17 07:16:07 -04:00
Carlos Mocholí 4d9b845e01
Update docs for alternative dataset projects (#17096) 2023-03-17 12:00:02 +01:00
Adam J. Stewart b26e5f6e97
Typo fix: on_training_epoch_end -> on_train_epoch_end (#17110)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-17 10:01:00 +00:00
Lightning Forever 88c0d4d81b
Restructure Fabric docs (#17111)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-17 09:42:58 +01:00
Jirka Borovec 5661988253
rtfd: building on PRs only (#17086)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-15 21:04:10 +01:00
Jirka Borovec 2f087ae30e
docs: update broken links & latest/stable (#16994)
* docs: update links to PL latest

* also stable

* last

* Apply suggestions from code review

* Apply suggestions from code review

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* fixing

* .

* fabric

* fixing

* .

---------

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-15 20:19:41 +01:00
Jirka Borovec ad1c994d96
docs: fix link in guide (#17090) 2023-03-15 13:18:43 +00:00
Adrian Wälchli 01834c8529
Exclude some examples from docs navigation (#17081)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-03-15 12:55:34 +00:00
Jirka Borovec f25dfbe2b5
update links to Discord (#17087)
* update links to Discord

* link

* Apply suggestions from code review

Co-authored-by: Luca Antiga <luca.antiga@gmail.com>

* slack

* Update docs/source-app/levels/expert/index.rst

* Apply suggestions from code review

---------

Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-03-15 13:33:40 +01:00
Jirka Borovec 2f39bb656e
docs: migration guide to the `latest` [1/n] (#17034)
* docs: migration guide - structure

* update

* try

* ...

* 1.9

* 1.8

* 1.7

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* cleaning 1.9

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* crosslink

* ...

* placeholder

* 1.6

* placeholders

* 1.5

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* 1.4

* Apply suggestions from code review

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-15 13:32:44 +01:00
Adrian Wälchli 550fa5fec0
Organize app examples (#17045)
* move

* init files

* reset change

* missed

* more

* relative path

* e2e tests

* update checkgroup

* reference

* update app_*
2023-03-14 14:48:17 -04:00
Aniket Maurya 5d58736f08
Rename PL installation to `pip install lightning` (#17074)
* rename PL installation

* update

* Apply suggestions from code review

* Update docs/source-pytorch/starter/installation.rst

Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>

* Update docs/source-pytorch/starter/installation.rst

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2023-03-14 14:47:56 -04:00
Carlos Mocholí 281efc682f
Update data docs (#16839)
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2023-03-14 18:44:49 +01:00
Carlos Mocholí 9cd131c2a6
Add `CombinedLoader` to the API reference (#17062)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-03-14 15:52:05 +01:00
Carlos Mocholí bab80f2e1f
Remove outdated Trainer animation (#17060) 2023-03-14 15:51:07 +01:00
Tianshu Wang f3a20d0a50
Rename `ProgressBarBase` to `ProgressBar` (#17058) 2023-03-14 13:22:47 +00:00
Atharva Phatak ea708da55a
Add `is_wrapped` utility function for Fabric (#16953)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-03-14 13:03:38 +00:00
Carlos Mocholí f1f8050e0e
Remove outdated trainer argument videos (#17071) 2023-03-14 12:44:07 +01:00
Jirka Borovec a8c4d1f552
docs: adding hivemind (#17038)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-03-13 18:17:37 +00:00
Adrian Wälchli 22230c97b1
Add link to past versions in the docs header (#17063)
add past versions link
2023-03-13 18:51:01 +01:00
Carlos Mocholí 8e35384557
Remove outdated LightningModule animation (#17057) 2023-03-13 13:33:14 -04:00
Wouter Zwerink 61fe3d5c9d
Revert "import neptune instead of import neptune.new" (#16898)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-03-13 14:50:30 +00:00
Carlos Mocholí 05084e52cc
Simplify the Trainer core logic (#17017) 2023-03-13 13:43:57 +00:00
Adrian Wälchli a49fd02417
Update introduction video (#17059) 2023-03-13 09:42:29 -04:00
Jirka Borovec 7d16a9e370
docs: update version matrix (#17041) 2023-03-12 23:17:04 +01:00
Adrian Wälchli 6fcccea3fa
Add cute teaser animations to Fabric docs (#17021) 2023-03-10 12:16:07 -05:00
Adrian Wälchli 7c80fe6990
Update links to latest PL docs (#17031) 2023-03-10 14:21:15 +00:00
Jirka Borovec bb69e2c916
rtfd: try to collapse docs (#17020) 2023-03-09 20:05:50 -05:00
Jirka Borovec bb960b841c
rtfd: try another redirect (#17018) 2023-03-09 21:25:39 +01:00
Jirka Borovec 7f44dba7ed
RTFD: building docs for with redirect (#16993)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-09 13:05:30 -05:00
Jirka Borovec 9b168dcf9b
docs: listing past versions (#17014)
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-09 17:52:20 +00:00
Carlos Mocholí f7b89cd233
Review APIs experimental status (#17012) 2023-03-09 16:56:49 +00:00
Adrian Wälchli 66966b405e
Update Fabric docs with installation instructions (#16996) 2023-03-09 13:28:06 +01:00
Carlos Mocholí 3328b047fb
Inline the `ModelIO` interface (#16999) 2023-03-08 18:28:28 +01:00
Carlos Mocholí c294158823
Add compatibility matrix (#16998) 2023-03-08 18:03:03 +01:00
Aniket Maurya da572d7cca
Refactor PL examples to `examples/pytorch/` (#16925)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-03-07 11:24:46 -05:00
Adrian Wälchli 1e42783973
Miscellaneous updates in Fabric docs (#16980) 2023-03-07 15:43:47 +00:00
Ruslan Mukhametshin 863f575caa
Update documentation about configuration files structure (#16956) 2023-03-07 13:31:11 +01:00
Adrian Wälchli 0e84f01b09
Document how to use multiple models and optimizers in Fabric (#16952) 2023-03-07 13:19:43 +01:00
Adrian Wälchli 7749525cbd
Document SLURM interactive mode (#16955) 2023-03-06 20:58:46 +00:00
Adrian Wälchli 54147e0745
Update Fabric docs navigation (#16957) 2023-03-06 16:13:51 +01:00
Adrian Wälchli 5870714224
Switch theme for Fabric (#16961) 2023-03-06 15:50:46 +01:00
Adrian Wälchli f2caa01bb3
Document gradient clipping in Fabric (#16943) 2023-03-05 17:03:57 +00:00
Jirka Borovec ac4180fc2f
docs: add link to lightning-colossalai (#16945) 2023-03-05 10:43:17 +00:00
Jirka Borovec 0e8ac7e1c9
docs: move fabric on its own (#16742)
* docs: move fabric to Lai

* update imports

* links

* drop link to Trainer

* own docs

* ci

* trigger

* prune cross-links

* cleaning

* cleaning

* template

* imports

* template

* path

* links

* tensorboardX

* plugins

* label

* drop fixme

* drop copy nb + examples

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

* try again

* rev

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-03-01 12:36:14 +01:00
Jirka Borovec ac815ec8a1
docs: typo in habana docs (#16910) 2023-03-01 09:22:43 +01:00
Wouter Zwerink dfa35dac99
Require neptune 1.0 (#16888)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-02-28 12:13:42 +01:00
Jirka Borovec f697fff5db
docs: rename source-app (#16863)
* docs: rename source-app

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ci

* group check

* trigger

* param

* fix

* cleaning

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-28 10:04:43 +01:00
Jirka Borovec 52a39c03f8
docs: update `pytorch_lightning` imports (#16864)
* update docs imports

* ci

* fabric

* trigger

* links

* .

* docstring

* chlog

* cleaning
2023-02-27 15:14:23 -05:00
Adrian Wälchli a54f37391f
Explain `configure_sharded_model` in ColossalAI docs (#16872) 2023-02-27 20:45:15 +01:00
Adrian Wälchli 07b89c87ee
Merge DDPStrategy and DDPSpawnStrategy in PL (#16809) 2023-02-27 14:43:23 -05:00
Adrian Wälchli e48613207a
Promote `Fabric.launch()` as the default experience in Fabric docs (#16878) 2023-02-27 08:19:54 -05:00
Darren Tuit cdf21a1305
Fix imports for lightning cli examples (#16871) 2023-02-26 00:09:34 +01:00
Aditya Kane df6e37da1c
Update torch_xla installation instructions in tpu_basic.rst (#16865) 2023-02-26 00:07:40 +01:00
Carlos Mocholí 0130273eb5
Trainer: auto default (#16847) 2023-02-23 18:42:17 +01:00
Carlos Mocholí d486f94dd2
Fabric: auto default (#16842) 2023-02-23 13:45:27 +00:00
Justus Schock 598c2476cd
Remove implicit frontend testing from `testing.run_app_in_cloud` (#16741)
Co-authored-by: Ethan Harris <ethanwharris@gmail.com>
2023-02-22 14:48:10 +00:00
Carlos Mocholí 914effa04c
Rename `replace_sampler_ddp|replace_sampler` to `use_distributed_sampler` (#16829)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-02-22 14:07:02 +01:00
Carlos Mocholí b30a43f783
Move the `CombinedLoader` to an utility file (#16819) 2023-02-20 18:06:35 +01:00
Adrian Wälchli 65e66814f8
Remove the `*_step_end` hooks (#16791)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-02-20 13:04:40 +00:00
Carlos Mocholí 781768d2b2
Remove `Trainer(multiple_trainloader_mode)` in favor of `CombinedLoader(mode)` (#16800) 2023-02-20 13:32:06 +01:00
Adrian Wälchli 81b7c30291
Make DDP subprocess the default launcher for multi-device (#16780) 2023-02-20 11:20:50 +00:00
Mauricio Villegas 3a0519143a
Fix bug in lightning_cli_advanced_3.rst (#16792) 2023-02-20 12:12:17 +01:00
Sebastian Raschka a4f4b5372a
Add missing docs quote (#16797)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-02-20 11:09:55 +01:00
Justus Schock 0fee28409b
Introduce new precision layout in PL (#16783) 2023-02-17 17:58:14 +01:00
Justus Schock ac5fa03385
Introduce new precision layout in fabric (#16767) 2023-02-17 10:41:18 +00:00
Adrian Wälchli 91e692c767
Rename the TPUSpawnStrategy to XLAStrategy (#16781)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-17 02:06:24 +00:00
Adrian Wälchli ad698f049b
Update Colossal AI docs and integration (#16778) 2023-02-16 16:14:24 +00:00
Justus Schock 47c69cd8eb
Remove DP (#16748)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-02-16 05:12:08 +00:00
Lightning Forever 41dd0d1f85
Remove the QuantizationAwareTraining callback (#16750) 2023-02-15 17:29:49 -05:00
Adrian Wälchli 83f4c83582
Replace ColossalAIStrategy with external implementation (#16757) 2023-02-15 15:11:52 +00:00
Adrian Wälchli c4074419b5
Remove the BaguaStrategy (#16746)
* remove bagua

* remove

* remove docker file entry
2023-02-14 08:58:58 -05:00
Adrian Wälchli 39020887d2
Remove Trainer's `track_grad_norm` argument (#16745) 2023-02-14 12:38:17 +00:00
Adrian Wälchli 99cb2cd056
Remove argparse utils (#16708)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-02-13 20:44:30 +00:00
Adrian Wälchli 67c09e3092
Separate the Gradient Accumulation Scheduler from Trainer (#16729)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-13 20:15:38 +00:00
Jirka Borovec 0be025e8b7
rename docs/source-app & adjust docs links for lightning (#16676)
* update CI

* config / import

* lightning_app imports

* source/ dir

* html

* ci: dirs

* pr

* req dir

* on push

* rename

* drop

* cleaning
2023-02-13 10:59:02 +01:00
Carlos Mocholí 457cd76d1a
Remove the unused `utilities.finite_checks` (#16682) 2023-02-09 21:11:05 +01:00
Adrian Wälchli 18106a8f95
Split train- and val progress into separate bars (#16695) 2023-02-09 19:43:50 +00:00
Adrian Wälchli 83296cc6cf
Update Fabric introduction (#16672)
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2023-02-09 18:06:29 +00:00
Carlos Mocholí bf51844917
Remove memory-retaining epoch-end hooks (#16520) 2023-02-06 17:00:36 +00:00
Adrian Wälchli cd0eedb082
Set `find_unused_parameters=False` as the default (#16611) 2023-02-06 16:51:21 +01:00
Sebastian Raschka ce424d235f
Move fsdp_native to fine-tuning recommendation (#16630) 2023-02-05 15:09:46 +01:00
JiHoon Kim 65abdeea88
Fabric docs typo correction (#16635) 2023-02-05 01:09:58 +01:00
Adrian Wälchli 0f75dce8b4
Add MPI cluster environment (#16570)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-03 10:45:11 +00:00
Adrian Wälchli acb7ee223c
Ignore generated package files (#16605)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-02-02 09:24:07 +00:00
Carlos Mocholí 21d8fbfb2f
Fix broken links after reverse mirror changes (#16600) 2023-02-01 20:00:44 +00:00
Carlos Mocholí ef2a6088ff
Drop support for PyTorch 1.10 (#16492)
* Drop support for PyTorch 1.10

* CHANGELOG

* READMEs

* mypy

* ls

* New poplar version

* Fixed tests

* links

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* skip azure badges

* Table

* Matching dockerfiles

* Drop unnecessary channels and packages

* Push nightly

* Undo unrelated changes

* Revert "Push nightly"

This reverts commit 9618f737c4.

---------

Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-01 14:09:12 -05:00
Adrian Wälchli 01b152f169
Update docs for multiple optimizers in 2.0 (#16588) 2023-02-01 17:34:55 +00:00
Carlos Mocholí dc298f2340
Drop support for Python 3.7 (#16579)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-02-01 01:36:42 +00:00
Andrea Tupini d634846b5e
Minor formatting fix on model_parallel docs (#16565) 2023-01-30 12:40:03 -05:00
Adrian Wälchli 8aca46a192
Remove `using_lbfgs` argument from `optimizer_step` module hook (#16538) 2023-01-30 12:49:35 +00:00
Adrian Wälchli 1008f313e8
Remove `on_tpu` argument from `optimizer_step` module hook (#16537)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-01-30 13:17:20 +01:00
Jirka Borovec 879701f52f
ci: hotfix precommit/poetry/isort (#16549) 2023-01-30 11:07:52 +01:00
Adrian Wälchli bb7b8d601a
Fabric docs feedback 2/n (#16480) 2023-01-27 20:13:20 +01:00
Carlos Mocholí 226290cfc1
PyTorch 2.0 switched the `set_to_none` default (#16531)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-01-27 16:51:56 +00:00
Adrian Wälchli b216a114a7
Decouple Tuner from Trainer (#16462)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-27 15:08:40 +00:00
Carlos Mocholí d562319a61
Make the `FaultToleranceCheckpoint` callback opt-in (#16512) 2023-01-27 13:02:14 +00:00
belerico b5599e1320
Add reinforcement learning example for Fabric (#16506)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: Luca Antiga <luca@lightning.ai>
2023-01-27 11:28:25 +00:00
Carlos Mocholí 76cb048b29
Remove docs about automatic fault tolerance (#16500)
Remove docs about the experimental automatic fault tolerance
2023-01-26 19:47:40 +01:00
Adrian Wälchli c68cfd686e
Rename LiteMultiNode to FabricMultiNode (#16505) 2023-01-26 11:36:27 +00:00
Adrian Wälchli dfd8d80cb1
Multi-node documentation for Fabric (#16495)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-01-25 22:07:09 +00:00
Rick Izzo 0aef0375cb
Docs: Update BYOC IAM Policy Permissions (#16474)
Added codebuild permissions to IAM policy JSON
2023-01-25 16:27:15 +00:00
Carlos Mocholí 486b4d5d9d
Remove old platform docs (#16499)
* Remove old platform docs

* More

* More
2023-01-25 16:16:51 +01:00
Shreya Purohit 139e2659f4
Fix minor typos in sharing_components.rst (#16468)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-01-25 15:15:45 +01:00
Carlos Mocholí d78cf99176
Remove the "native" suffix from the codebase (#16490) 2023-01-25 14:09:09 +00:00
Adrian Wälchli 8147e9b111
Grammar corrections for Fabric docs (#16494) 2023-01-25 11:45:09 +01:00
Jirka Borovec 3f365c8997
docs: move `assets_lightning` to pl-public-data (#16419) 2023-01-24 22:55:53 +00:00
Adrian Wälchli c87bb71fa8
Add `Fabric.all_reduce` (#16459) 2023-01-24 22:35:00 +00:00
Carlos Mocholí 5891cdc940
Mark the loop classes as protected (#16445) 2023-01-23 16:30:13 +00:00
Carlos Mocholí 39b7cb80ca
Remove the FairScale integration (#16400)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-01-23 13:39:04 +00:00
Adrian Wälchli 3611fcd152
Update Fabric docs based on user feedback (#16460)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-01-23 14:28:20 +01:00
Nash cc56539cd3
fixing typos reported by community user (#16457) 2023-01-21 01:21:45 +01:00
Carlos Mocholí d3de5c64d7
Remove the deprecated code in `pl.utilities.data` (#16440) 2023-01-20 01:03:55 +01:00
Adrian Wälchli 39acb81b9b
Fabric checkpointing 1/n: base implementation (#16434)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-19 20:40:12 +00:00
Carlos Mocholí 8f736372ed Loop flattening: reduce base interface (#16429)
* Loop flattening: remove the default `.run()` implementation

* None return

* mypy

* Loop flattening: reduce base interface

* Fix

* Docs

* Bad merge

* Fix

* Fix
2023-01-19 18:39:36 +01:00
Carlos Mocholí da82d490f3 Remove the deprecated code in `pl.utilities.optimizer` (#16439) 2023-01-19 18:39:36 +01:00
Carlos Mocholí 0cf0e90e4a Remove the deprecated code in `pl.utilities.cloud_io` (#16438) 2023-01-19 18:39:36 +01:00
Carlos Mocholí df795b45c0 Remove the deprecated code in `pl.utilities.seed` (#16422) 2023-01-19 18:39:36 +01:00
Carlos Mocholí f031f1e453 Remove the `HivemindStrategy` (#16407)
Remove the collaborative strategy
2023-01-19 18:39:36 +01:00
Carlos Mocholí 04b929c2af Remove the deprecated code in `pl.utilities.apply_func` (#16413) 2023-01-19 18:39:36 +01:00
Carlos Mocholí 256199ff7c Remove support for logging multiple metrics together (#16389) 2023-01-19 18:39:36 +01:00
Carlos Mocholí 46246c3336 Loop flattening: remove `.connect()` (#16384) 2023-01-19 18:39:36 +01:00
Carlos Mocholí e9a5c7489a Loop flattening: remove `.replace()` (#16361) 2023-01-19 18:39:36 +01:00
Adrian Wälchli 03a699693b Remove truncated backpropagation from loops (#16337)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-19 18:39:36 +01:00
Adrian Wälchli d3b04816ff Remove the deprecated `resume_from_checkpoint` Trainer argument (#16167)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-19 18:39:36 +01:00
Adrian Wälchli 44e8f6a980 Remove the deprecated Trainer device arguments (#16171)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-19 18:39:36 +01:00
Jirka Borovec 799ced8430
ci: replace flake8 by ruff (#16433)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-01-19 11:48:28 -05:00
Adrian Wälchli 85786d0c83
Distributed communication docs for Lite (#16373)
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-01-18 22:30:51 +00:00
Akihiro Nitta fb12879fde
[docs][App] Include components in the API reference (#16414) 2023-01-18 09:06:21 +00:00
Carlos Mocholí cfe87a0b56
Clarify cluster advanced docs (#16403) 2023-01-17 14:58:01 +00:00
Justus Schock e61bead324
rename flow _exit (#16378)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-01-17 11:06:50 +00:00
Carlos Mocholí 0f4f809a7c
Deprecate the FairScale integration (#16353) 2023-01-17 02:29:06 +01:00
Jirka Borovec 13c3136063
move `tensorboardX` to extra (#16349)
* tensorboardX > extra

* default

* chlog

* doctest_skip

* mypy

* Update docs

* plus

* Apply suggestions from code review

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>

* fix

* mypy

* docs

* Apply suggestions from code review

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* .

* ll

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
2023-01-16 09:06:14 +01:00
Akihiro Nitta fa73327828
[App] Rename gradio.py gradio_server.py (#16201) 2023-01-14 13:00:37 +00:00
Raphael Randschau 1778c20e6f
update documentation with upcoming supported regions (#16331) 2023-01-13 14:12:17 +01:00
Adrian Wälchli 8f1269283f
Add CSVLogger for Lightning Lite (#16346)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-01-13 13:09:44 +00:00
Akihiro Nitta 5de93975c6
Document how to run apps with a local version of Lightning on the cloud (#16163)
Co-authored-by: Akihiro Nitta <akihiro@lightning.ai>
2023-01-12 17:48:35 +00:00
Adrian Wälchli 54d3e2c3ee
Lite Example: Model Agnostic Meta Learning (MAML) (#16333)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-01-12 14:31:34 +00:00
Adrian Wälchli b0ea836b16
Update Lightning Lite docs (6/n) (#16342) 2023-01-12 13:37:24 +00:00
Adrian Wälchli 426c463721
Address feedback for new Lite docs (#16330) 2023-01-12 13:08:32 +01:00
Carlos Mocholí 428844d01d
Fabric: drop FairScale's sharded implementation (#16329)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-01-11 17:08:18 +00:00
Carlos Mocholí 4bc2080c71
Remove lightning_transformers from our docs (#16335) 2023-01-11 15:45:09 +01:00
Adrian Wälchli a8605b4540
Update Lightning Lite docs (5/n) (#16291)
* organize

* organize

* organize

* organize

* Fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* accelerator

* distributed launch

* notebooks

* code structure

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* lightning_module

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* x

* update

* conflicts

* fix duplicates

* links.rst

* api folder

* add todo for build errors

* resolve duplicate reference warnings

* address review by eden

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-01-10 14:11:03 -05:00
XuhuiRen ce4cbe2b4b
Add an introduction document for using Intel Neural Compressor to conduct post-training quantization (#16085)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-01-10 17:33:27 +00:00
Nikhil Shenoy 5a9b5e363d
Fixes in sphinx docs links (#16255)
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-229-137.ubcsecure.wireless.ubc.ca>
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-227-69.ubcsecure.wireless.ubc.ca>
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-228-93.ubcsecure.wireless.ubc.ca>
Fixes https://github.com/Lightning-AI/lightning/issues/8107
2023-01-10 13:19:16 +01:00
Lightning Forever f24349bb64
Logger support in Lite (#16121)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-09 18:33:18 +00:00
Mohammed Kamran 2166ce2bd9
Fix-multiple-loggers-typo (#16305) 2023-01-09 17:20:02 +01:00
Adrian Wälchli 6bf6540ee5
Update Lightning Lite docs (4/n) (#16246)
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2023-01-06 15:54:19 +00:00
Adrian Wälchli a913db8e88
Update Lightning Lite docs (3/n) (#16245) 2023-01-06 04:08:55 -05:00
Nikhil Shenoy a6d23ed898
Skip sphinx `linkcheck` on CHANGELOG files (#16259)
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-229-137.ubcsecure.wireless.ubc.ca>
2023-01-05 13:25:14 -05:00
Adrian Wälchli b0c272e8b7
Restructure Lite examples and add GAN (#16240)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-01-05 14:07:43 +00:00
pre-commit-ci[bot] b59941cc52
[pre-commit.ci] pre-commit suggestions (#16224)
* [pre-commit.ci] pre-commit suggestions

updates:
- [github.com/pre-commit/pre-commit-hooks: v4.3.0 → v4.4.0](https://github.com/pre-commit/pre-commit-hooks/compare/v4.3.0...v4.4.0)
- [github.com/asottile/pyupgrade: v2.34.0 → v3.3.1](https://github.com/asottile/pyupgrade/compare/v2.34.0...v3.3.1)
- https://github.com/myint/docformatter → https://github.com/PyCQA/docformatter
- [github.com/PyCQA/docformatter: v1.4 → v1.5.1](https://github.com/PyCQA/docformatter/compare/v1.4...v1.5.1)
- [github.com/asottile/yesqa: v1.3.0 → v1.4.0](https://github.com/asottile/yesqa/compare/v1.3.0...v1.4.0)
- [github.com/PyCQA/isort: 5.10.1 → 5.11.4](https://github.com/PyCQA/isort/compare/5.10.1...5.11.4)
- [github.com/psf/black: 22.6.0 → 22.12.0](https://github.com/psf/black/compare/22.6.0...22.12.0)
- [github.com/executablebooks/mdformat: 0.7.14 → 0.7.16](https://github.com/executablebooks/mdformat/compare/0.7.14...0.7.16)

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-01-04 18:48:35 -05:00
Nikhil Shenoy 99936ed3f4
Added Optional `sphinx` docs linkcheck (#16234)
Fixes https://github.com/Lightning-AI/lightning/issues/8107
2023-01-04 23:05:12 +01:00
Adrian Wälchli a3e2b4c021
Update Lightning Lite docs (2/n) (#16239)
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2023-01-04 18:11:29 +00:00
Adrian Wälchli 085ad548ca
Update Lightning Lite docs (1/n) (#16250)
* fabric docs

* fix reference

* fabric
2023-01-04 12:25:36 -05:00
Carlos Mocholí 15ef52bc73
Rename LightningLite to Fabric (#16244)
* Rename LightningLite to Fabric

* Fix introspection test

* Fix deprecated Lite tests

* Undo accidental Horovod removal

* Fixes
2023-01-04 10:57:18 -05:00
Adrian Wälchli 42e72564b7
Fix inclusion of `model_parallel` document (#16197)
fix link to gpu/advanced section
2022-12-26 14:08:55 +09:00
Raphael Randschau 5e1db28bf4
update BYOC documentation with AWS details (#16044)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-12-24 04:35:06 +01:00
David Gilbertson 94e6d52b7e
docs: fix order of on_fit_start() hook (#16180)
Fixes https://github.com/Lightning-AI/lightning/issues/16170
2022-12-22 21:06:57 +01:00
Adrian Wälchli ca88f813a4
Deprecate `auto_select_gpus` (#16147)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-12-22 17:44:07 +01:00
edenlightning 1c196da309
Update fault_tolerant_training_basic.rst (#16012) 2022-12-22 07:16:02 +00:00
Carlos Mocholí 5c6ddee09f
Remove deprecated items from api reference (#16151) 2022-12-21 22:25:25 +01:00
Adrian Wälchli 1c0228833d
Fix broken link in "Build a Model" section of docs (#16025)
fix build model link
2022-12-21 12:21:50 +01:00
Carlos Mocholí bf8e568845
Deprecate the `HorovodStrategy` (#16141) 2022-12-20 18:38:28 +01:00
Carlos Mocholí 14f441c393
Deprecate nvidia/apex (#16039) 2022-12-20 11:09:39 +01:00
Sherin Thomas 7cbdc68449
[App] Cold start proxy in autoscaler (#16094)
* cold start proxy

* Update src/lightning_app/components/serve/auto_scaler.py

* changelog

* better-doc

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2022-12-20 09:47:10 +00:00
Akihiro Nitta 39d27f6370
[App] Improve the autoscaler UI (#16063)
2022-12-19 10:25:24 +00:00
Jirka Borovec c8b4bce3ee
docs: add PT version (#16010)
* docs: add PT version

* stable

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-12-17 08:27:21 +01:00
Carlos Mocholí b1ce2639f4
Better check for programmatic lightningignore (#16080)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2022-12-16 09:49:17 +00:00
Nikhil Shenoy 53759825bb
Document Gradient Clipping during Manual Optimization (#16023)
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-224-163.ubcsecure.wireless.ubc.ca>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-12-13 21:21:59 +00:00
Carlos Mocholí edd2b4259a
Introduce `{Work,Flow}.lightningignore` (#15818) 2022-12-13 13:50:08 +00:00
Mauricio Villegas ed52823c3f
LightningCLI support for optimizers and schedulers via dependency injection (#15869)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-12-12 15:36:19 +00:00
Quazi Marufur Rahman 2577285dd5
Use LRScheduler for torch >= 1.14 otherwise use _LRScheduler (#15768)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-12-12 13:33:26 +00:00
Adrian Wälchli 7a1e0e801e
Fix typo in definition of world size in docs (#15954) 2022-12-08 18:06:12 +00:00
Akihiro Nitta e6f4c8484b
[docs] Include all components in the API reference (#15805)
* Update docs

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2022-12-08 04:38:27 +00:00
Ethan Harris e250dfe2b3
[App] Remove `SingleProcessRuntime` (#15933)
* Remove SingleProcessRuntime
* Remove unused queues
* Docs
2022-12-07 23:22:40 +00:00
Rick Izzo 20419089b6
ENG-627: Docs for CloudCompute Mount Argument (#15182)
fixed conflicts
2022-12-07 15:11:52 +00:00
Akihiro Nitta 64b19fb16f
[App] Introduce auto scaler (#15769)
* Exclude __pycache__ in setuptools

* Add load balancer example

* wip

* Update example

* rename

* remove prints

* _LoadBalancer -> LoadBalancer

* AutoScaler(work)

* change var name

* remove locust

* Update docs

* include autoscaler in api ref

* docs typo

* docs typo

* docs typo

* docs typo

* remove unused loadtest

* remove unused device_type

* clean up

* clean up

* clean up

* Add docstring

* type

* env vars to args

* expose an API for users to override to customise autoscaling logic

* update example

* comment

* update var name

* fix scale mechanism and clean up

* Update example

* ignore mypy

* Add test file

* .

* update impl and update tests

* Update changelog

* .

* revert docs

* update test

* update state to keep calling 'flow.run()'

Co-authored-by: Aniket Maurya <theaniketmaurya@gmail.com>

* Add aiohttp to base requirements

* Update docs

Co-authored-by: Luca Antiga <luca.antiga@gmail.com>

* Use deserializer utility

* fake trigger

* wip: protect /system/* with basic auth

* read password at runtime

* Change env var name

* import torch as optional

* Don't overcreate works

* simplify imports

* Update example

* aiohttp

* Add work_args work_kwargs

* More docs

* remove FIXME

* Apply Jirka's suggestions

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* clean example device

* add comment on init threshold value

* bad merge

* nit: logging format

* {in,out}put_schema -> {in,out}put_type

* lowercase

* docs on seconds

* process_time -> processing_time

* Dont modify work state from flow

* Update tests

* worker_url -> endpoint

* fix example

* Fix default scale logic

* Fix default scale logic

* Fix num_pending_works

* Update num_pending_works

* Fix bug creating too many works

* Remove up/downscale_threshold args

* Update example

* Add typing

* Fix example in docstring

* Fix default scale logic

* Update src/lightning_app/components/auto_scaler.py

Co-authored-by: Noha Alon <nohalon@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* rename method

* rename locvar

* Add todo

* docs ci

* docs ci

* asdfafsdasdf pls docs

* Apply suggestions from code review

Co-authored-by: Ethan Harris <ethanwharris@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* .

* doc

* Update src/lightning_app/components/auto_scaler.py

Co-authored-by: Noha Alon <nohalon@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Revert "[pre-commit.ci] auto fixes from pre-commit.com hooks"

This reverts commit 24983a0a5a.

* Revert "Update src/lightning_app/components/auto_scaler.py"

This reverts commit 56ea78b45f.

* Remove redefinition

* Remove load balancer run blocker

* raise RuntimeError

* remove has_sent

* lower the default timeout_batching from 10 to 1

* remove debug

* update the default timeout_batching

* .

* tighten condition

* fix endpoint

* typo in runtimeerror cond

* async lock update servers

* add a test

* {in,out}put_type typing

* Update examples/app_server_with_auto_scaler/app.py

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>

* Update .actions/setup_tools.py

Co-authored-by: Aniket Maurya <theaniketmaurya@gmail.com>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Noha Alon <nohalon@gmail.com>
Co-authored-by: Ethan Harris <ethanwharris@gmail.com>
Co-authored-by: Akihiro Nitta <aki@pop-os.localdomain>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-12-07 14:27:44 +01:00
Jerome Anand 6aaac8b910
Enable back inference mode support with hpu & update links (#15918)
* Enable back inference mode support with hpu
* Remove unused
* Update document link and address comment

Signed-off-by: Jerome <janand@habana.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-12-07 14:21:22 +01:00
Adrian Wälchli 2debd1c6b6
Simplify enabling CPU offload in FSDP (#15832)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2022-12-07 02:55:47 +00:00
Adrian Wälchli 05dbf48ad0
Activation checkpointing in FSDP without boilerplate (#15826)
* initial
* input type
* checkpointing
* fsdp in pl
* all_close

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-12-06 15:45:33 +00:00
Raphael Randschau a82be2ffad
[CLI] drop name column from cluster list (#15721)
* drop name column from cluster list

* change create cluster to accept id as well

* rename validator

* remove cluster name from logs

* fix merge with master

* more merge with master issues
2022-12-02 19:27:24 -08:00
Carlos Mocholí 1bc3a97e95
Remove checks for torch greater than 1.10 (#15846) 2022-12-01 21:41:22 +01:00
Luca Furst 33e1f9345d
[App] Improve cluster creation / deletion experience (#15458)
Cluster creation and deletion can take a long time. Instead of having these long-running operations happen in the background, they should happen in the foreground. The advantage is that failures are brought to the user's attention immediately, instead of the next time they decide to run `lightning list clusters`.

While the CLI waits for the cluster to run / delete, it will display cluster status changes to the user.

This PR also hides the `--enable-performance` and `--edit-before-creation` creation flags, as well as the `--force` deletion flag. They are either not frequently used (performance mode is expensive), or prone to misuse.


Co-authored-by: Neven Miculinic <neven.miculinic@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Raphael Randschau <nicolai86@users.noreply.github.com>
2022-11-28 16:38:43 +00:00
Mauricio Villegas 3fad651f99
Deprecate description and env parameters in LightningCLI.__init__ (#15651)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-11-27 10:52:01 +00:00
Laverne Henderson 70126df494
Added note about custom base images (#14125)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-11-26 21:58:38 +00:00
Rick Izzo 317591d2e2
Fix App Docs for lightning ssh-keys command (#15773)
fixed ssh-keys docs
2022-11-22 15:39:20 -05:00
Kaushik B 2b61c92ceb
Fix the `examples/app_dag` App (#14359)
* Fix app dag example
* Add test
* Update doc
* Update tests/tests_app_examples/test_app_dag.py

Co-authored-by: Sherin Thomas <sherin@grid.ai>
2022-11-22 09:39:44 +01:00
Justus Schock 006fde9747
FCCV Docs (#15598)
* add custom data iter docs

* add custom data iter docs

* Update docs/source-pytorch/data/custom_data_iterables.rst

* remove ToDevice

* nit

* Update docs/source-pytorch/data/custom_data_iterables.rst

Co-authored-by: Luca Antiga <luca.antiga@gmail.com>

* clarification for @lantiga

* typo

* Update docs/source-pytorch/data/custom_data_iterables.rst

* Update docs/source-pytorch/data/custom_data_iterables.rst

* Update docs/source-pytorch/data/custom_data_iterables.rst

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
2022-11-21 15:30:58 -05:00
Ethan Harris bc797fd376
[App] Fix multi-node pytorch example CI (#15753) 2022-11-21 16:02:30 +00:00
Yurij Mikhalevich 1a31d13521
feature(docs/app/lit_tabs): add works (#15731) 2022-11-21 13:46:01 +01:00
Ethan Harris 83067977af
[App] Update multi-node examples (#15700)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-11-21 10:41:59 +00:00
Yiftach Beer f40eb2cb1c
Fix broken link to CLI docs (#15723) 2022-11-19 14:17:11 +01:00
yiftachbeer d925077253
Fix typo in script name (#15724) 2022-11-19 12:26:35 +01:00
dymil 609b258fe8
Fix typo 'wether' (#15710) 2022-11-18 00:09:58 +01:00
yiftachbeer b40ca0ece5
remove unused random_split import from tutorial (#15716) 2022-11-18 00:09:46 +01:00
Yurij Mikhalevich befd3f6c22
fix(docs/app/lit_tabs): remove unused app_id, enable run instead (#15702) 2022-11-17 10:13:09 +01:00
William Falcon 41f1a36c5b
docs 5/n (#15669)
* examples

* fix few examples

* Update pl_multinode.py

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-11-17 08:10:14 +01:00
Nash 9670fa8b18
Docs: Fix import for scikit in XGBoost template (#15693) 2022-11-16 09:19:03 +01:00
Yurij Mikhalevich 4837df47bd
fix(docs/app): broken links in the intermediate/web-ui section (#15691) 2022-11-15 15:43:51 +01:00
Rick Izzo 23f88cd0a4
Fixed Import in Docs For Multinode Trainer Name Which does Not Exist (#15663) 2022-11-12 19:17:29 -05:00
William Falcon 4e8cf85b0c
Update run_ptl_script.py 2022-11-11 13:37:42 -05:00
Adrian Wälchli 8c9b688f2b
Update docs with new Lightning Lite usage 1/n (#15600) 2022-11-11 16:43:25 +00:00
Akihiro Nitta 164e5a7b3e
Remove the docs for passing strategy args to accelerator (#15636) 2022-11-11 05:46:10 -05:00