Commit Graph

57 Commits

Author SHA1 Message Date
awaelchli e461e90f84
Update the Multi-GPU docs (#19525) 2024-02-26 22:29:26 -05:00
Carlos Mocholí a1dd9efcf7
Drop XLA XRT support (#19232)
* Drop XLA XRT support
* update test
* set launched
* update conftest
* xla available check
---------

Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-01-10 18:39:20 +01:00
Adrian Wälchli 90043798e4
Clarify `self.log(..., rank_zero_only=True|False)` (#19056)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2023-11-23 13:02:21 -05:00
Adrian Wälchli f6a36cf220
Consistent imports in docs for core APIs (#18869)
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2023-10-27 14:41:29 -04:00
Jirka Borovec 45237c7df9
docs: include external pages 2/2 (#17890) 2023-10-03 19:02:47 +00:00
Jirka Borovec b4b21e0152
docs: 2/3 enable Sphinx nitpicky [pytorch] part 1/n (#18484)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-09-20 19:09:34 +02:00
Adrian Wälchli 04c0675bd0
Update titles in FSDP docs (#18389)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2023-08-25 10:38:01 -04:00
Adrian Wälchli 36a203ba58
Revamp model parallel docs (2/n) (#18325) 2023-08-16 13:22:14 -04:00
Adrian Wälchli 7fe8756917
[TPU] Proper half-precision implementation for XLA (#18213)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-08-11 11:37:41 -04:00
Adrian Wälchli 6552d29a12
Document limitations of multi-GPU in Jupyter notebooks (#18132)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-24 15:22:16 -04:00
Jirka Borovec 00a0f2e8f4
docs: fetch external sources (#17941)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-03 20:16:45 +02:00
Jirka Borovec 6b0ec10ab0
docs: include external pages (#17826)
* pull docs

* local

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ...

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* replace

* strategies

* 1.0.0

* skip

* links

* more

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-21 17:32:20 +02:00
Jirka Borovec db9f095b0b
Replace IPU with external implementation (#17075)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-04-27 16:09:51 +00:00
Carlos Mocholí 16377339cb
Remove reference to outdated Kaggle tutorial (#17390) 2023-04-17 19:12:44 +02:00
Carlos Mocholí 1aa23267ab
Various Fabric documentation updates (#17236) 2023-04-11 23:05:57 +00:00
Carlos Mocholí 0489f2efed
[TPU] v4 support (#17227) 2023-04-11 22:24:11 +00:00
Aniket Maurya 5d58736f08
Rename PL installation to `pip install lightning` (#17074)
* rename PL installation

* update

* Apply suggestions from code review

* Update docs/source-pytorch/starter/installation.rst

Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>

* Update docs/source-pytorch/starter/installation.rst

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2023-03-14 14:47:56 -04:00
Carlos Mocholí f7b89cd233
Review APIs experimental status (#17012) 2023-03-09 16:56:49 +00:00
Aniket Maurya da572d7cca
Refactor PL examples to `examples/pytorch/` (#16925)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2023-03-07 11:24:46 -05:00
Jirka Borovec 52a39c03f8
docs: update `pytorch_lightning` imports (#16864)
* update docs imports

* ci

* fabric

* trigger

* links

* .

* docstring

* chlog

* cleaning
2023-02-27 15:14:23 -05:00
Aditya Kane df6e37da1c
Update torch_xla installation instructions in tpu_basic.rst (#16865) 2023-02-26 00:07:40 +01:00
Carlos Mocholí 0130273eb5
Trainer: auto default (#16847) 2023-02-23 18:42:17 +01:00
Adrian Wälchli 81b7c30291
Make DDP subprocess the default launcher for multi-device (#16780) 2023-02-20 11:20:50 +00:00
Adrian Wälchli 91e692c767
Rename the TPUSpawnStrategy to XLAStrategy (#16781)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-02-17 02:06:24 +00:00
Justus Schock 47c69cd8eb
Remove DP (#16748)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2023-02-16 05:12:08 +00:00
Adrian Wälchli c4074419b5
Remove the BaguaStrategy (#16746)
* remove bagua

* remove

* remove docker file entry
2023-02-14 08:58:58 -05:00
Carlos Mocholí bf51844917
Remove memory-retaining epoch-end hooks (#16520) 2023-02-06 17:00:36 +00:00
Adrian Wälchli acb7ee223c
Ignore generated package files (#16605)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-02-02 09:24:07 +00:00
Adrian Wälchli c68cfd686e
Rename LiteMultiNode to FabricMultiNode (#16505) 2023-01-26 11:36:27 +00:00
Carlos Mocholí 486b4d5d9d
Remove old platform docs (#16499)
* Remove old platform docs

* More

* More
2023-01-25 16:16:51 +01:00
Nikhil Shenoy 5a9b5e363d
Fixes in sphinx docs links (#16255)
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-229-137.ubcsecure.wireless.ubc.ca>
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-227-69.ubcsecure.wireless.ubc.ca>
Co-authored-by: Nikhil Shenoy <nikhilshenoy@dhcp-128-189-228-93.ubcsecure.wireless.ubc.ca>
Fixes https://github.com/Lightning-AI/lightning/issues/8107
2023-01-10 13:19:16 +01:00
Adrian Wälchli 42e72564b7
Fix inclusion of `model_parallel` document (#16197)
fix link to gpu/advanced section
2022-12-26 14:08:55 +09:00
Adrian Wälchli ca88f813a4
Deprecate `auto_select_gpus` (#16147)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-12-22 17:44:07 +01:00
Carlos Mocholí bf8e568845
Deprecate the `HorovodStrategy` (#16141) 2022-12-20 18:38:28 +01:00
Carlos Mocholí 14f441c393
Deprecate nvidia/apex (#16039) 2022-12-20 11:09:39 +01:00
Jerome Anand 6aaac8b910
Enable back inference mode support with hpu & update links (#15918)
* Enable back inference mode support with hpu
* Remove unused
* Update document link and address comment

Signed-off-by: Jerome <janand@habana.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-12-07 14:21:22 +01:00
Carlos Mocholí 6ba00af1e0
Drop PyTorch 1.9 support (#15347)
* Drop 1.9

* Everything else

* READMEs

* Missed some

* IPU skips

* Remove exception type

* Add back
2022-11-10 08:59:13 -05:00
Adrian Wälchli 0f9156374d
Mark internal Lite APIs as protected (#15307)
* mark internal lite apis as protected
* formatting
* docs update

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-10-26 12:51:50 +00:00
Jerome Anand 672b5cbefe
Update obsolete URL in HPU docs (#15112) 2022-10-13 13:27:16 +02:00
Amrutha dfc7886b24
docs: replacement of method type_as in docs to Tensor.to (#15027) 2022-10-08 10:04:15 +00:00
Carlos Mocholí 7ef87464dd
Refactor XLA and TPU checks across codebase (#14550) 2022-10-04 22:54:14 +00:00
Jerome Anand e62521caf1
Update hpu mixed precision link (#14974)
Signed-off-by: Jerome <janand@habana.ai>
2022-10-03 09:05:17 +02:00
Adrian Wälchli 4f3c47294e
Add troubleshooting section to MPS docs (#14642)
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-09-12 17:35:18 +00:00
Jerome Anand 424e78d5b3
Add document to showcase scaleout on hpu (#14357)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-08-25 14:15:01 +05:30
Rohit Gupta e949362a6b
Enable `on_before_batch_transfer` for `DPStrategy` and `IPUAccelerator` (#14023)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-18 12:12:29 +00:00
Carlos Mocholí b88b700745
Remove the deprecated DDP2 strategy (#14026) 2022-08-04 20:27:35 +00:00
Rohit Gupta e78bf2044b
Raise an error if batch transfer hooks are overridden with IPUAccelerator (#13961)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-04 12:04:42 +00:00
Jerome Anand b3203d93d0
Added support for HPU device stats monitor (#13819)
* Added support for HPU device stats monitor

Signed-off-by: Jerome <janand@habana.ai>

* Update changelog

Signed-off-by: Jerome <janand@habana.ai>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>

* Update reference

Signed-off-by: Jerome <janand@habana.ai>

* Apply suggestions from code review

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* fix alignment

* add descriptions

* Update hpu_intermediate.rst

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-02 13:31:31 +05:30
otaj 4c7b9f0b11
Disallow batch sampler with multiple IPU devices (#13854)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 15:20:43 +05:30
Adrian Wälchli d24978baa3
Add ddp_notebook alias for ddp_fork (#13744)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-23 09:06:35 -04:00