Commit Graph

466 Commits

Author SHA1 Message Date
Thomas Viehmann 28d3afde85
update links (#20431) 2024-11-19 14:10:02 +01:00
Thibaut Durand b0aa504f80
docs: fix incorrect link in expert level up (#20408) 2024-11-12 22:43:49 +01:00
WuJian 6f86497f55
docs: fix pytorch version typo in upgrade/from_2_0 (#20333) 2024-10-21 16:11:22 +02:00
Jirka Borovec 0e1e14f815
docs: fix removed ref to `deepspeed.initialize` (#20353)
* docs: fix removed ref to `deepspeed.initialize`

* fix links
2024-10-21 06:47:30 -07:00
Nishant Dahal 66508ff4b7
docs: add note for `TQDMProgressBar` (#20198)
* Add documentation note for TQDMProgressBar

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-09-30 18:29:21 +02:00
Jirka Borovec 70f188b95c
docs: fix broken links to W&B (#20298) 2024-09-26 11:52:33 -04:00
Emi 48279a7961
Update favicon to match Lightning AI app consistency (#20290)
* Update favicon

* Update favicons - all sizes
2024-09-20 10:08:45 +02:00
awaelchli b3ee85d3a3
Prepare Lightning 2.4.0 release (#20154)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2024-08-05 17:32:32 -04:00
Jirka Borovec 3de60f4b9f
docs: fix typo in `linkcheck_ignore` (#20164) 2024-08-05 10:55:36 -04:00
awaelchli d4de8e20e9
Count number of modules in train/eval mode in ModelSummary (#20159) 2024-08-04 15:28:26 -04:00
Jonas Tingeborn e61eafa671
Add ability for TQDMProgressBar to retain prior epoch training bars (#19578)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2024-08-04 03:28:26 -04:00
Antonios P. Sarikas 854d166290
Docs: Add note about version counter in `ModelCheckpoint` (#20146) 2024-08-03 20:56:25 -04:00
awaelchli 1bc2aadd46
Update type check workflow to PyTorch 2.4 (#20155) 2024-08-03 15:49:42 +02:00
PL Ghost 12881e838a
docs: update ref to latest tutorials (#20132)
* update tutorials to `565f61a9`
* update CI
* skip link

---------

Co-authored-by: Borda <Borda@users.noreply.github.com>
Co-authored-by: jirka <jirka.borovec@seznam.cz>
2024-07-28 22:57:05 +02:00
awaelchli bf25167bbf
Add testing for PyTorch 2.4 (Trainer) (#20010) 2024-07-11 06:52:56 -04:00
awaelchli 50af052b31
Add reference to the `torch.compile` manual (#20066) 2024-07-09 12:18:35 -04:00
pre-commit-ci[bot] a40affb953
[pre-commit.ci] pre-commit suggestions (#20035)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2024-07-05 12:17:15 -04:00
awaelchli 330af381de
Remove the lightning app code (#20039)
* remove source, tests, docs, workflows

* update checkgroup

* update codeowners

* update workflows

* package setup

* config files

* update

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* remove entry point

* docs

* __main__

* remove store

* leftover store removals

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-07-05 13:56:29 +02:00
awaelchli b8a5236daa
Move checkpoint customization docs sections (#20006)
move docs sections
2024-07-04 09:39:26 +02:00
awaelchli 14493c0685
Drop PyTorch 2.0 from the test matrix (#20009) 2024-06-30 18:02:00 -04:00
PL Ghost fa5af16424
docs: Bump HPU ref `1.6.0` (#20026)
---------

Co-authored-by: jerome-habana <jerome-habana@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2024-06-28 14:47:45 +02:00
PL Ghost aa2da72ab9
docs: Bump HPU ref `1.5.0` (#19843)
* bumping HPU version -> (1.5.0)
* fix build warning
* the HPU also need some images
* Apply suggestions from code review

---------

Co-authored-by: jerome-habana <jerome-habana@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2024-06-28 14:28:01 +02:00
awaelchli 014cdd84ed
Update code owners file (#19922)
* update code owners

* update

* Update .github/CODEOWNERS

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2024-05-30 06:12:41 -04:00
awaelchli 98005bbed0
Add Studio badge to tensor parallel docs (#19913) 2024-05-28 09:04:55 -04:00
awaelchli c09356db1e
(10/10) Support 2D Parallelism - Port Fabric docs to PL (#19899) 2024-05-23 08:55:52 -04:00
awaelchli 987c2c4093
(7/n) Support 2D Parallelism - TP Fabric Docs (#19884)
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2024-05-22 06:20:40 -04:00
awaelchli 0c8a193d3c
(1/n) Support 2D Parallelism (#19846) 2024-05-07 17:02:58 -04:00
David de la Iglesia Castro 58ad56afec
Use `step` interval in `estimated_stepping_batches` docs example (#19774) 2024-04-15 10:16:17 -04:00
Jin Zehao 94167d6e65
docs: fix LightningCLI docs example for Trainer Callbacks (#19701) 2024-03-27 23:16:29 +01:00
Jirka Borovec b9bfd1e43b
docs: pin version in links to external docs for [2/3] (#19672)
* docs: pin version in links to external docs for fabric
* ci update
2024-03-25 10:45:27 +01:00
Felix Hirwa Nshuti fadd2fccdc
docs: added mathjax for rendering tex format as imgmath needs latex engine to run (#19675) 2024-03-20 09:09:22 -07:00
Jirka Borovec 51ad6bb8fe
docs: pin version in links to external docs [1/n] (#19584)
* docs: ignore link to private repo
* lightning-utilities >=0.11.0, <0.12.0

---------

Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2024-03-19 16:35:40 +01:00
Karthik Venkataramani 88869ad482
docs: fix `y` var for compute loss (#19465)
* Update train_model_basic.rst

Loss should be calculated by comparing `x_hat` to `y`, not `x`

* Update evaluation_basic.rst

Error in the documentation where training loss is calculated as `F.mse_loss(x_hat, x)`. Changed to `F.mse_loss(x_hat, y)`

* Update notebooks.rst

Doc fix

* Update README.md

Doc fix

* Update introduction.rst

Doc fix

* Update train.py

Doc fix

* Update README.md

Doc fix

* Revert "Update train.py"

This reverts commit 1934760056.

* Revert "Update introduction.rst"

This reverts commit 0cec613c07.

* Revert "Update README.md"

This reverts commit fda3fbf32d.

* Revert "Update notebooks.rst"

This reverts commit 183a2a9e33.

* Revert "Update evaluation_basic.rst"

This reverts commit 0eefe39aea.

* Revert "Update train_model_basic.rst"

This reverts commit b98e1d2414.

* remove the variable y when unused in docs

* Update README.md

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2024-03-15 17:56:28 -04:00
invisprints 75553845c6
docs: fix missing arguments in get_metrics() (#19629) 2024-03-14 20:50:42 -04:00
awaelchli 97a95ed6cc
Update `overfit_batches` docs (#19622) 2024-03-13 17:47:55 -04:00
awaelchli 8549a932f7
Remove orphaned docs pages (#19555) 2024-03-12 21:19:05 -04:00
Jirka Borovec b871f7a826
docs: switch NGC link to Nemo (#19568) 2024-03-04 21:51:19 +01:00
awaelchli d9113b61cc
Add additional references in compile guides (#19550) 2024-03-04 08:00:50 -05:00
awaelchli 48c39ce24f
Compile guide for Trainer (#19531)
Co-authored-by: Luca Antiga <luca.antiga@gmail.com>
2024-02-28 09:15:33 -05:00
awaelchli abae4c903b
Update Lightning AI multi-node guide (Trainer) (#19530)
* update

* update

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* configure_model

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-02-28 08:35:53 -05:00
awaelchli e461e90f84
Update the Multi-GPU docs (#19525) 2024-02-26 22:29:26 -05:00
Jirka Borovec cf3553cdb5
docs: enable Sphinx linter & fixing (#19515)
* docs: enable Sphinx linter
* fixes
2024-02-26 16:20:33 +01:00
thomas chaton e43820a4be
migrate Data subpackage (#19523)
* update

* update

* update

* update

* Update checkgroup.yml

* More

* Add note

* Labeller should be kept as long as we have the stubs

* update

* update

* update

* Apply suggestions from code review

* init

* ci fix

* pin version range

* https://www.neptune.ai/

---------

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2024-02-26 08:25:00 -05:00
awaelchli 2a827f3f6f
Docs fixes (#19529) 2024-02-26 12:06:08 +01:00
Mauricio Villegas 623ec5824f
`load_from_checkpoint` support for LightningCLI when using dependency injection (#18105) 2024-02-23 10:55:07 +01:00
Jirka Borovec 5998dd12e8
docs: ignore mall behave link (#19488) 2024-02-16 17:48:51 +01:00
PL Ghost 61ba180e5f
docs: Bump HPU ref `1.4.0` (#19484)
Co-authored-by: jerome-habana <jerome-habana@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2024-02-16 16:28:16 +01:00
awaelchli 120c87f8f7
Include the training mode in the ModelSummary (#19468) 2024-02-15 15:13:35 -05:00
awaelchli e950bb4828
Remove the Graphcore IPU integration (#19405)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2024-02-12 16:16:02 -05:00
awaelchli 9c8cd4ce68
Update to 2.3.0dev (#19430)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2024-02-08 06:34:51 -05:00