Commit Graph

76 Commits

Author SHA1 Message Date
Rohit Gupta d1a3a3ebf5
Add BatchSizeFinder callback (#11089)
* add BatchSizeFinderCallback callback

* temp rm from init

* skip with lr_finder tests

* restore loops and intergrate early exit

* enable fast_dev_run test

* add docs and tests

* keep tune and remove early_exit

* add more tests

* patch lr finder

* disable skip

* force_save and fix test

* mypy and circular import fix

* fix mypy

* fix

* updates

* rebase

* address reviews

* add more exceptions for unsupported functionalities

* move exception to setup

* chlog

* unit test

* address reviews

* Apply suggestions from code review

* update

* update

* mypy

* fix

* use it as a util func

* license

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* mypy

* mypy

* review

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* updates

* updates

* fix import

* Protect callback attrs

* don't reset val dataloader

* update test

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-27 08:54:37 -04:00
Andres Algaba 4fc8275cc3
Remove the deprecated `trainer.call_hook` (#14869) 2022-09-26 15:56:44 +02:00
jsr-p abb6049fa3
Update documentation for the basic skills tutorial level 2 on how to validate and test a model (#14874) 2022-09-24 10:34:06 +00:00
dconathan 633d14e67a
fixed comet -> mlflow typo in visualize/experiment_managers docs (#14843)
fixed comet -> mlflow typo

Co-authored-by: Devin Conathan <devin.conathan@libertymutual.com>
2022-09-24 00:13:28 +02:00
Adrian Wälchli dd2a1c5d29
Integrate Lite Precision into PL (#14798)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-09-22 14:51:04 +00:00
Mauricio Villegas 3064c28ce1
Added args parameter to LightningCLI to ease running from within Python (#14596)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-09-19 17:38:30 +00:00
Gilad a5b0f8bd5c
Fix TQDMProgressBar usage in logging.rst (#14768) 2022-09-19 01:07:19 +02:00
Akihiro Nitta 3c5e03e035
docs: Clarify versioning and API stability (#14549)
* mv releases to a standalone page

* Include release_policy in index

* Update policy

* mv releases to a standalone page

* Include release_policy in index

* Update policy

* Update title

* remove release_policy.rst

* Update versioning

* syntax

* simplify wording

* Include examples that don't follow X+2 rule

* syntax

* update

* consistency

* rm noninformative statement

* .

* Reduce redundancy in the deprecation process

* grammar?

* consistency

* Update docs/source-pytorch/versioning.rst

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-15 09:16:14 -04:00
Benjamin Krala c2378bd3b1
Add documentation for trainer.datamodule and dataloaders of a trainer object (#14600)
* Update trainer.rst
* Update datamodule.rst
2022-09-14 15:46:34 +00:00
Carlos Mocholí 313c338da1
Remove legacy examples from logging docs (#14686) 2022-09-13 13:47:51 -04:00
Max Ehrlich e5998e6bf2
Make the SLURM Preemption/Timeout Signal Configurable (#14626)
* Add parameter to change the preemption signal
* Make the signal connector use the custom signal from SLURMEnvironment

Signed-off-by: Max Ehrlich <max.ehr@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-12 19:24:35 +00:00
Adrian Wälchli 925edbca07
Remove the deprecated `weights_save_path` Trainer argument (#14424)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-12 19:02:38 +00:00
Adrian Wälchli 4f3c47294e
Add troubleshooting section to MPS docs (#14642)
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-09-12 17:35:18 +00:00
Vasilis Vryniotis 7e9e441843
Use TorchVision's Multi-weight Support and Model Registration API on Lightning (#14567)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-09 20:04:57 +00:00
Adrian Wälchli d2459df2ff
Standalone Lite: Remaining Utilities (#14492)
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Laverne Henderson <laverne.henderson@coupa.com>
Co-authored-by: Felonious-Spellfire <felonious.spellfire@gmail.com>
2022-09-07 15:25:23 +00:00
Dan Nissenbaum 7ada52e33c
Typo in major heading seen by newcomers (#14501)
* Typo in major heading seen by newcomers

Correct typo in one of the first major headings newcomers to Lightning see when they are considering migrating their code to use Lightning.

I know this is a trivial change in terms of the text change itself, but I really think it's valuable for one of the most important landing pages that users first investigating Lightning see - to have rock-solid, professional text without obvious typos. Here was a typo in the main heading itself. I suggest fixing it straightaway via this PR.

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-07 09:06:30 +00:00
Carlos Mocholí 44216fdd69
Integrate `lightning_utilities.core.imports` (#14475)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-06 12:56:20 +00:00
Carlos Mocholí 8a4a3b6766
Mark the lite `DeviceDtypeModuleMixin` as protected (#14548) 2022-09-06 14:17:15 +02:00
Rohit Gupta 8c6119fbce
Add auto wrapping support for `DDPFullyShardedStrategy` (#14383) 2022-09-05 19:07:26 +00:00
Laverne Henderson b19b15546b
Updated basic debugging (#14488)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-02 14:23:29 +05:30
Benjamin Krala cce55b6cd3
Precise description of reload_dataloaders_every_n_epochs (#14245) 2022-09-01 10:06:28 -04:00
Adrian Wälchli 28e18881a9
Mark stage argument in hooks as required (#14064)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-09-01 15:47:40 +02:00
Carlos Mocholí 10adcd5d12
Remove the unused pyDeprecate dependency (#14472)
* Remove the unused pyDeprecate dependency
* CHANGELOG
2022-09-01 03:30:18 +09:00
Rohit Gupta f3574176e2
Change `trainer.should_stop` to not stop in between an epoch and run until `min_steps/min_epochs` only (#13890) 2022-08-27 12:12:24 +00:00
Adrian Wälchli 250c06e406
Remove deprecated HPC model hooks (#14315)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-26 20:59:32 +00:00
Tianshu Wang 8950613552
save checkpoints and profiler output to the first logger (#14325)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-26 17:23:54 +00:00
Rohit Gupta 6d00f31f0c
Add auto wrapping for `DDPFullyShardedNativeStrategy` (#14252) 2022-08-26 09:01:48 +00:00
Jerome Anand 424e78d5b3
Add document to showcase scaleout on hpu (#14357)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-08-25 14:15:01 +05:30
Laverne Henderson 8ff2e01025
Update for M1 Mac installations (#14350)
* Update for M1 Mac installations

* Apply suggestions from code review

* Update PL installation

* Update based on feedback

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-23 13:31:18 -04:00
Carlos Mocholí 7a617ec90e
Add back support for logging in the gradient clipping hooks (#14298)
* Add back support for logging in the gradient clipping hooks

* Docs and CHANGELOG

* Fix tests
2022-08-22 09:19:53 -04:00
Rohit Gupta e949362a6b
Enable `on_before_batch_transfer` for `DPStrategy` and `IPUAccelerator` (#14023)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-18 12:12:29 +00:00
Rohit Gupta 48c23e5716
Use fsdp module to initialize precision scalar for fsdp native (#14092)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Laverne Henderson <laverne.henderson@coupa.com>
2022-08-13 07:52:06 +00:00
Rohit Gupta 208512e6e9
Add docs for `fsdp_native` (#14108) 2022-08-12 09:14:07 +00:00
Dan Dale 355fda3702
Add Promoted CLI to API Reference Section (#14072) 2022-08-08 10:16:53 +02:00
Rohit Gupta a4e4cab7a6
Deprecate `amp_level` from `Trainer` (#13898)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-05 08:31:19 +00:00
Carlos Mocholí b88b700745
Remove the deprecated DDP2 strategy (#14026) 2022-08-04 20:27:35 +00:00
Rohit Gupta e78bf2044b
Raise an error if batch transfer hooks are overridden with IPUAccelerator (#13961)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-04 12:04:42 +00:00
Jerome Anand b3203d93d0
Added support for HPU device stats monitor (#13819)
* Added support for HPU device stats monitor

Signed-off-by: Jerome <janand@habana.ai>

* Update changelog

Signed-off-by: Jerome <janand@habana.ai>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>

* Update reference

Signed-off-by: Jerome <janand@habana.ai>

* Apply suggestions from code review

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* fix alignment

* add descriptions

* Update hpu_intermediate.rst

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-02 13:31:31 +05:30
Adrian Wälchli 7708ce22b2
Update GitHub links to PL repo (#13849)
* update lightning links in docs

* update links in chlog

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* Update src/pytorch_lightning/README.md

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* Update src/pytorch_lightning/README.md

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* update

* painful

* badges

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update badges

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-28 22:08:07 +02:00
otaj 4c7b9f0b11
Disallow batch sampler with multiple IPU devices (#13854)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 15:20:43 +05:30
Rohit Gupta faf7ff57c0
Add support for async checkpointing (#13658) 2022-07-26 21:13:19 +05:30
Adrian Wälchli a8d7b4476c
Fix PyTorch spelling errors (#13774)
* Fix PyTorch spelling errors

* more
2022-07-25 12:51:16 -04:00
Ha YongWook 11f8fa2ca6
Fix typos in Checkpointing doc (#13827)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-07-25 09:32:49 -04:00
rampartrange 8d14554383
[Typo] update introduction.rst (#13791) 2022-07-23 16:38:36 +00:00
Adrian Wälchli d24978baa3
Add ddp_notebook alias for ddp_fork (#13744)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-23 09:06:35 -04:00
Carlos Mocholí 4f53e7132f
Promote the CLI out of utilities (#13767) 2022-07-23 12:07:29 +00:00
Adrian Wälchli c3299d2c59
Add support for DDP fork (#13405)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-07-22 16:05:35 +00:00
Rohit Gupta e451fa28d0
Fix main progress bar counter when `val_check_interval=int` and `check_val_every_n_epoch=None` (#12832) 2022-07-20 20:33:00 +00:00
Nathaniel D'Amours cd206991b9
Fix typos initialize in docs (#13557)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-20 19:29:31 +00:00
Justus Schock c75457da99 Rename GPUAccelerator to CUDAAccelerator 2022-07-19 13:06:30 -04:00