7875 Commits

Author SHA1 Message Date
Kishan Savant 1e5411b143
Removed the deprecated datamodule_checkpointhooks (#14909)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
2022-09-29 22:31:58 +00:00
Aliaksandr Kuzmik 4c43e57b6f
Comet.ml logger - add usage tracking (#14906)
Co-authored-by: Aliaksandr.Kuzmik <AliaksandrK@comet.ml>
2022-09-29 21:10:54 +00:00
Adrian Wälchli c8059d4464
Update quick start guide with latest info (#14880)
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-09-29 20:54:20 +00:00
Suyash Sonawane 72ac4b592f
Fixed docstring for unwatch method (#14920)
2022-09-29 19:20:42 +00:00
Tianshu Wang 485ab5e0de
Fix wandb `save_dir` is not overridden by `None` `dir` when using CLI (#14878)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-29 19:20:07 +00:00
Prince Canuma 04aaf83901
Fix MissingFieldException in offline mode (#14919)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
2022-09-29 18:47:51 +00:00
Adrian Wälchli 498cb60417
Fairscale integration tests for Lite (#14921)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-29 17:46:49 +00:00
Adrian Wälchli 822a7f50af
Align ddp and ddp-spawn strategies in setting up the environment (#11073)
Co-authored-by: Kushashwa Ravi Shrimali <kushashwaravishrimali@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-29 19:30:09 +02:00
Rohit Gupta 3a70e5dbcb
Call `LightningDataModule.load_state_dict` hook while restoring checkpoint using `LightningDataModule.load_from_checkpoint` (#14883)
2022-09-29 16:55:59 +00:00
Ethan Harris 93e802afc2
Simplify bug report template (#14925)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2022-09-29 16:49:45 +00:00
Adrian Wälchli d8e90f6581
Fairscale import updates (#14721)
* fairscale imports
* refactor to avoid meta package build issue

Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-09-29 16:45:27 +00:00
Adrian Wälchli 5b446aec4d
DeepSpeed integration tests for Lite (#14901)
2022-09-29 16:39:32 +00:00
Kaushik B 0abdd80104
Prepare v1.8.0rc0 (#14918)
2022-09-29 18:00:25 +02:00
Carlos Mocholí 6e70f55f00
Clean up CODEOWNERS for PL and Lite (#14942)
* Clean up CODEOWNERS for PL and Lite

* Update
2022-09-29 10:17:05 -04:00
Carlos Mocholí b8cc4525bd
Skip CircleCI trigger for forks (#14930)
2022-09-29 10:16:37 -04:00
Carlos Mocholí 7893eb259a
Prepare CI to run on 3090s (#14910)
2022-09-29 14:01:59 +00:00
Carlos Mocholí 4c53eae0f4
Self-review of the recent Trainer changes (#14916)
2022-09-29 13:59:16 +00:00
Carlos Mocholí 4eb7766f3c
Make internal torchscript check a class attribute (#14904)
2022-09-29 13:40:25 +00:00
otaj 5f0c4aad12
Introduce `ckpt_path="hpc"` keyword for checkpoint loading (#14911)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-29 12:45:51 +00:00
Adrian Wälchli ff3c5b7b9d
Docs section for SLURM troubleshooting (#14873)
Co-authored-by: Laverne Henderson <laverne.henderson@coupa.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-09-29 12:41:31 +00:00
Adrian Wälchli a45c047b38
Remove deprecated LightningIPUModule (#14830)
* Remove deprecated LightningIPUModule
* chlog
* fix import
* Fix 1.10 depr test

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-29 13:07:45 +01:00
Masahiro Wada d377d0efde
Fix type hints of tuner/batch_size_scaling.py (#13518)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: otaj <ota@lightning.ai>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-09-29 12:00:42 +00:00
Jerome Anand 136d57312d
Upgrade HPU image to release 1.6.1 (#14932)
2022-09-29 11:22:27 +00:00
otaj b06f9b7468
Improve building times of IPU docker image (#14934)
2022-09-29 09:55:12 +00:00
otaj 7ff297bdff
Fix App tests (#14922)
2022-09-28 13:20:57 -04:00
Rohit Gupta 688a6da710
Fix pkg version issue while compiling docs (#14914)
* Revert "Add BatchSizeFinder callback (#11089)"

This reverts commit d1a3a3ebf5.

* Revert "Revert "Add BatchSizeFinder callback (#11089)""

This reverts commit 9cc4695925.

* remove pl

* add torch

* add numpy

* rm packages

* add packages

* add packages

* import from PL

* import from PL

* always install PL for doctests

* remove unnecessary requirements

* always install PL in editable mode

* once more

* another attempt

* maybe fix app test?

* Redundant checkgroup path

* Revert "maybe fix app test?"

This reverts commit 8210a43ef4.

* speed up install deps

* damn this

* damn trio

Co-authored-by: otaj <ota@lightning.ai>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-28 12:18:07 -04:00
Adrian Wälchli ea5e817973
Better error message when trying to re-initialize CUDA in forked subprocess (#14709)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-28 05:07:33 -04:00
Carlos Mocholí 9fc4ff3278
Move logic to error out on deprecation warnings into conftest (#14902)
2022-09-27 17:49:25 +02:00
Rohit Gupta d1a3a3ebf5
Add BatchSizeFinder callback (#11089)
* add BatchSizeFinderCallback callback

* temp rm from init

* skip with lr_finder tests

* restore loops and integrate early exit

* enable fast_dev_run test

* add docs and tests

* keep tune and remove early_exit

* add more tests

* patch lr finder

* disable skip

* force_save and fix test

* mypy and circular import fix

* fix mypy

* fix

* updates

* rebase

* address reviews

* add more exceptions for unsupported functionalities

* move exception to setup

* chlog

* unit test

* address reviews

* Apply suggestions from code review

* update

* update

* mypy

* fix

* use it as a util func

* license

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* mypy

* mypy

* review

* fix

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix

* updates

* updates

* fix import

* Protect callback attrs

* don't reset val dataloader

* update test

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-27 08:54:37 -04:00
Adrian Wälchli d572a7e2ec
Fix double precision support in Lite (#14827)
2022-09-27 08:38:20 +00:00
Mansy b0bd949d58
Enable quick-start-app-e2e (#14542)
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-27 08:16:46 +01:00
Raphael Randschau 6edf5c5cc2
fix connect/ disconnect without arguments (#14877)
2022-09-27 03:43:36 +00:00
Kishan Savant 9fd951b45c
Remove the deprecated `trainer.*_ckpt_path` (#14897)
2022-09-27 02:00:33 +02:00
Adrian Wälchli d7404c775a
Integration tests for Precision in Lite (#14815)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-09-26 18:50:11 +00:00
Kishan Savant 8c01e82e26
Remove deprecated trainer_optimizer_mixin (#14887)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-09-26 15:32:02 +00:00
Kishan Savant dce5644101
Removed deprecated trainer_lightning_optimizers (#14889)
2022-09-26 14:08:27 +00:00
Andres Algaba 4fc8275cc3
Remove the deprecated `trainer.call_hook` (#14869)
2022-09-26 15:56:44 +02:00
Kishan Savant 35419b5bf9
Removed the deprecated trainer_data_loading_mixin (#14888)
2022-09-26 10:58:19 +00:00
Adrian Wälchli 2ef1f0ab96
Update root Makefile to run all test projects (#14881)
2022-09-26 10:27:46 +00:00
Kishan Savant e7ad151c31
Remove the deprecated `trainer.verbose_evaluate` (#14884)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-09-24 19:20:31 +00:00
Andres Algaba a6489347ca
Remove the deprecated run_stage (#14870)
2022-09-24 17:22:09 +00:00
Kishan Savant 5ec12f849e
Remove deprecated trainer.should_rank_save_checkpoint (#14885)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-09-24 18:40:11 +02:00
Krishna Kalyan d38633592d
Remove the deprecated `profile_iterable` (#14864)
* remove profile_iterable
* remove imports
* remove deprecated api
* update changelog

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-09-24 12:01:18 +00:00
jsr-p abb6049fa3
Update documentation for the basic skills tutorial level 2 on how to validate and test a model (#14874)
2022-09-24 10:34:06 +00:00
dconathan 633d14e67a
fixed comet -> mlflow typo in visualize/experiment_managers docs (#14843)
fixed comet -> mlflow typo

Co-authored-by: Devin Conathan <devin.conathan@libertymutual.com>
2022-09-24 00:13:28 +02:00
Jirka Borovec b1b50450ac
Cleaning requirement + git fix (#14863)
* drop duplicate docs requirements
* skip empty dir
* mypy for #14861
2022-09-23 16:44:28 +01:00
William Falcon 047d7088f4
Make Trainer readable and debuggable (3/n) (#14871)
* clean trainer 3/n

* clean trainer 3/n

* clean trainer 3/n

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* clean trainer 3/n

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-09-23 11:10:42 -04:00
William Falcon fbfcc3d871
Make Trainer readable and debuggable (2/n) (#14862)
* clean trainer 2/n

* clean trainer 2/n

* clean trainer 2/n

* clean trainer 2/n

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-09-23 08:05:29 -04:00
William Falcon c77d4a8394
Make Trainer Debuggable and understandable again (1/n) (#14861)
* clean trainer 1/n

* clean trainer 1/n

* clean trainer 1/n

* clean trainer 1/n

* clean trainer 1/n

* clean trainer 1/n

* clean trainer 1/n

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* clean trainer 1/n

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-09-23 01:15:59 -04:00
Jirka Borovec a325839141
update changelog after App 0.6.2 (#14853)
2022-09-22 23:08:53 +05:30