Commit Graph

6219 Commits

Author SHA1 Message Date
jona-0 7aee00c679
[DeepSpeed] fix flag forwarding in DeepSpeedPlugin (#10899)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-14 15:56:08 +00:00
Josafat-Mattias Burmeister d7b6e87aeb
Add missing `self` to setup hook example (#11041)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2021-12-14 15:29:11 +00:00
Rohit Gupta 5b153e5072
Update data docs (#11042)
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-14 15:26:46 +00:00
Rohit Gupta 59ef66c06b
Fix support for `CombinedLoader` while checking for warning raised with eval dataloaders (#10994) 2021-12-14 20:43:23 +05:30
Adrian Wälchli 2faaf35b91
Convert warning message to debug-level info in spawn plugins (#10864)
Co-authored-by: four4fish <88516121+four4fish@users.noreply.github.com>
2021-12-13 21:09:19 +00:00
Rafał Jankowski ed84cef3af
Removed duplicated file extension when uploading model checkpoints with NeptuneLogger (#11015) 2021-12-11 00:03:12 +05:30
Rohit Gupta 5576fbc5f9
move optional section (#11011) 2021-12-10 09:47:44 +00:00
Adrian Wälchli a4083df586
2/n Simplify spawn plugins: Spawn immediately (#10896) 2021-12-09 18:56:24 +00:00
Danielle Pintz 3fcfd0214c
Remove `_call_accelerator_hook` Trainer method (#10999) 2021-12-09 02:27:13 +01:00
jjenniferdai 6e21dd3767
Deprecate `on_hpc_{save/load}` hooks (#10911)
* first commit

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update pr #

* test filterwarnings

* add a todo comment

* updates

* Update pytorch_lightning/core/saving.py

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* Update pytorch_lightning/core/saving.py

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* model --> LightningModule Update pytorch_lightning/core/saving.py

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* model --> LightningModule Update pytorch_lightning/core/saving.py

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-12-08 14:56:15 -08:00
Adrian Wälchli 8b30981b10
Remove `start_{training,evaluating,predicting}` from `HorovodPlugin` (#10989) 2021-12-08 14:02:26 +00:00
Danielle Pintz 01f5f99919
Deprecate callback hooks `on_init_start` and `on_init_end` (#10940) 2021-12-08 07:42:19 +00:00
Danielle Pintz aeb0b5595f
Deprecate `call_hook` (#10979) 2021-12-08 00:52:47 +00:00
Rohit Gupta 6369e3b77f
Update Changelog after 1.5.5 release (#10977) 2021-12-07 12:35:20 -08:00
Rohit Gupta 5324f1c6ed
Prune EvalModelTemplate (1/n) (#10969)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-12-07 18:04:06 +00:00
Rohit Gupta c1b0bc8c25
remove EvalModelTemplate from tests (#10970) 2021-12-07 23:11:15 +05:30
Carlos Mocholí 99adc45af1
Follow-up changes to #10575 (#10957)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-07 15:27:52 +01:00
Matthias Fey 42b5417e9b
Sharing Datasets Across Process Boundaries (#10951) 2021-12-07 13:17:15 +00:00
Adrian Wälchli 46f718d2ba
Fix typing in `pl.plugins.environments` (#10943) 2021-12-07 02:14:02 +00:00
Adrian Wälchli 6bfc0bbc56
Remove `TrainingTypePlugin.post_dispatch` in favor of `teardown` (#10939)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2021-12-06 22:27:30 +00:00
four4fish 629ca09e09
Fix TypeError causing failure in signal_connector teardown (#10961) 2021-12-06 21:48:31 +00:00
four4fish 63bb4ec77d
4/n Move Accelerator into strategy - remove X_step() from accelerator (#10890)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-06 20:16:54 +00:00
Adrian Wälchli 6c79b2e969
Change temporary spawn checkpoint name (#10934) 2021-12-06 16:08:55 +00:00
Adrian Wälchli 5f8504ee87
Add Trainer.state to spawn queue (#10937) 2021-12-06 14:50:50 +00:00
Adrian Wälchli 3e1f8aa312
Fix spawn plugins not deleting temp checkpoint (#10935) 2021-12-06 13:41:19 +00:00
Adrian Wälchli c81a296ee1
Move implementation of `LightningModule.add_to_queue/get_from_queue` (#10936) 2021-12-06 16:17:37 +05:30
Adrian Wälchli 9a4b51d17f
Fix typing in `pl.trainer.config_validator` (#10803) 2021-12-06 11:19:36 +01:00
four4fish 2fc64e9656
2/n Move Accelerator into strategy - remove dispatch functions from Accelerator (#10885)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-06 09:51:14 +00:00
Rajath Bharadwaj 7914e5c157
added UserWarnings if max_epochs not set in the Trainer class (#10700) 2021-12-06 09:44:25 +00:00
os1ma 99bb62ae64
Fix some missing code in step-by-step walk through (#10519)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-06 07:46:33 +00:00
Kaushik B 6599ced17d
Don't import torch_xla.debug for torch-xla<1.8 (#10836) 2021-12-06 06:31:38 +00:00
Adrian Wälchli 3d59a2faff
Trigger CI only with pull request (#10932) 2021-12-05 06:55:47 +00:00
Danielle Pintz 6043179931
Re-design `call_hook` interface (#10575) 2021-12-04 16:39:55 -05:00
Carlos Mocholí a28b4cd0c0
Sort out the dataloader idx logic for evaluation (#10923)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-03 20:01:46 +00:00
Luca Moschella 7792b77932
Resolve: 'DummyExperiment' object does not support item assignment (#10917)
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-12-03 17:54:05 +00:00
four4fish 6fe3211573
Unroll dict input before call Accelerator X_steps (#10908)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-03 17:00:52 +00:00
Adrian Wälchli d92ab96f17
Simplify some ddp-spawn tests (#10921) 2021-12-03 17:37:40 +01:00
Rohit Gupta 8ba3b383c0
Fix filtration logic for eval results with multiple dataloaders (#10810)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2021-12-03 14:34:46 +00:00
four4fish e646ca1d59
Remove `setup_optimizers_in_pre_dispatch` logic (#10906) 2021-12-03 15:05:08 +01:00
Adrian Wälchli 553d429ecb
Fix `prepare_data` implementation in `BoringDataModule` (#10915) 2021-12-03 13:51:34 +00:00
Rohit Gupta b7331d80dc
Disable eval dataloaders replacement during overfitting (#10877)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-12-03 14:11:43 +01:00
Ethan Harris ff20af41fd
Fix mathjax in RTD build (#10889)
* Update docs.txt
* Update conf.py
* Update .readthedocs.yml
2021-12-02 11:46:57 +00:00
Adrian Wälchli c55bc433ce
Fix retrieval of batch indices when dataloader num_workers > 0 (#10870)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-12-02 10:36:10 +00:00
Adrian Wälchli 98cb7e8790
1/n Simplify spawn plugins: Simplify handling of multiprocessing queue (#10034)
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-12-02 10:30:44 +00:00
Kaushik B 541b983b90
Fix deadlinks in docs (#10739)
Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>
2021-12-02 09:05:32 +00:00
Rohit Gupta 5b9995da04
Fix schedule reset logic in pytorch profiler (#10837) 2021-12-02 14:22:49 +05:30
four4fish 9beeabbced
Removed unnecessary `_move_optimizer_state` method overrides (#10849)
* Update tpu tp share same logic with ttp

* run test

* Update tpu_spawn.py

* debug

* Add changelog

* Apply suggestions from code review

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>

* Update training_type_plugin.py

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update training_type_plugin.py

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-12-02 05:03:30 +00:00
four4fish 45dd8066e7
3/n Move Accelerator into strategy - remove model_sharded_context() (#10886)
* 3/n Move Accelerator into strategy - remove model_sharded_context()

* update ttp function

* update changelog

* update changelog

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2021-12-02 03:34:51 +00:00
four4fish 44cd412e91
Remove precision_plugin pre_dispatch() method (#10887)
* Remove precision_plugin pre_dispatch() method

* update changelog
2021-12-01 18:42:17 -08:00
Carlos Mocholí 26977043bf
Add separate CI job for slow tests (#10830) 2021-12-01 19:58:18 +00:00