3449 Commits

Author SHA1 Message Date
Carlos Mocholí efc8727314
Add typing for `LightningOptimizer` (#9990) 2021-10-18 17:29:51 +02:00
thomas chaton 86df7dcee7
Add KFold Loop example (#9965) 2021-10-18 16:27:12 +01:00
Adrian Wälchli a99b7440b5
Add unit tests for `pl.utilities.grads` (#9765)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-10-18 18:58:51 +05:30
Rohit Gupta 4dc32ad7db
Fix logic to check for spawn in worker_check (#9902)
* fix

* update tests

* chlog

* skip windows
2021-10-18 13:02:46 +00:00
Adrian Wälchli 10d0b41977
Introduce `PrecisionPlugin.forward_context()` (#9988)
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-18 12:58:19 +00:00
Carlos Mocholí 3f355d0eb7
Remove manual tracking of optimizer steps (#9957) 2021-10-18 12:43:06 +00:00
Carlos Mocholí e0470cc244
Update `resume_from_checkpoint` docs (#9952) 2021-10-18 17:40:47 +05:30
Carlos Mocholí c69a79c86f
Fix `self.log(on_epoch=True)` on_batch_start (#9780) 2021-10-18 14:02:16 +02:00
Elad Segal 8c76cf5ae1
reset val dataloader for binsearch (#9975) 2021-10-18 12:54:26 +02:00
Adrian Wälchli 7a9151637c
loop customization docs (#9609)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2021-10-18 09:43:11 +00:00
Carlos Mocholí 01b304ec57
Update accelerator connector messages after the addition of strategy (#9937) 2021-10-18 01:10:48 +00:00
Carlos Mocholí 788f6864d9
Fix `LightningOptimizer` step and toggling logic (#9958) 2021-10-18 00:23:51 +00:00
ronif 7b4df7bf91
Fix issue with no-init dataclass fields in move_to_device (#9963)
Co-authored-by: ronif <ronif@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-17 07:10:47 +00:00
Carlos Mocholí e5dfdf34f9
Avoid deprecation warning after #9901 (#9951) 2021-10-16 17:36:25 +01:00
Carlos Mocholí db4e770004
Validate the precision input earlier (#9763) 2021-10-15 17:30:00 +00:00
kingyiusuen 6429de8944
Add support for `len(datamodule)` (#9895)
Co-authored-by: tchaton <thomas@grid.ai>
2021-10-15 14:19:50 +02:00
Danielle Pintz 16213b1635
Deprecate `log_gpu_memory`, `gpu_metrics`, and util funcs in favor of `DeviceStatsMonitor` callback (#9921)
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-14 22:45:44 +02:00
Oliver Borchert afbf703684
Single-process multi-node CPU training (#9603)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-14 22:21:41 +02:00
Danielle Pintz 6feda08109
Deprecate `GPUStatsMonitor` and `XLAStatsMonitor` in favor of `DeviceStatsMonitor` (#9924)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Nicki Skafte Detlefsen <skaftenicki@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-14 15:52:45 +00:00
four4fish a002f872ea
[2/n] Directly call TrainingTypePlugin APIs instead of going through the Accelerator (#9901)
Co-authored-by: tchaton <thomas@grid.ai>
2021-10-14 17:38:22 +02:00
Viraj Bagal 15698698c4
Log LR using LearningRateMonitor even when LR Scheduler is not defined. (#9786)
* LR logging works even with no lr scheduler, wrote few extra tests as well

* updated changelog

* modified code as suggested by DeepSource

* added helper functions

* opt with no scheduler

* rename

* chlog

* update test

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2021-10-14 13:28:19 +00:00
Danielle Pintz 940b910d27
[2/4] Add DeviceStatsMonitor callback (#9712)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Kaushik B <kaushikbokka@gmail.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2021-10-13 18:29:36 +00:00
Rohit Gupta 23e8b59ae7
Add `configure_gradient_clipping` hook in `LightningModule` (#9584)
* init hook

* docs

* dep train args

* update tests

* doc

* doc

* .gitignore

* not dep

* add trainer args

* add & update tests

* fix tests

* pre-commit

* docs

* add docs

* add exception

* code review

* deepspeed

* update tests

* not

* try fix

* Apply suggestions from code review

* update deepspeed

* disable some tests

* disable some tests

* enable all tests
2021-10-13 20:15:13 +05:30
Kaushik B 05b15e63f0
Add `strategy` argument to Trainer (#8597)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-13 12:34:06 +00:00
ananthsub 28fc8d2016
Add `enable_model_summary` flag and deprecate `weights_summary` (#9699)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <kaushikbokka@gmail.com>
2021-10-13 17:20:54 +05:30
Kaushik B b1e215d036
Remove `should_rank_save_checkpoint` property from Trainer (#9433) 2021-10-13 11:36:24 +00:00
Rohit Gupta 0f8fd20443
Remove epoch from `trainer.logged_metrics` (#9904) 2021-10-13 11:30:27 +02:00
ananthsub 4610fddb19
Mark `Trainer.terminate_on_nan` protected and deprecate public property (#9849)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-12 20:23:22 +00:00
Danielle Pintz dd6d797e0e
Remove type error handling in _configure_checkpoint_callbacks (#9823)
* remove type error handling in _configure_checkpoint_callbacks

* rm test

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-12 20:13:02 +00:00
Chris Chow f14a47a0b2
guard against None in pytorch get_xla_supported_devices (#9572)
Co-authored-by: Chris Chow <cchow@nianticlabs.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-12 12:01:32 +00:00
Rohit Gupta 98c0a110e0
Update docs for `GradientAccumulationScheduler` (#9891)
* update docs and add tests

* update docs and add tests

* Update pytorch_lightning/callbacks/gradient_accumulation_scheduler.py

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* Apply suggestions from code review

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-12 10:37:16 +00:00
Rohit Gupta 7eff00317d
update docs (#9903) 2021-10-12 15:37:08 +05:30
Rohit Gupta f2b0db60f1
Raise a `MisconfigurationException` when trainer functions are called with `ckpt_path="best"` but `checkpoint_callback` isn't configured (#9841)
* add check

* chlog

* Apply suggestions from code review

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>

* Apply suggestions from code review

Co-authored-by: thomas chaton <thomas@grid.ai>

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-12 15:35:55 +05:30
Adrian Wälchli 64d1c46623
Update error message for interactive incompatible plugins (#9896)
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2021-10-12 15:10:49 +05:30
Sean Naren 6da5829e53
DeepSpeed support for device IDs (#9847) 2021-10-12 09:24:46 +00:00
ananthsub f16bfe9bdd
Mark `trainer.config_validator` as protected (#9779) 2021-10-12 09:29:05 +01:00
Rohit Gupta db322f4bbb
Deprecate `checkpoint_callback` from the `Trainer` constructor in favour of `enable_checkpointing` (#9754)
* enable_checkpointing

* update codebase

* chlog

* update tests

* fix warning

* Apply suggestions from code review

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>

* Apply suggestions from code review

Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-12 07:55:07 +00:00
Gili Tzabari 4afe53791b
Clarify lr scheduler frequency (#9843) 2021-10-12 01:44:07 +00:00
Adrian Wälchli f9d2612102
fix qconfig import for pytorch 1.10 (#9899) 2021-10-11 22:30:34 +00:00
Kaushik B c3aa6e9818
Prepare v1.5.0rc0 (#9893) 2021-10-11 20:36:01 +01:00
yopknopixx 173f4c8466
Deprecate `terminate_on_nan` Trainer argument in favor of `detect_anomaly` (#9175)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-11 17:17:43 +00:00
Adrian Wälchli 6a0c47a014
remove redundant accumulation normalization in manual optimization (#9769) 2021-10-11 15:26:12 +00:00
Ranuga-Disansa f915a8a283
Removed a redundant warning with `ModelCheckpoint(monitor=None)` callback (#9875)
* Update README.md

* Update README.md

* Create evaluation.py

* Update README.md

* Update evaluation.py

* Create evaluation.py

* Create evaluation.py

* Update evaluation.py

* Create nlp.py

* Update evaluation.py

* Create evaluation.py

* Update nlp.py

* Update nlp.py

* Update evaluation.py

* Create evaluation.py

* Update nlp.py

* Update nlp.py

* Update requirements.txt

* Update evaluation.py

* Create data_loader.py

* Update nlp.py

* Update evaluation.py

* Update data_loader.py

* Update nlp.py

* Update data_loader.py

* Update requirements.txt

* Update model_checkpoint.py

* Delete evaluation.py

* Delete data_loader.py

* Delete nlp.py

* Update requirements.txt

* Update model_checkpoint.py

* Update README.md

* Update pytorch_lightning/callbacks/model_checkpoint.py

* Update CHANGELOG.md

* Update test_model_checkpoint.py

* Update model_checkpoint.py

* update

* update

* chlog update

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-11 14:54:07 +00:00
Rohit Gupta 54d4b4b21d
use existing logic to configure optimizers in lr_finder (#9789)
* use predefined logic

* patch init_optimizers

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-11 13:05:27 +00:00
theory-in-progress 4ecb0d8bc9
Updated quantization imports in PyTorch 1.10 (#9878)
Co-authored-by: tchaton <thomas@grid.ai>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-11 11:23:21 +00:00
Rohit Gupta 46fa703853
disable_logger (#9837) 2021-10-11 16:36:59 +05:30
Boris Dayma 2db9ea3500
feat(wandb): support media logging (#9545) 2021-10-11 10:15:36 +01:00
Rohit Gupta ce8233e6f0
use public format checkpoint method (#9818)
* use public method

* document

* Apply suggestions from code review
2021-10-11 09:23:47 +01:00
Rohit Gupta d71501d97f
Reset `val_dataloader` in `tuner/batch_size_scaling` (#9857)
* reset val

* chlog
2021-10-11 09:13:33 +01:00
kingyiusuen 8740c801bb
Fix typo in _validate_scheduler_optimizer() (#9886) 2021-10-11 09:16:17 +02:00