Carlos Mocholí
efc8727314
Add typing for `LightningOptimizer` ( #9990 )
2021-10-18 17:29:51 +02:00
thomas chaton
86df7dcee7
Add KFold Loop example ( #9965 )
2021-10-18 16:27:12 +01:00
Adrian Wälchli
a99b7440b5
Add unit tests for `pl.utilities.grads` ( #9765 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-10-18 18:58:51 +05:30
Rohit Gupta
4dc32ad7db
Fix logic to check for spawn in worker_check ( #9902 )
...
* fix
* update tests
* chlog
* skip windows
2021-10-18 13:02:46 +00:00
Adrian Wälchli
10d0b41977
Introduce `PrecisionPlugin.forward_context()` ( #9988 )
...
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-18 12:58:19 +00:00
Carlos Mocholí
3f355d0eb7
Remove manual tracking of optimizer steps ( #9957 )
2021-10-18 12:43:06 +00:00
Carlos Mocholí
e0470cc244
Update `resume_from_checkpoint` docs ( #9952 )
2021-10-18 17:40:47 +05:30
Carlos Mocholí
c69a79c86f
Fix `self.log(on_epoch=True)` on_batch_start ( #9780 )
2021-10-18 14:02:16 +02:00
Elad Segal
8c76cf5ae1
reset val dataloader for binsearch ( #9975 )
2021-10-18 12:54:26 +02:00
Adrian Wälchli
7a9151637c
loop customization docs ( #9609 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
2021-10-18 09:43:11 +00:00
Carlos Mocholí
01b304ec57
Update accelerator connector messages after the addition of strategy ( #9937 )
2021-10-18 01:10:48 +00:00
Carlos Mocholí
788f6864d9
Fix `LightningOptimizer` step and toggling logic ( #9958 )
2021-10-18 00:23:51 +00:00
ronif
7b4df7bf91
Fix issue with no-init dataclass fields in move_to_device ( #9963 )
...
Co-authored-by: ronif <ronif@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-17 07:10:47 +00:00
Carlos Mocholí
e5dfdf34f9
Avoid deprecation warning after #9901 ( #9951 )
2021-10-16 17:36:25 +01:00
Carlos Mocholí
db4e770004
Validate the precision input earlier ( #9763 )
2021-10-15 17:30:00 +00:00
kingyiusuen
6429de8944
Add support for `len(datamodule)` ( #9895 )
...
Co-authored-by: tchaton <thomas@grid.ai>
2021-10-15 14:19:50 +02:00
Danielle Pintz
16213b1635
Deprecate `log_gpu_memory`, `gpu_metrics`, and util funcs in favor of `DeviceStatsMonitor` callback ( #9921 )
...
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-14 22:45:44 +02:00
Oliver Borchert
afbf703684
Single-process multi-node CPU training ( #9603 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-14 22:21:41 +02:00
Danielle Pintz
6feda08109
Deprecate `GPUStatsMonitor` and `XLAStatsMonitor` in favor of `DeviceStatsMonitor` ( #9924 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Nicki Skafte Detlefsen <skaftenicki@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-14 15:52:45 +00:00
four4fish
a002f872ea
[2/n] Directly call TrainingTypePlugin APIs instead of going through the Accelerator ( #9901 )
...
Co-authored-by: tchaton <thomas@grid.ai>
2021-10-14 17:38:22 +02:00
Viraj Bagal
15698698c4
Log LR using LearningRateMonitor even when LR Scheduler is not defined. ( #9786 )
...
* LR logging works even with no lr scheduler, wrote few extra tests as well
* updated changelog
* modified code as suggested by DeepSource
* added helper functions
* opt with no scheduler
* rename
* chlog
* update test
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2021-10-14 13:28:19 +00:00
Danielle Pintz
940b910d27
[2/4] Add DeviceStatsMonitor callback ( #9712 )
...
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Kaushik B <kaushikbokka@gmail.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2021-10-13 18:29:36 +00:00
Rohit Gupta
23e8b59ae7
Add `configure_gradient_clipping` hook in `LightningModule` ( #9584 )
...
* init hook
* docs
* dep train args
* update tests
* doc
* doc
* .gitignore
* not dep
* add trainer args
* add & update tests
* fix tests
* pre-commit
* docs
* add docs
* add exception
* code review
* deepspeed
* update tests
* not
* try fix
* Apply suggestions from code review
* update deepspeed
* disable some tests
* disable some tests
* enable all tests
2021-10-13 20:15:13 +05:30
Kaushik B
05b15e63f0
Add `strategy` argument to Trainer ( #8597 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-13 12:34:06 +00:00
ananthsub
28fc8d2016
Add `enable_model_summary` flag and deprecate `weights_summary` ( #9699 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <kaushikbokka@gmail.com>
2021-10-13 17:20:54 +05:30
Kaushik B
b1e215d036
Remove `should_rank_save_checkpoint` property from Trainer ( #9433 )
2021-10-13 11:36:24 +00:00
Rohit Gupta
0f8fd20443
Remove epoch from `trainer.logged_metrics` ( #9904 )
2021-10-13 11:30:27 +02:00
ananthsub
4610fddb19
Mark `Trainer.terminate_on_nan` protected and deprecate public property ( #9849 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-12 20:23:22 +00:00
Danielle Pintz
dd6d797e0e
Remove type error handling in _configure_checkpoint_callbacks ( #9823 )
...
* remove type error handling in _configure_checkpoint_callbacks
* rm test
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-12 20:13:02 +00:00
Chris Chow
f14a47a0b2
guard against None in pytorch get_xla_supported_devices ( #9572 )
...
Co-authored-by: Chris Chow <cchow@nianticlabs.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-12 12:01:32 +00:00
Rohit Gupta
98c0a110e0
Update docs for `GradientAccumulationScheduler` ( #9891 )
...
* update docs and add tests
* update docs and add tests
* Update pytorch_lightning/callbacks/gradient_accumulation_scheduler.py
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
* Apply suggestions from code review
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-12 10:37:16 +00:00
Rohit Gupta
7eff00317d
update docs ( #9903 )
2021-10-12 15:37:08 +05:30
Rohit Gupta
f2b0db60f1
Raise a `MisconfigurationException` when trainer functions are called with `ckpt_path="best"` but `checkpoint_callback` isn't configured ( #9841 )
...
* add check
* chlog
* Apply suggestions from code review
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
* Apply suggestions from code review
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-10-12 15:35:55 +05:30
Adrian Wälchli
64d1c46623
Update error message for interactive incompatible plugins ( #9896 )
...
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2021-10-12 15:10:49 +05:30
Sean Naren
6da5829e53
DeepSpeed support for device IDs ( #9847 )
2021-10-12 09:24:46 +00:00
ananthsub
f16bfe9bdd
Mark `trainer.config_validator` as protected ( #9779 )
2021-10-12 09:29:05 +01:00
Rohit Gupta
db322f4bbb
Deprecate `checkpoint_callback` from the `Trainer` constructor in favour of `enable_checkpointing` ( #9754 )
...
* enable_chekpointing
* update codebase
* chlog
* update tests
* fix warning
* Apply suggestions from code review
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Apply suggestions from code review
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
* Apply suggestions from code review
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-12 07:55:07 +00:00
Gili Tzabari
4afe53791b
Clarify lr scheduler frequency ( #9843 )
2021-10-12 01:44:07 +00:00
Adrian Wälchli
f9d2612102
fix qconfig import for pytorch 1.10 ( #9899 )
2021-10-11 22:30:34 +00:00
Kaushik B
c3aa6e9818
Prepare v1.5.0rc0 ( #9893 )
2021-10-11 20:36:01 +01:00
yopknopixx
173f4c8466
Deprecate `terminate_on_nan` Trainer argument in favor of `detect_anomaly` ( #9175 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-10-11 17:17:43 +00:00
Adrian Wälchli
6a0c47a014
remove redundant accumulation normalization in manual optimization ( #9769 )
2021-10-11 15:26:12 +00:00
Ranuga-Disansa
f915a8a283
Removed a redundant warning with `ModelCheckpoint(monitor=None)` callback ( #9875 )
...
* Update README.md
* Update README.md
* Create evaluation.py
* Update README.md
* Update evaluation.py
* Create evaluation.py
* Create evaluation.py
* Update evaluation.py
* Create nlp.py
* Update evaluation.py
* Create evaluation.py
* Update nlp.py
* Update nlp.py
* Update evaluation.py
* Create evaluation.py
* Update nlp.py
* Update nlp.py
* Update requirements.txt
* Update evaluation.py
* Create data_loader.py
* Update nlp.py
* Update evaluation.py
* Update data_loader.py
* Update nlp.py
* Update data_loader.py
* Update requirements.txt
* Update model_checkpoint.py
* Delete evaluation.py
* Delete data_loader.py
* Delete nlp.py
* Update requirements.txt
* Update model_checkpoint.py
* Update README.md
* Update pytorch_lightning/callbacks/model_checkpoint.py
* Update CHANGELOG.md
* Update test_model_checkpoint.py
* Update model_checkpoint.py
* update
* update
* chlog update
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-11 14:54:07 +00:00
Rohit Gupta
54d4b4b21d
use existing logic to configure optimizers in lr_finder ( #9789 )
...
* use predefined logic
* patch init_optimizers
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-10-11 13:05:27 +00:00
theory-in-progress
4ecb0d8bc9
Updated quantization imports in PyTorch 1.10 ( #9878 )
...
Co-authored-by: tchaton <thomas@grid.ai>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2021-10-11 11:23:21 +00:00
Rohit Gupta
46fa703853
disable_logger ( #9837 )
2021-10-11 16:36:59 +05:30
Boris Dayma
2db9ea3500
feat(wandb): support media logging ( #9545 )
2021-10-11 10:15:36 +01:00
Rohit Gupta
ce8233e6f0
use public format checkpoint method ( #9818 )
...
* use public method
* document
* Apply suggestions from code review
2021-10-11 09:23:47 +01:00
Rohit Gupta
d71501d97f
Reset `val_dataloader` in `tuner/batch_size_scaling` ( #9857 )
...
* reset val
* chlog
2021-10-11 09:13:33 +01:00
kingyiusuen
8740c801bb
Fix typo in _validate_scheduler_optimizer() ( #9886 )
2021-10-11 09:16:17 +02:00