Kaushik B
d773407e59
feat: Add ModelSummary Callback ( #9344 )
...
Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-09-10 12:42:42 +00:00
Kaushik B
d028e36946
Add remove_checkpoint to CheckpointIO plugin to simplify ModelCheckpo… ( #9373 )
...
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-09-10 11:55:04 +01:00
Danielle Pintz
160e7e1289
Deprecate LightningModule.get_progress_bar_dict ( #8985 )
...
* Move get_progress_bar_dict from lightning module to progress bar callback
2021-09-09 20:53:47 +00:00
Adrian Wälchli
91ce0d0a99
Remove checkpoint tracking from internal debugger ( #9326 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-09-08 00:42:31 +00:00
Jirka Borovec
6e124e7207
CI: precommit - docformatter ( #8584 )
...
* CI: precommit - docformatter
* fix deprecated
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-09-06 12:49:09 +00:00
Adrian Wälchli
50198d7483
fix progress bar restart with fault-tolerant training enabled ( #9310 )
...
* reset progress updates
* update docs
* add test
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-09-06 10:43:59 +02:00
Adrian Wälchli
f9132e8db6
remove early stopping tracking from internal debugger ( #9327 )
...
* replace dev debugger in early stopping
* remove unused imports
2021-09-06 10:43:03 +02:00
Danielle Pintz
912fd31131
Deprecate on_keyboard_interrupt callback hook ( #9260 )
...
* add on_exception callback hook
* deprecate on_keyboard_interrupt
* Apply suggestions from code review
* raise keyboard interrupt
* Delete cluster
* update changelog
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-09-06 09:57:00 +02:00
Carlos Mocholí
d5ee8d8e3f
Disable `{save,check}_on_train_epoch_end` with `check_val_every_n_epoch>1` ( #9156 )
2021-09-03 14:27:44 +00:00
Danielle Pintz
b046bd0670
Add on_exception callback hook ( #9183 )
2021-09-01 10:49:00 +02:00
B. Kerim Tshimanga
07ee8fc9a0
Remove deprecated property `ModelCheckpoint.period` in favor of `ModelCheckpoint.every_n_epochs` ( #9213 )
2021-08-31 10:04:29 +02:00
Adrian Wälchli
0abd6e94b5
[3 / 3] improvements to saving and loading callback state ( #7161 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-08-26 10:02:49 +02:00
Adrian Wälchli
b9443a07b9
[2 / 3] improvements to saving and loading callback state ( #7187 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2021-08-24 17:35:19 +00:00
Kaushik B
538e743f17
feat: Add Rich Progress Bar ( #8929 )
...
Co-authored-by: thomas chaton <thomas@grid.ai>
2021-08-24 02:40:36 +00:00
Carlos Mocholí
b1a859f312
Remove deprecated `on_{save,load}_checkpoint` signature ( #8697 )
...
Co-authored-by: Yifu Wang <yifuwang2012@gmail.com>
2021-08-21 22:48:28 -07:00
Michele Sanna
9ff0c22e43
Handle the case with no queries in `GPUStatsMonitor` ( #9014 )
...
Co-authored-by: Michele Sanna <{ID}+{username}@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2021-08-21 05:22:33 +02:00
Adrian Wälchli
5329b0d113
Fix "line too long" PEP8 complaint ( #8957 )
2021-08-18 03:31:36 +00:00
Danielle Pintz
bd13d392af
Add error handling for all trainer entry points ( #8819 )
...
* [lightning] Ensure error handling works across different trainer entry points
2021-08-18 02:04:40 +00:00
Carlos Mocholí
bfeffde8f4
Smart handling of `EarlyStopping.check_on_train_epoch_end` ( #8888 )
...
* Smart handling of `EarlyStopping.check_on_train_epoch_end`
* dummy value
* Extra flag
2021-08-14 08:50:39 +02:00
Carlos Mocholí
7d87879350
Fix SWA with a list of learning rates ( #8747 )
...
* Fix swa lrs - needs test
* Add test
* Update CHANGELOG
2021-08-14 08:50:08 +02:00
christopherfish
0749c1e7d8
Remove call to deprecated fit_loop ( #8873 )
2021-08-13 10:06:36 +02:00
Stefan Wijnja
c77cd518b5
Fix on_train_batch_end signature and call in ProgressBarBase example ( #8836 )
2021-08-12 12:24:12 +00:00
Carlos Mocholí
4928dc5579
Improve SWA docs ( #8717 )
2021-08-05 16:07:50 +00:00
Carlos Mocholí
299e289980
Remove deprecated `on_save_checkpoint` argument ( #8688 )
2021-08-05 16:16:30 +01:00
Sean Naren
e5d9e21dea
Fix save/load/resume from checkpoint for DeepSpeed Plugin ( #8397 )
2021-08-02 22:31:05 +00:00
Carlos Mocholí
ca96b2d23e
Delete deprecated save function ( #8680 )
2021-08-02 19:28:31 +02:00
Carlos Mocholí
93784da2c3
Fix pre-commit blacken-docs failures ( #8624 )
2021-07-30 12:10:15 +00:00
Carlos Mocholí
0dc0472e1f
Use class name in SWA info message ( #8602 )
2021-07-29 09:39:46 +02:00
Adrian Wälchli
8c27fa71fa
[1 / 3] improvements to saving and loading callback state ( #6886 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-29 00:12:32 +02:00
Carlos Mocholí
47c47faeae
Remove `outputs` in `on_train_epoch_end` hooks ( #8587 )
2021-07-28 18:27:54 +02:00
Jirka Borovec
0a71fe2859
CI: black docs ( #8566 )
...
* black docs
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-28 18:08:31 +02:00
Carlos Mocholí
e63968ab88
Add `pyupgrade` to `pre-commit` ( #8557 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-26 14:38:12 +02:00
Carlos Mocholí
a64cc37394
Replace `yapf` with `black` ( #7783 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-26 13:37:35 +02:00
Elad Segal
07635d0e86
fix restoring finetune callbacks after accelerator setup on training resume ( #8501 )
...
Co-authored-by: tchaton <thomas@grid.ai>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-23 19:49:32 +02:00
Carlos Mocholí
f7027a8701
Remove `torch >= 1.6` checks ( #8523 )
2021-07-23 04:03:20 +00:00
Jirka Borovec
b7dbcc3e13
Quant as optional step ( #8464 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2021-07-22 12:44:27 +00:00
thomas chaton
063f5ba73e
[bugfix] Re-compute accumulated_grad_batches ( #8493 )
...
* resolve resolution
* update changelog
* typo
* optimize test
* update on comments
* resolve comments
* update
2021-07-21 10:46:25 +00:00
thomas chaton
ea13f6021c
[bugfix] Prevent deepcopy of dataloaders / Trainer in SWA Callback ( #8472 )
...
* resolve deepcopy
* update changelog
* move private
* update on comments
* Update CHANGELOG
* Set skipped attributes to None
* Simplify test
* update
* update changelog
* update
* update on comments
* typo
* update
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2021-07-20 18:31:49 +00:00
deepsource-autofix[bot]
4bc3d70ad9
Remove unnecessary generator ( #8470 )
...
Co-authored-by: deepsource-autofix[bot] <62050782+deepsource-autofix[bot]@users.noreply.github.com>
2021-07-19 14:02:07 +00:00
Xuehai Pan
2c5d94d98b
Fix: handle logical CUDA device IDs for GPUStatsMonitor if `CUDA_VISIBLE_DEVICES` set ( #8260 )
...
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2021-07-19 11:42:43 +00:00
Carlos Mocholí
710df398c9
Remove `check_checkpoint_callback` ( #7724 )
...
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-07-19 11:29:00 +00:00
deepsource-autofix[bot]
cbf71d0a14
Remove unnecessary comprehension ( #8405 )
2021-07-19 08:30:24 +00:00
Adrian Wälchli
8c5042e1a8
fix internal call to deprecated train_loop ( #8434 )
2021-07-16 02:24:18 +02:00
Carlos Mocholí
6ce77a102b
Set minimum PyTorch version to 1.6 ( #8288 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2021-07-13 17:12:49 +00:00
Carlos Mocholí
321689f52e
Add `ModelCheckpoint(save_on_train_epoch_end)` ( #8389 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-07-13 14:47:59 +00:00
Carlos Mocholí
733cdbb9ad
`every_n_val_epochs` -> `every_n_epochs` ( #8383 )
2021-07-13 01:20:20 +02:00
Dusan Drevicky
1b06edf2f2
Add the `on_before_optimizer_step` hook ( #8048 )
...
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2021-07-09 13:30:52 +02:00
thomas chaton
1c825a2a9c
Add the `on_before_backward` hook ( #7865 )
...
* Add callback to hook tests and add predict test
* Fix lambda callback test
* Simplify lambda call test
* Use LambdaCallback
* Dynamically append to called for the model
* Remove print
* Consistency
* Consistency
* Prepare args/kwargs testing
* yapf doesn't like dict literals
* Add arguments for fit no val test
* Add arguments for fit no val test
* add before_backward_hook
* add test
* resolve flake8
* resolve tests
* update changelog
* add on_before_backward to LightningModule
* update on comments
* Test arguments
* Datamodule refactor
* Fix eval test
* remove extra file
* resolve bug
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* move to hooks
* update
* resolve flake8
* update on comments
* Update full fit + val test
* Update test
* Remove FIXME
* Remove FIXME
* Undo change
* Fix
* Parametrize fit hook test
* Comment
* Parametrize fit hook test with different precision plugins
* Fix tests
* Parametrize fit hook test with manual optimization
* Unnecessary parenthesis
* WIP
* Comments
* Fix message
* Test CI error
* Revert "Test CI error"
This reverts commit 39c4a85a83.
* Add ddp training type teardown
* Update CHANGELOG
* Adrian's fix
* Use destructor
* Update CHANGELOG.md
* RPC destructor
* Update pytorch_lightning/plugins/training_type/ddp.py
* Why do you not work :(
* Missing condition
* Fix deepspeed test
* GC collect in conftest
* Do not show warnings for special tests
* Needs to run on 1.8
To avoid: "RuntimeError: NCCL error in: /pytorch/torch/lib/c10d/ProcessGroupNCCL.cpp:32, unhandled cuda error, NCCL version 2.4.8"
* Run torch 1.8
* Skip test due to 'Python bus error'
* Debug NCCL
* shm size
* Disable warnings for special tests
* Remove NCCL_DEBUG statement
* Try smaller shm size
* Revert "Skip test due to 'Python bus error'"
This reverts commit e0a3e8785d.
* README and adjust versions
* Avoid self.on_gpu call
* empty cache cleanup
* More garbage collection
* Unroll parametrizations
* Do not reuse mock
* Undo changes
* Undo notebooks modification
* resolve test
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* delete file
* Undo
* Fix test
* Revert "WIP"
This reverts commit f5828a8c42.
* Rename
* Remove optimizers
* Fix bug with LightningOptimizer
* Add optimizers
* update
* update
* Update CHANGELOG
* On after backward refactor
* Do not call super
* Fixes
* Remove should_accumulate
* pre/post backward refactor
* Call the LM backward hook
* Update tests
* Remove dev debug patch
* Fix test
* Remove optimizer arguments and typing
* Docs fixes
* Fix comment
* Undo changes
* Split manual and auto
* Undo change
* Deepsource
* Remove optimizers
* Undo changes
* Call the hook
* Docs
* Docs
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-07-09 06:15:57 +00:00
Jaime Ferrando Huertas
9bbca402ff
Add auto_insert_metric_name to ModelCheckpoint docstring. ( #8310 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-07-07 23:15:21 +00:00
Carlos Mocholí
07d7c37a79
Remove magic monitor support for `ModelCheckpoint` ( #8293 )
2021-07-07 18:36:19 +01:00