Commit Graph

4112 Commits

Author SHA1 Message Date
ananthsub 5d040e7fc6
Add support for `dataloader_iter` to validate and test steps (#11546)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-24 18:01:35 +00:00
Jan Stratil c877d54c04
Fix passing _ddp_params_and_buffers_to_ignore (#11949) 2022-02-24 17:22:48 +00:00
Rohit Gupta d9938da8a4
add properties to check for trainer state in pytorch profiler (#12063) 2022-02-24 17:03:16 +05:30
Carlos Mocholí 1fa0639bca
Set the last global step saved only when actually saving (#12057) 2022-02-24 09:43:50 +01:00
Kaushik B 49420548ce
Update warnings for available accelerators not being used (#11909)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-24 09:12:02 +01:00
Danielle Pintz a8ee5cacb7
Further clean up aggregation logic (#12053) 2022-02-23 21:42:51 +00:00
Dennis Bappert 1026ceb86d
Fixed a small issue in the documentation for FeatureExtractorFreezeUnfreeze (#12049) 2022-02-23 13:03:34 +00:00
Tianshu Wang 1ef34c5ffd
[CLI] Add config `-c` argument (#12039) 2022-02-23 13:49:01 +05:30
Ryan Soklaski 550d3a640d
Improve annotations for `_defaults_from_env_vars` decorator (#11888)
Co-authored-by: rsokl <ryan.soklaski@ll.mit.edu>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2022-02-22 19:12:52 +00:00
Carlos Mocholí 2d043857ee
Add callback to manage fault-tolerance checkpoints (#11862)
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-22 13:55:29 +00:00
Adrian Wälchli d0f54609de
Fix `is_interactive_compatible` logic after AcceleratorConnector rewrite (#12008)
* fix is_interactive_compatible

* improve tests

* update message

* address review

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-02-22 19:20:54 +05:30
Carlos Mocholí 4c4b9d540f
Remove `Trainer._device_type` (#11992) 2022-02-22 13:02:13 +00:00
Rohit Gupta d541cf4c64
remove ddp procs collection from script launcher (#12029) 2022-02-22 12:39:48 +01:00
Rohit Gupta 5ea811b1d9
Avoid loading dataloaders if `limit_batches=0` (#11576) 2022-02-22 11:33:53 +00:00
Adrian Wälchli de1815f4ba
Remove `DDPSpawnStrategy.get_mp_spawn_kwargs` in favor of launchers (#11966) 2022-02-22 11:28:21 +00:00
Kaushik B 6ff38d4c8e
Update RichProgressBarTheme after detecting light theme on colab (#10993) 2022-02-22 05:32:27 +00:00
Kaushik B 11bd176d2f
Add XLA Profiler section to docs (#11436)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-02-22 05:21:41 +00:00
Danielle Pintz 60fe1520fd
Remove calls to profile `model_forward` (#12032)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-02-21 23:32:38 +00:00
Akash Kwatra e50653da08
Deprecate `agg_key_funcs`, `agg_default_func`, and `update_agg_funcs` from `LightningLoggerBase` (#11871)
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-21 22:35:59 +00:00
Carlos Mocholí ab2e326e01
Clarify `val_check_interval` description (#11951)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-21 21:20:34 +00:00
Sebastian Raschka 40308f76a1
add default args in Trainer methods doc (#11614) 2022-02-21 20:39:20 +00:00
Adrian Wälchli 57aae5912e
Refactor signature for launcher (#11967) 2022-02-21 21:11:50 +01:00
Danielle Pintz d724465ae6
Fix docstrings of on_fit_start/end (#12016) 2022-02-21 20:50:10 +01:00
Carlos Mocholí 3579a3083f
Describe the behavior with `limit_*_batches=1|1.0` (#11950) 2022-02-21 15:12:03 +01:00
Carlos Mocholí 08384c45ea
Support manual optimization step profiling without a trainer reference (#11883) 2022-02-21 13:43:12 +00:00
Carlos Mocholí 0771a553de
Remove `Trainer._strategy_type` (#11990) 2022-02-21 13:39:30 +00:00
four4fish e15a66412c
Add back deterministic support in accelerator_connector (#11999)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-02-20 18:11:47 +00:00
Kushashwa Ravi Shrimali 0374fe65db
Support gradient accumulation using Horovod's `backward_passes_per_step` (#11911)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-19 02:54:04 +01:00
ananthsub cf64f34434
Refactor `Strategy._move_optimizer_states` as utility functions (#11758)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-18 08:36:07 +00:00
Akash Kwatra d61371922b
Deprecate and remove calls to agg_and_log_metrics (#11832)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-18 02:54:33 +00:00
Akash Kwatra 73e9ca343a
Remove `log_text`, and `log_image` from `LightningLoggerBase` API (#11857) 2022-02-17 18:09:09 -08:00
four4fish 6e14209185
Rewrite accelerator_connector (#11448) 2022-02-17 23:38:39 +00:00
Rohit Gupta 25b505508d
Add process launchers (#11643)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-17 21:16:51 +00:00
ananthsub 4dba492fb5
Update horovod.py (#11917) 2022-02-16 21:58:54 -08:00
Konstantin Dobler d902b59886
Fix environment variable order for global rank determination (#11406)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-17 02:09:05 +00:00
ananthsub 62ebd42ce0
Update ddp.py (#11929) 2022-02-16 17:29:07 -08:00
edward-io 87bd54aedf
fix typos (#11937) 2022-02-16 17:27:51 -08:00
Carlos Mocholí 4cc05b2cf9
Support optimizer step progress tracking with manual optimization (#11848) 2022-02-16 21:27:27 +00:00
Carlos Mocholí 963adc7857
Small cleanup when dataloader states are saved (#11843) 2022-02-16 20:57:21 +00:00
jjenniferdai d69b33f1f0
Introduce `Stateful` PrecisionPlugin (#11638)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-02-14 15:56:09 +05:30
Akash Kwatra 914f685ed8
Update typing for CometLogger.experiment (#11836) 2022-02-14 09:16:11 +00:00
guyang3532 79c4e5de60
Refine the pytorch profiler (#11268) 2022-02-11 14:50:18 +01:00
Carlos Mocholí 6f22b3623c
Fix master merge conflict (#11858) 2022-02-10 21:49:45 +00:00
Carlos Mocholí 1515ef90ee
Revert saving the dataloader and result collection by default (#11842) 2022-02-10 20:43:35 +01:00
Carlos Mocholí 789fae828d
Fix `current_epoch` value on training end (#8578)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-10 17:55:59 +01:00
Danielle Pintz 0e68734ddd
Make default logger name "lightning_logs" (#11762)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-10 15:19:18 +05:30
ananthsub 8d23f6287a
Update module path for `LightningDeprecationWarning` in setup.cfg (#11793) 2022-02-10 08:59:32 +05:30
ananthsub 1b107c5892
Add `Accelerator.is_available()` interface requirement (#11797) 2022-02-09 15:11:27 -08:00
Adrian Wälchli c618e59689
Fix `save_hyperparameters` when no parameters need saving (#11827) 2022-02-09 23:10:14 +00:00
Akash Kwatra a2d8c4f6a6
Create loggers property for Trainer and LightningModule (#11683)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-09 23:39:41 +01:00
Boris Dayma 1e36cffbca
feat(wandb): support distributed modes (#11650)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-02-09 19:53:21 +01:00
Carlos Mocholí 8394770d4a
Move data fetcher ownership to the loops (#11621) 2022-02-09 20:04:24 +05:30
Biho-Kim 24de29974c
bug fix #10872 (#10965)
Co-authored-by: louie.kim <louie.kim@kakaocorp.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-09 14:15:49 +00:00
Carlos Mocholí 8822117200
Return the output of the optimizer step (#11711)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-02-09 09:37:13 +00:00
Danielle Pintz 9e63281a4c
remove todos (#11804) 2022-02-09 08:30:27 +00:00
ananthsub 9d4de3a863
Faster callback configuration validator checks (#11785)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-02-09 08:24:14 +00:00
Rohit Gupta 182c18d319
Configure native deepspeed schedulers with interval='step' (#11788) 2022-02-09 08:20:50 +00:00
jjenniferdai 1203094a20
Introduce `Stateful` DataModule (#11637) 2022-02-07 21:13:24 +01:00
circlecrystal 43a89eb132
bug fix: restore_optimizers correctly handles non-mapping values in optimizer.state.values() (#11757)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-07 14:55:06 +00:00
Rohit Gupta 9ed44dee0d
Fix to avoid moving batch to device for DataParallel (#11780)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-02-07 14:26:18 +00:00
Rohit Gupta 581bf7f2f2
Deprecate `on_epoch_start/on_epoch_end` hook (#11578) 2022-02-07 14:15:27 +00:00
ananthsub bbf27ed09a
Use fsspec in checkpoint connector for fault-tolerant training (#11776) 2022-02-07 13:29:41 +01:00
ananthsub 0ba25d3cac
Update DDPStrategy to use optimizers property from within class (#11777) 2022-02-07 13:28:37 +01:00
Rohit Gupta 7ec1e66e17
reduce only loss with dp (#11594)
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-07 17:00:29 +05:30
Krishna Kalyan f509e40ae3
Deprecate `on_before_accelerator_backend_setup` callback hook (#11655)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-02-07 11:07:21 +00:00
ananthsub a64438c897
Centralize rank_zero_only utilities into their own module (#11747)
* Centralize rank_zero_only utilities into their own module

Fixes #11746

* PossibleUserWarning

* Update test_warnings.py

* update imports

* more imports

* Update CHANGELOG.md

* Update mlflow.py

* Update cli.py

* Update api_references.rst

* Update meta.py

* add deprecation tests

* debug standalone

* fix standalone tests

* Update CHANGELOG.md
2022-02-07 08:09:55 +00:00
Danielle Pintz 34c454c756
Small improvements to TB and CSV loggers (#11764)
* small improvements to TB and CSV loggers
* addr comments
* remove redundant lines and update tests

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
2022-02-07 14:59:39 +09:00
ananthsub 7900aabe62
Keep `is_global_zero` definitions in sync across strategy and trainer (#11761) 2022-02-07 01:33:32 +05:30
ananthsub dfda970572
Update TPU Spawn to use root_device instead of LightningModule's device (#11750) 2022-02-06 06:26:38 +00:00
Dan Dale 9d8faecdb2
Allow Horovod `teardown()` to complete gracefully if exception thrown in callback setup (#11752) 2022-02-05 11:13:21 -08:00
ananthsub 819a747031
Use `root_device` in XLAStatsMonitor callback (#11749) 2022-02-05 10:09:08 -08:00
ananthsub 7d9454a3e9
Use `root_device` in DeviceStatsMonitor callback (#11748)
* Use trainer.strategy.root_device in favor of LightningModule.device in DeviceStatsMonitor

Minor refactor to use the strategy's own `root_device` instead of the LightningModule's device property.

Attempts at manual model parallelization by extending this plugin will face difficulties with the assumption that the LightningModule has all of its parameters on the same device. 

For those use cases, it is critical to remove the assumption that the module has a device property (a device property in general goes against the PyTorch module design principles):
- https://github.com/pytorch/pytorch/issues/7460
- https://github.com/PyTorchLightning/pytorch-lightning/pull/1790#discussion_r423459412
2022-02-05 11:20:15 +01:00
ananthsub 241c97e6eb
Update HorovodStrategy to use optimizers property from within class (#11728) 2022-02-05 10:04:55 +01:00
Adrian Wälchli cc43d07db1
Remove legacy dead code in DDP script launch (#11678)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-05 11:40:16 +05:30
Dan Dale 3bc2407239
Allow access to ckpt_path within context of fit() (#11696)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-05 05:23:16 +01:00
Carlos Mocholí 7da931d1ca
Support no pre-fetching (#11606) 2022-02-05 03:59:46 +00:00
Danielle Pintz c71a1d7ea2
Remove `self._log_dir` from `BaseProfiler` (#11740) 2022-02-05 04:45:48 +01:00
ananthsub 72db64d294
Use the strategy's `root_device` instead of the LightningModule's device property (#11734) 2022-02-05 04:33:25 +01:00
Andres Algaba 58324b5197
Improve the result printing at the end of evaluation (#11332)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-02-05 03:03:22 +01:00
NathanGodey 8a1b1eeef8
WandbLogger's log_image can use step argument (#11716)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-05 01:02:41 +00:00
wangraying 8c07d8bf90
Add `Trainer(strategy="bagua")` (#11146)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-04 17:02:09 +00:00
ananthsub 2eca957b29
Minor refactors to `init_dist_connection` (#11733) 2022-02-04 13:33:49 +01:00
Rohit Gupta 4d72110b51
Deprecate `on_batch_start/on_batch_end` callback hooks (#11577) 2022-02-03 19:51:56 +00:00
Rohit Gupta 400201712f
added warning for distributedsampler in case of evaluation (#11479) 2022-02-03 18:42:13 +00:00
Rohit Gupta 01abe72278
Fix to avoid val progress bar disappear after validate (#11700)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-03 13:35:38 +00:00
Rohit Gupta e9065e9d42
Fix rich with uneven refresh rate tracking (#11668)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-03 10:27:05 +00:00
Rohit Gupta 7948ed703d
Avoid enforcing `shuffle=False` for eval dataloaders (#11575) 2022-02-03 09:35:31 +00:00
Danielle Pintz 9ebd7df22a
Move progress bar disabling out of the Trainer (#11377)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-02-03 06:29:32 +00:00
Rohit Gupta 0cb64fb8ba
Fix mid-epoch warning call while resuming (#11556)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-03 05:42:31 +00:00
four4fish d43fd0d4d6
Lazy initialize Strategy.parallel_devices (#11572)
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-03 04:25:16 +00:00
Rohit Gupta eceefdc602
Fix rich progress bar render only on main pbar (#11690) 2022-02-03 04:18:07 +00:00
Krishna Kalyan 6291af5c19
Replace occurrences of `on_before_accelerator_backend_setup_called` with `setup` (#11568)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-03 04:14:33 +00:00
Peter Franek ed8a5dadce
Improving instructions in finetuning docstring (#10484) 2022-02-03 04:13:06 +00:00
Anton Schwaighofer f935319622
Allow a `CombinedLoader` as the training data in DDP (#11648)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-03 04:01:20 +00:00
Sebastian Raschka 0e17f16438
Clarify what the default values for log are based on hooks (#11611) 2022-02-03 03:55:42 +00:00
Jirka Borovec c5de105276
fix available modules (#11526) 2022-02-03 03:38:16 +00:00
Sebastian Raschka 9934569373
Fix typo in `TensorBoardLogger.log_metrics` error message (#11595) 2022-02-03 03:18:54 +00:00
Carlos Mocholí 3d3172d3da
[CLI] Support shorthand for loggers (#11533) 2022-02-03 02:58:14 +00:00
Bhadresh Savani 0ea48416cd
Removed subsection in `LightningDataModule` (#11675) 2022-02-03 02:53:43 +00:00
DuYicong515 0816a1997e
Add typing for utilities/memory.py (#11545)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-03 02:34:05 +00:00
Piyush Hirapara 72f0e5bfae
Deprecate `on_configure_sharded_model` callback hook for v1.6 (#11627)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-03 02:29:26 +00:00
Krishna Kalyan 6586dd23b7
Mark `CheckpointConnector` as protected (#11550)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-03 02:26:08 +00:00
DuYicong515 06e2635c71
Refactor get_filesystem to use native fsspec API (#11708) 2022-02-03 01:55:24 +00:00
Akash Kwatra d5aa7717aa
Remove experiment property from abstract class (#11603)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-03 01:51:34 +00:00
Rohit Gupta ee049e123d
Fix rich progress bar metric render on epoch end (#11689)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-03 01:43:48 +00:00
jjenniferdai ec1379da2c
Rename `_SupportsStateDict` --> `_Stateful` Protocol (#11469) 2022-02-02 23:45:59 +01:00
Carlos Mocholí b8e360dafa
[CLI] Fix bug that forces overriding `configure_optimizers` (#11672) 2022-02-02 22:44:00 +00:00
Akash Kwatra 115a5d08e8
Decouple utilities from `LightningLoggerBase` (#11484)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-02 23:29:01 +01:00
Aki Nitta fbc1f9f1d9
Rename `Strategy.lr_schedulers` to `Strategy.lr_scheduler_configs` (#11549) 2022-02-02 22:10:01 +00:00
Nithin Rao b8d2c65a37
Set the state before saving "last" or "none" checkpoints (#11481)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-02 23:07:05 +01:00
Carlos Mocholí d7944a13cd
Teardown all internal components on exception (#11620) 2022-02-02 21:10:19 +00:00
Rohit Gupta 3eee8f18cf
Sort simple profiler summary based on mean duration (#11671) 2022-02-02 20:44:42 +00:00
Rohit Gupta 76175217e4
Fix val_loop run on restart (#11552)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-02 20:19:34 +00:00
Carlos Mocholí a44881cd90
Changes in preparation to #8578 (#11562) 2022-02-02 19:57:08 +00:00
Carlos Mocholí 79a3ff690b
Add typing to data fetching (#11515) 2022-02-02 20:53:50 +01:00
Chunyang Wen fe34bf2a65
Remove useless pass and abc (#11522) 2022-01-24 08:19:57 +00:00
Chunyang Wen 350c88e621
Let Accelerator inherit from ABC to make sure abstractmethod takes effect (#11521) 2022-01-23 20:47:43 +01:00
Carlos Mocholí 623dc974f5
Construct the hook kwargs inside each loop (#11511) 2022-01-22 15:57:12 +00:00
Carlos Mocholí 5ad5ba54c0
Refactor fetching function (#11516) 2022-01-20 20:06:58 +01:00
Carlos Mocholí 075b8801c9
Fix checkpoint values when saving and resetting the tuner state (#11518) 2022-01-20 18:54:40 +00:00
Carlos Mocholí 7295457a7b
[CLI] Save only the configuration used (#11532) 2022-01-20 12:35:43 +00:00
Rafał Jankowski e78d658c8d
Remove access to `_short_id` in NeptuneLogger (#11517) 2022-01-20 12:07:42 +00:00
Maaz Karim 16a04b29eb
Mark SignalConnector as protected (#11513)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-01-20 08:39:59 +01:00
ananthsub 1bd6fc979e
Remove `Strategy.on_tpu` property (#11536) 2022-01-20 08:25:26 +01:00
ananthsub f41d1e5e5e
Remove `Strategy.on_gpu` (#11537) 2022-01-19 21:27:12 +00:00
Rohit Gupta f7f835fa0e
improve simple profiler output (#11414) 2022-01-18 19:58:34 +00:00
Carlos Mocholí 62818dbace
Use a dataclass as the scheduler config (#11443) 2022-01-18 20:23:32 +01:00
Carlos Mocholí 344ab1e0a5
Move the `lightning_optimizers` ownership to the `Strategy` (#11444) 2022-01-18 12:58:56 +01:00
Rohit Gupta 033dba1494
Disable attaching samplers when using `IterableDataset` (#11507) 2022-01-17 23:33:57 +01:00
Gautam R Gare ef4677ae7b
Change the default `prog_bar=False` to `True` in `LightningModule.log_grad_norm` (#11472)
* Reset on_step flag to True in log_grad_norm
* updated change log

Co-authored-by: Aki Nitta <nitta@akihironitta.com>
2022-01-18 02:34:50 +09:00
Carlos Mocholí 9cf9ded73b
Simplify data fetching (#11466) 2022-01-17 14:46:55 +00:00
Rohit Gupta cad604211b
update load_from_checkpoint docstrings (#11467) 2022-01-16 20:48:27 +00:00
Carlos Mocholí 18bbb39eef
Set `Loop.restarting` recursively (#11442)
* Set `Loop.restarting` recursively
* Docs
* CHANGELOG
* Update pytorch_lightning/loops/epoch/training_epoch_loop.py
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
2022-01-14 19:25:23 +09:00
Rohit Gupta 9771e7dff6
Update introduction docs (#11140) 2022-01-13 21:11:43 +00:00
Carlos Mocholí a80da35d5d
Fix compatibility with old checkpoints and fault-tolerance enabled (#11439) 2022-01-13 14:53:17 +01:00
Rohit Gupta 96a53382ac
Update utilities API references (#11450) 2022-01-13 13:22:58 +00:00
Carlos Mocholí 5914fb748f
Add typing to accelerators/gpu.py (#11333) 2022-01-12 19:44:51 +00:00
Rohit Gupta 00d1758bac
Update training tricks docs (#11169) 2022-01-12 16:26:03 +00:00
Carlos Mocholí f5bbc2cf17
Avoid in-place ops during logging result updates (#11401)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-01-12 09:09:36 +01:00
Rohit Gupta 221091afc4
move profiler docs (#11431) 2022-01-12 05:56:16 +00:00
Aki Nitta 8dc36c3745
Fix inconsistent exceptions raised with no `rich` installed (#11360)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-01-12 03:55:51 +00:00
Rohit Gupta 82c8875f33
Add `LightningModule.lr_scheduler_step` (#10249)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-01-12 03:53:49 +00:00
Carlos Mocholí 9771040621
Add typing to `TQDMProgressBar` (#11369) 2022-01-12 01:07:30 +00:00
edward-io 6107ce8e0d
Add DETAIL logs for batch use cases (#11008) 2022-01-12 01:22:48 +01:00
Rohit Gupta 06b8f82b8a
Update API references in doc (#11357) 2022-01-07 15:56:17 +01:00
Carlos Mocholí 59a7ba7605
Move `epoch_{start,end}` hooks from `TrainingEpochLoop` to `FitLoop` (#11201) 2022-01-06 15:13:18 +00:00
Danielle Pintz 57567edeab
Move newly added Trainer methods to be with other methods (#11335) 2022-01-06 14:10:21 +00:00
Kaushik B 42a1c72660
Add Accelerators section to Lightning docs (#10755) 2022-01-06 19:12:44 +05:30
Carlos Mocholí 8a549a550c
Integrate progress tracking into the progress bar (#11213) 2022-01-06 14:29:48 +01:00
Adrian Wälchli 3a2df4f75d
Fix typing in `pl.callbacks.xla_stats_monitor` (#11219)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-01-06 12:51:02 +00:00