Commit Graph

6579 Commits

Author SHA1 Message Date
Jan Stratil c877d54c04
Fix passing _ddp_params_and_buffers_to_ignore (#11949) 2022-02-24 17:22:48 +00:00
wangraying a0655611de
Add bagua installation in dockerfile (#11283)
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-24 15:17:31 +01:00
Rohit Gupta d9938da8a4
add properties to check for trainer state in pytorch profiler (#12063) 2022-02-24 17:03:16 +05:30
Carlos Mocholí 1fa0639bca
Set the last global step saved only when actually saving (#12057) 2022-02-24 09:43:50 +01:00
Kaushik B 49420548ce
Update warnings for available accelerators not being used (#11909)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-24 09:12:02 +01:00
Danielle Pintz a8ee5cacb7
Further clean up aggregation logic (#12053) 2022-02-23 21:42:51 +00:00
Dennis Bappert 1026ceb86d
Fixed a small issue in the documentation for FeatureExtractorFreezeUnfreeze (#12049) 2022-02-23 13:03:34 +00:00
Tianshu Wang 1ef34c5ffd
[CLI] Add config `-c` argument (#12039) 2022-02-23 13:49:01 +05:30
Ryan Soklaski 550d3a640d
Improve annotations for `_defaults_from_env_vars` decorator (#11888)
Co-authored-by: rsokl <ryan.soklaski@ll.mit.edu>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2022-02-22 19:12:52 +00:00
Carlos Mocholí 2d043857ee
Add callback to manage fault-tolerance checkpoints (#11862)
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-22 13:55:29 +00:00
Adrian Wälchli d0f54609de
Fix `is_interactive_compatible` logic after AcceleratorConnector rewrite (#12008)
* fix is_interactive_compatible
* improve tests
* update message
* address review
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-02-22 19:20:54 +05:30
Carlos Mocholí 4c4b9d540f
Remove `Trainer._device_type` (#11992) 2022-02-22 13:02:13 +00:00
Rohit Gupta d541cf4c64
remove ddp procs collection from script launcher (#12029) 2022-02-22 12:39:48 +01:00
Rohit Gupta 5ea811b1d9
Avoid loading dataloaders if `limit_batches=0` (#11576) 2022-02-22 11:33:53 +00:00
Adrian Wälchli de1815f4ba
Remove `DDPSpawnStrategy.get_mp_spawn_kwargs` in favor of launchers (#11966) 2022-02-22 11:28:21 +00:00
Kaushik B 6ff38d4c8e
Update RichProgressBarTheme after detecting light theme on colab (#10993) 2022-02-22 05:32:27 +00:00
Kaushik B 11bd176d2f
Add XLA Profiler section to docs (#11436)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-02-22 05:21:41 +00:00
Kaushik B dcad2ea4bc
Move strategy tests from accelerators to strategies directory (#11329) 2022-02-22 05:14:18 +00:00
Danielle Pintz 60fe1520fd
Remove calls to profile `model_forward` (#12032)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-02-21 23:32:38 +00:00
Akash Kwatra e50653da08
Deprecate `agg_key_funcs`, `agg_default_func`, and `update_agg_funcs` from `LightningLoggerBase` (#11871)
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-21 22:35:59 +00:00
chaddy 4c57155d14
Fix typos on new-project page (#11942)
Co-authored-by: Chaddie <chaddie.paik@webtoonscorp.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-21 22:08:26 +00:00
Jv Kyle Eclarin 9c067c2a3e
Update `tests/plugins/*.py` to use `devices` instead of `gpus` or `ipus` (#11872) 2022-02-21 22:57:21 +01:00
John Kilpatrick b5c135896f
Fix missing imports in converting.rst (#11945) 2022-02-21 22:26:18 +01:00
zredeaux65 70f9c6fda3
PyTorch documentation updates (#11739)
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Ethan Harris <ethanwharris@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-21 21:21:12 +00:00
Carlos Mocholí ab2e326e01
Clarify `val_check_interval` description (#11951)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-21 21:20:34 +00:00
Sebastian Raschka 40308f76a1
add default args in Trainer methods doc (#11614) 2022-02-21 20:39:20 +00:00
Adrian Wälchli 57aae5912e
Refactor signature for launcher (#11967) 2022-02-21 21:11:50 +01:00
Kaushik B ed863dd3e7
Update Bagua section example (#11899) 2022-02-21 20:51:58 +01:00
Danielle Pintz d724465ae6
Fix docstrings of on_fit_start/end #12016 2022-02-21 20:50:10 +01:00
Rohit Gupta f811284136
Update PT to PL conversion doc (#11397)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>
2022-02-22 01:17:03 +05:30
Jirka Borovec 9b0942d731
CI: fix upload-artifact (#11962)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-21 16:03:02 +01:00
Carlos Mocholí 3579a3083f
Describe the behavior with `limit_*_batches=1|1.0` (#11950) 2022-02-21 15:12:03 +01:00
Carlos Mocholí 08384c45ea
Support manual optimization step profiling without a trainer reference (#11883) 2022-02-21 13:43:12 +00:00
Carlos Mocholí 0771a553de
Remove `Trainer._strategy_type` (#11990) 2022-02-21 13:39:30 +00:00
four4fish e15a66412c
Add back deterministic support in accelerator_connector (#11999)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-02-20 18:11:47 +00:00
Carlos Mocholí 6bc0e1da63
Restore test after #11448 (#11986) 2022-02-19 21:39:41 +00:00
Adrian Wälchli 923b30dc51
Fix import error when running doctests for RL examples (#12010)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-19 21:57:02 +01:00
Kushashwa Ravi Shrimali 0374fe65db
Support gradient accumulation using Horovod's `backward_passes_per_step` (#11911)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-19 02:54:04 +01:00
ananthsub cf64f34434
Refactor `Strategy._move_optimizer_states` as utility functions (#11758)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-18 08:36:07 +00:00
Akash Kwatra d61371922b
Deprecate and remove calls to agg_and_log_metrics (#11832)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-18 02:54:33 +00:00
Akash Kwatra 73e9ca343a
Remove `log_text`, and `log_image` from `LightningLoggerBase` API (#11857) 2022-02-17 18:09:09 -08:00
four4fish 6e14209185
Rewrite accelerator_connector (#11448) 2022-02-17 23:38:39 +00:00
Carlos Mocholí a0ca8d076f
Refactor early stopping test (#11866) 2022-02-17 23:20:39 +00:00
Rohit Gupta 25b505508d
Add process launchers (#11643)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-17 21:16:51 +00:00
ananthsub 4dba492fb5
Update horovod.py (#11917) 2022-02-16 21:58:54 -08:00
Konstantin Dobler d902b59886
Fix environment variable order for global rank determination (#11406)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-17 02:09:05 +00:00
ananthsub 62ebd42ce0
Update ddp.py (#11929) 2022-02-16 17:29:07 -08:00
edward-io 87bd54aedf
fix typos (#11937) 2022-02-16 17:27:51 -08:00
Carlos Mocholí 4cc05b2cf9
Support optimizer step progress tracking with manual optimization (#11848) 2022-02-16 21:27:27 +00:00
Carlos Mocholí 963adc7857
Small cleanup when dataloader states are saved (#11843) 2022-02-16 20:57:21 +00:00