Commit Graph

6552 Commits

Author SHA1 Message Date
Kaushik B ed863dd3e7
Update Bagua section example (#11899) 2022-02-21 20:51:58 +01:00
Danielle Pintz d724465ae6
Fix docstrings of on_fit_start/end #12016 2022-02-21 20:50:10 +01:00
Rohit Gupta f811284136
Update PT to PL conversion doc (#11397)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>
2022-02-22 01:17:03 +05:30
Jirka Borovec 9b0942d731
CI: fix upload-artifact (#11962)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-21 16:03:02 +01:00
Carlos Mocholí 3579a3083f
Describe the behavior with `limit_*_batches=1|1.0` (#11950) 2022-02-21 15:12:03 +01:00
Carlos Mocholí 08384c45ea
Support manual optimization step profiling without a trainer reference (#11883) 2022-02-21 13:43:12 +00:00
Carlos Mocholí 0771a553de
Remove `Trainer._strategy_type` (#11990) 2022-02-21 13:39:30 +00:00
four4fish e15a66412c
Add back deterministic support in accelerator_connector (#11999)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-02-20 18:11:47 +00:00
Carlos Mocholí 6bc0e1da63
Restore test after #11448 (#11986) 2022-02-19 21:39:41 +00:00
Adrian Wälchli 923b30dc51
Fix import error when running doctests for RL examples (#12010)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-19 21:57:02 +01:00
Kushashwa Ravi Shrimali 0374fe65db
Support gradient accumulation using Horovod's `backward_passes_per_step` (#11911)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-19 02:54:04 +01:00
ananthsub cf64f34434
Refactor `Strategy._move_optimizer_states` as utility functions (#11758)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-18 08:36:07 +00:00
Akash Kwatra d61371922b
Deprecate and remove calls to agg_and_log_metrics (#11832)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-18 02:54:33 +00:00
Akash Kwatra 73e9ca343a
Remove `log_text`, and `log_image` from `LightningLoggerBase` API (#11857) 2022-02-17 18:09:09 -08:00
four4fish 6e14209185
Rewrite accelerator_connector (#11448) 2022-02-17 23:38:39 +00:00
Carlos Mocholí a0ca8d076f
Refactor early stopping test (#11866) 2022-02-17 23:20:39 +00:00
Rohit Gupta 25b505508d
Add process launchers (#11643)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-17 21:16:51 +00:00
ananthsub 4dba492fb5
Update horovod.py (#11917) 2022-02-16 21:58:54 -08:00
Konstantin Dobler d902b59886
Fix environment variable order for global rank determination (#11406)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-17 02:09:05 +00:00
ananthsub 62ebd42ce0
Update ddp.py (#11929) 2022-02-16 17:29:07 -08:00
edward-io 87bd54aedf
fix typos (#11937) 2022-02-16 17:27:51 -08:00
Carlos Mocholí 4cc05b2cf9
Support optimizer step progress tracking with manual optimization (#11848) 2022-02-16 21:27:27 +00:00
Carlos Mocholí 963adc7857
Small cleanup when dataloader states are saved (#11843) 2022-02-16 20:57:21 +00:00
Jirka Borovec 7bc87015ea
Unblock GPU CI (#11934)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-16 21:15:44 +01:00
amin-nejad a143a5280e Update trainer.rst 2022-02-16 09:03:48 -05:00
Akihiro Nitta f79d75d4b5 Pin myst-parser<0.17 2022-02-14 09:19:49 -05:00
jjenniferdai d69b33f1f0
Introduce `Stateful` PrecisionPlugin (#11638)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-02-14 15:56:09 +05:30
Akash Kwatra 914f685ed8
Update typing for CometLogger.experiment (#11836) 2022-02-14 09:16:11 +00:00
guyang3532 79c4e5de60
Refine the pytorch profiler (#11268) 2022-02-11 14:50:18 +01:00
Carlos Mocholí 6f22b3623c
Fix master merge conflict (#11858) 2022-02-10 21:49:45 +00:00
Carlos Mocholí 1515ef90ee
Revert saving the dataloader and result collection by default (#11842) 2022-02-10 20:43:35 +01:00
Carlos Mocholí 789fae828d
Fix `current_epoch` value on training end (#8578)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-10 17:55:59 +01:00
Rohit Gupta 5e78f429e3
update changelog after 1.5.10 release (#11830) 2022-02-10 13:10:38 +00:00
Danielle Pintz 0e68734ddd
Make default logger name "lightning_logs" (#11762)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-02-10 15:19:18 +05:30
Aki Nitta 0a1b8b880d
Fix horovod installation `base-cuda` Dockerfile (#11811)
* pip install --user

* add checks

* rm unrelated comment

* consistent format

* Fail if horovod not found

Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-02-10 16:48:33 +09:00
ananthsub 8d23f6287a
Update module path for `LightningDeprecationWarning` in setup.cfg (#11793) 2022-02-10 08:59:32 +05:30
ananthsub 1b107c5892
Add `Accelerator.is_available()` interface requirement (#11797) 2022-02-09 15:11:27 -08:00
Adrian Wälchli c618e59689
Fix `save_hyperparameters` when no parameters need saving (#11827) 2022-02-09 23:10:14 +00:00
Akash Kwatra a2d8c4f6a6
Create loggers property for Trainer and LightningModule (#11683)
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Danielle Pintz <38207072+daniellepintz@users.noreply.github.com>
2022-02-09 23:39:41 +01:00
Boris Dayma 1e36cffbca
feat(wandb): support distributed modes (#11650)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-02-09 19:53:21 +01:00
Carlos Mocholí 8394770d4a
Move data fetcher ownership to the loops (#11621) 2022-02-09 20:04:24 +05:30
Biho-Kim 24de29974c
bug fix #10872 (#10965)
Co-authored-by: louie.kim <louie.kim@kakaocorp.comlouie.kim@kakaocorp.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-02-09 14:15:49 +00:00
Rohit Gupta e3820da28a
Update inference doc (#11428) 2022-02-09 12:39:31 +00:00
Jv Kyle Eclarin b34d8673a9
Update `tests/trainer/*.py` to use `devices` instead of `gpus` or `ipus` (#11697) 2022-02-09 09:40:22 +00:00
Carlos Mocholí 8822117200
Return the output of the optimizer step (#11711)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-02-09 09:37:13 +00:00
Danielle Pintz 9e63281a4c
remove todos (#11804) 2022-02-09 08:30:27 +00:00
ananthsub 9d4de3a863
Faster callback configuration validator checks (#11785)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-02-09 08:24:14 +00:00
Rohit Gupta 182c18d319
Configure native deepspeed schedulers with interval='step' (#11788) 2022-02-09 08:20:50 +00:00
William Falcon 0fcc048c7d
update slack link 2022-02-08 20:30:52 -08:00
Danielle Pintz 32e7d32956
Delete `test_on_before_accelerator_backend_setup` (#11803)
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-08 09:36:37 +00:00