Commit Graph

6743 Commits

Author SHA1 Message Date
William Falcon 1c50ff7a2f
Update README.md 2022-03-28 08:56:25 -04:00
William Falcon 53d1c03394
Docs clean up 1/n (#12477) 2022-03-28 12:35:59 +00:00
Carlos Mocholí dd0c40e0fd
Fix current epoch value override on restart (#12429) 2022-03-28 14:41:31 +05:30
Jerome Anand b2e98d6166
Run HPU tests only with yml (#12469) (#12478)
* Run HPU tests only with yml (#12469)

Execute supported tests serially

Signed-off-by: Jerome <janand@habana.ai>
2022-03-28 16:50:20 +09:00
Danielle Pintz 2e5728a484
Update standalone tests (#12472) 2022-03-28 04:16:17 +02:00
Carlos Mocholí 939d56c6d6
Drop PyTorch 1.7 support (#12432) 2022-03-27 21:31:20 +00:00
Adrian Wälchli ef8f49a6e0
fix title levels (#12470) 2022-03-27 12:29:58 -07:00
Bruno Cabado e618a331fd
Allow log to an existing run ID in MLflow with MLFlowLogger (#12290)
Co-authored-by: bruno.cabado <bruno.cabado@cinfo.es>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-03-27 17:28:57 +00:00
Danielle Pintz 5be67200c0
Update tests in `strategies` directory in preparation for #11040 (#12467) 2022-03-27 18:13:48 +02:00
Danielle Pintz 204c5cd943
Update remaining tests in `test_accelerator_connector` in preparation for #11040 (#12466)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-03-27 18:11:42 +02:00
Adam Reeve 7c7a4ba233
Fix SWA LR scheduler not being stepped (#12446)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-03-27 15:49:41 +00:00
Aki Nitta 648cc2d2eb
Update environment variable for cache reset for consistency (#12455) 2022-03-27 17:42:01 +02:00
DuYicong515 01d817cb9f
Deprecate `Trainer.gpus` (#12436)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-03-27 16:53:28 +02:00
Kaushik B e0723369a3
Update Introduction docs page to "Lightning in 2 Steps" (#12357)
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
2022-03-27 14:34:32 +00:00
Kaushik B 1dc57f9573
Fix: Update sphinx-autodoc-typehints minimal version (#12468) 2022-03-27 14:31:28 +00:00
Jv Kyle Eclarin 8b4abe4edb
Update `tests/models/*.py` to use `devices` instead of `gpus` or `ipus` (#11470) 2022-03-26 14:38:57 +00:00
Kushashwa Ravi Shrimali 92a2a6e951
Preparing for 1.6.0rc1 (#12453) 2022-03-25 18:23:47 +01:00
William Falcon d745260352
Update README.md 2022-03-25 13:13:56 -04:00
William Falcon 55ea3c78d5
Update README.md 2022-03-25 13:11:52 -04:00
Rohit Gupta 48f171006d
Avoid fallback on CPU if no devices are provided (#12410) 2022-03-25 15:59:06 +00:00
Rohit Gupta 5d015e7ae8
Add and update readme for docs and tests (#12348) 2022-03-25 15:33:54 +00:00
Rohit Gupta e631a66530
Update TQDM progress bar tracking with multiple dataloaders (#11657) 2022-03-25 15:13:35 +00:00
Kaushik B 28dac0c8d9
Update tpu_cores flag with accelerator and devices flag (#12158) 2022-03-25 11:57:02 +00:00
Ivan Švogor 25b771ca08
Create the loss accumulator directly on the device (#12430)
Co-authored-by: Ivan Svogor <ivan.svogor@iarai.ac.at>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-03-25 12:46:17 +01:00
ananthsub 9ac636335e
Update fit_loop.py (#12450) 2022-03-25 11:38:09 +01:00
Jerome Anand 812c2dc3d3
Add support for Habana accelerator (HPU) (#11808)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: four4fish <88516121+four4fish@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <kaushikbokka@gmail.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: jjenniferdai <89552168+jjenniferdai@users.noreply.github.com>
Co-authored-by: Kushashwa Ravi Shrimali <kushashwaravishrimali@gmail.com>
Co-authored-by: Akarsha Rao <94624926+raoakarsha@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-03-25 10:24:52 +00:00
Kaushik B 089fcb91a0
Collect and run all IPU tests (#11170)
* Collect and run all ipu tests

* Update azure pipeline

* Increase pytest verbosity

* Update RunIf

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-03-25 14:20:22 +09:00
ananthsub 4651c11178
Centralize DDP speedups in docs (#12448) 2022-03-24 20:11:02 -07:00
Aki Nitta cf3518231e
Remove unused devbot (#12338)
* Remove devbot
2022-03-25 02:54:57 +00:00
Aki Nitta 8423e47ad9
Fix default labels in issue templates (#12434) 2022-03-25 03:37:58 +01:00
DuYicong515 cfc971700d
Remove AcceleratorConnector.parallel_devices (#12075) 2022-03-25 01:45:40 +00:00
DuYicong515 b5b951b05a
Remove AcceleratorConnector.devices (#12435) 2022-03-24 17:35:46 -07:00
Danielle Pintz 6329be60be
Replace PostLocalSGDOptimizer with a dedicated model averaging component (#12378) 2022-03-24 17:33:19 -07:00
jjenniferdai ec7fa1e2d8
[docs] Update `checkpointing.rst` and `callbacks.rst` for Stateful support (#12351) 2022-03-24 17:20:21 -07:00
Kaushik B 4ca3572051
Enable tpu device ids test (#12428) 2022-03-25 09:19:08 +09:00
jjenniferdai d4a4b77906
[3/3] Update lightning callbacks to `Stateful`, deprecations for old `on_save/load_checkpoint` signatures (#11887) 2022-03-25 00:06:10 +00:00
Carlos Mocholí 71e0ddb62f
`ModelCheckpoint`'s `save_last` now ignores `every_n_epochs` (#12418)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-03-24 20:06:52 +01:00
Kaushik B dcc973e019
Add `AcceleratorRegistry` (#12180)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
2022-03-24 18:29:32 +00:00
Ethan Harris c099c8be5d
Fix: freeze Jinja2 version (#12444) 2022-03-24 16:46:08 +00:00
Jv Kyle Eclarin b1f8b111b5
Update `tests/accelerators/*.py` to use `devices` instead of `gpus` or `ipus` (#11817) 2022-03-24 14:09:39 +00:00
Carlos Mocholí 45400be921
Do not print empty evaluation result tables (#12427) 2022-03-24 15:26:35 +05:30
ananthsub d418cf23b2
Do not configure launcher if processes are launched externally (#12431) 2022-03-24 09:40:34 +00:00
Kaushik B f740085678
Add Progress Bar to docs (#11359)
Co-authored-by: Ethan Harris <ewah1g13@soton.ac.uk>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-03-24 06:27:10 +00:00
Danielle Pintz 0e6e0715f2
Update logging docs (#12245)
* Update logging docs

* fix text size issue

* put back Log Writing Frequency section
2022-03-24 03:52:05 +00:00
Carlos Mocholí 51575dcf60
Remove manual optimization `find_unused_parameters` override (#12425) 2022-03-24 00:17:18 +00:00
Ning a9bfcc7407
Call `Strategy.process_dataloader` in `data_connector.py` (#12251) 2022-03-23 22:57:56 +00:00
DuYicong515 923174147d
Remove Accelerator.parallel_device_ids and deprecate Trainer.data_parallel_device_ids (#12072) 2022-03-23 22:18:30 +00:00
ananthsub ebbe938dc1
Use debug instead of detail logging for per-iteration hooks (#12281) 2022-03-23 21:59:09 +00:00
Kaushik B 7b0d1183db
Update `gpus` flag with `accelerator` and `devices` flag (#12156)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-03-23 19:52:12 +00:00
DuYicong515 7a6efb38b2
fix merge issue (#12420) 2022-03-23 11:17:17 -07:00