Commit Graph

312 Commits

Author SHA1 Message Date
Adrian Wälchli 4008f9cd41
Convert subprocess test to standalone test (#14101) 2022-08-10 17:15:12 -04:00
otaj f132d44821
Fix a bug that caused spurious `AttributeError` when multiple `DataLoader` classes are imported (#14117) 2022-08-10 16:09:50 +00:00
Carlos Mocholí 9b61b1c482
Remove duplicated test classes (#14122)
Remove duplicated classes
2022-08-10 17:21:05 +02:00
Adrian Wälchli dc8ff5ed26
Fix device placement when `.cuda()` called without specifying index (#14128) 2022-08-10 05:23:20 -04:00
Adam Reeve 975a4fc2f1
Support checkpoint save and load with Stochastic Weight Averaging (#9938)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Kushashwa Ravi Shrimali <kushashwaravishrimali@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-09 23:18:21 +00:00
Adrian Wälchli 06c255c5c1
Skip ddp fork tests on windows (#14121) 2022-08-09 22:54:10 +00:00
Carlos Mocholí d85085479d
Reset all results on epoch end (#14061) 2022-08-09 23:01:11 +05:30
Rohit Gupta ac369f5570
Fix incorrect `precision="mixed"` being used with `DeepSpeedStrategy` and `IPUStrategy` (#14041)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-08-09 21:25:23 +05:30
Anton Shevtsov c55fe7105b
Prefix seed_everything log messages with rank info (#14031)
Co-authored-by: Anton Shevtsov <aeshevtsov@avito.ru>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-08-09 15:40:30 +02:00
Adrian Wälchli 0cfc53d6b4
Fix regression on default value for `find_unused_parameters` (#14095) 2022-08-09 13:56:02 +05:30
Carlos Mocholí d072e4451a
Fix dtype inference during gradient norm computation (#14051) 2022-08-08 11:35:06 +00:00
Carlos Mocholí aaeff90254
Remove deprecated `DistributedType` and `DeviceType` enum classes (#14045) 2022-08-08 10:07:54 +02:00
Rohit Gupta b25275ccc2
Cast to fp16 before moving to device with deepspeed (#14000) 2022-08-05 22:15:15 +00:00
Carlos Mocholí 91dd6a68fb
Remove meta device utilities in favor of torchdistx (#13868) 2022-08-05 12:20:27 +00:00
Adrian Wälchli 3d5c3d24f9
Remove unused auto_collect_arguments class method (#14015) 2022-08-05 08:49:00 +00:00
Rohit Gupta a4e4cab7a6
Deprecate `amp_level` from `Trainer` (#13898)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-05 08:31:19 +00:00
Carlos Mocholí b88b700745
Remove the deprecated DDP2 strategy (#14026) 2022-08-04 20:27:35 +00:00
Rohit Gupta f5bd6e6f5f
Cast only floating types with IPUs (#13983) 2022-08-04 19:46:07 +00:00
Adrian Wälchli ef0623ec64
Remove deprecated training type plugins (#14011)
* Remove deprecated training type plugins

* update changelog

* DDP2Plugin

* Update src/pytorch_lightning/CHANGELOG.md
2022-08-04 18:00:00 +02:00
Rohit Gupta e78bf2044b
Raise an error if batch transfer hooks are overridden with IPUAccelerator (#13961)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-04 12:04:42 +00:00
Adam J. Stewart d748dae548
Fix erroneous warning for unset `max_epochs` (#13262)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-03 19:17:21 +00:00
Adrian Wälchli e6a8283e9c
Organize accelerator tests (#13986) 2022-08-03 13:49:55 +00:00
Adrian Wälchli 4ce97f37a2
Validate the model input of trainer methods (#13892)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-03 13:38:42 +00:00
Adrian Wälchli ce025bf954
Lazy import check for hydra dependency (#13812) 2022-08-03 04:27:16 -04:00
Jerome Anand b3203d93d0
Added support for HPU device stats monitor (#13819)
* Added support for HPU device stats monitor

Signed-off-by: Jerome <janand@habana.ai>

* Update changelog

Signed-off-by: Jerome <janand@habana.ai>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Apply suggestions from code review

Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>

* Update reference

Signed-off-by: Jerome <janand@habana.ai>

* Apply suggestions from code review

Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

* fix alignment

* add descriptions

* Update hpu_intermediate.rst

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-02 13:31:31 +05:30
Adrian Wälchli eb233ea12d
Snapshot selected globals and restore them in spawned process (#13921)
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-01 22:21:46 +00:00
Rohit Gupta 0f6caffa57
Fix deepspeed default precision plugin `amp_level` to O2 (#13897)
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-29 20:36:51 +00:00
Adrian Wälchli caaf35689c
Improvements to standalone scripts (#13840)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-28 23:33:22 +00:00
HMellor 07b39c257b
Cast on host instead of IPU when using `precision=16` (#13880)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-28 19:26:41 +00:00
Adrian Wälchli 25203d4c81
Organize model summary utilities (#13893) 2022-07-28 19:23:29 +02:00
Carlos Mocholí 406cea7146
Support DeepSpeed <0.7.0 (#13859)
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-07-28 14:38:51 +00:00
Carlos Mocholí 1299e4f984
Run GPU tests with PyTorch 1.12 (#13716)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-07-28 19:37:57 +05:30
Carlos Mocholí 511875e567
Support DeepSpeed >=0.6.0, <0.6.5 (#13863)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-27 18:57:52 +02:00
Adrian Wälchli fff62f0ae5
Fix TPU testing and collect all tests (#11098)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2022-07-27 15:40:40 +00:00
otaj 95f5f170f5
Allowed custom `BatchSampler`s when instantiated in `*_dataloader` hook (#13640)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-27 15:32:50 +00:00
Adrian Wälchli 2a24b906ac
Add batch size script argument for standalone tests (#13841)
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-07-27 12:36:22 +00:00
otaj 4c7b9f0b11
Disallow batch sampler with multiple IPU devices (#13854)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 15:20:43 +05:30
Anton Shevtsov 41f45b475e
Check if the scheduler already has `reduce_on_plateau` (#13838)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-27 09:10:57 +00:00
Adrian Wälchli c3911700d1
Fix error handling in learning rate finder (#13845)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 04:32:39 -04:00
Rohit Gupta faf7ff57c0
Add support for async checkpointing (#13658) 2022-07-26 21:13:19 +05:30
Adrian Wälchli a8d7b4476c
Fix PyTorch spelling errors (#13774)
* Fix PyTorch spelling errors

* more
2022-07-25 12:51:16 -04:00
Justus Schock 227871982d
Merge different gpu backends with accelerator='gpu' (#13642)
* Rename GPUAccelerator to CUDAAccelerator

* Add back GPUAccelerator and deprecate it

* Remove temporary registration

* accelerator connector reroute

* accelerator_connector tests

* update enums

* lite support + tests

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* typo

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* move "gpu" support up before actual accelerator flag checks

* Stupid arguments

* fix tests

* change exception type

* fix registry test

* pre-commit

* CI: debug HPU flow (#13419)

* Update the hpu-tests.yml to pull docker from vault
* fire & sudo
* habana-gaudi-hpus
* Check the driver status on gaudi server (#13718)

Co-authored-by: arao <arao@habana.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akarsha Rao <94624926+raoakarsha@users.noreply.github.com>

* Update typing-extensions requirement from <4.2.1,>=4.0.0 to >=4.0.0,<4.3.1 in /requirements (#13529)

Update typing-extensions requirement in /requirements

Updates the requirements on [typing-extensions](https://github.com/python/typing_extensions) to permit the latest version.
- [Release notes](https://github.com/python/typing_extensions/releases)
- [Changelog](https://github.com/python/typing_extensions/blob/main/CHANGELOG.md)
- [Commits](https://github.com/python/typing_extensions/compare/4.0.0...4.3.0)

---
updated-dependencies:
- dependency-name: typing-extensions
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* [pre-commit.ci] pre-commit suggestions (#13540)

updates:
- [github.com/psf/black: 22.3.0 → 22.6.0](https://github.com/psf/black/compare/22.3.0...22.6.0)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* [FIX] Native FSDP precision + tests (#12985)

* Simplify fetching's loader types (#13111)

* Include app templates to the lightning and app packages (#13731)

* Include app templates to the package

Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>

* Fix mypy typing errors in pytorch_lightning/callbacks/model_checkpoint.py (#13617)

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Fix typos initialize in docs (#13557)


Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>

* Fix main progress bar counter when `val_check_interval=int` and `check_val_every_n_epoch=None` (#12832)

* Fix mypy errors attributed to `pytorch_lightning.loggers.tensorboard.py` (#13688)

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Fix mypy errors attributed to `pytorch_lightning.loggers.mlflow` (#13691)

Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>

* fix mypy errors for loggers/wandb.py (#13483)


Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>

* Fix gatekeeper minimum check (#13769)

* changelog

* changelog

* fix order

* move up again

* add missing test

Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: arao <arao@habana.ai>
Co-authored-by: Akarsha Rao <94624926+raoakarsha@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Lee Jungwon <33821003+BongYang@users.noreply.github.com>
Co-authored-by: Nathaniel D'Amours <88633026+NathanielDamours@users.noreply.github.com>
Co-authored-by: Justin Goheen <26209687+JustinGoheen@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Gautier Dagan <s2234411@ed.ac.uk>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-25 14:46:45 +00:00
Mauricio Villegas 1b31039c58
Update LightningCLI test for new support in latest release of jsonargparse (#13805) 2022-07-25 09:25:42 +00:00
Adrian Wälchli 81f149e9d4
Rename spawn-based launchers (#13743) 2022-07-23 11:48:15 -04:00
Adrian Wälchli fa886f2a58
Lazy import check for neptune dependency (#13477)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-23 14:06:26 +00:00
Adrian Wälchli d24978baa3
Add ddp_notebook alias for ddp_fork (#13744)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-23 09:06:35 -04:00
Jinyoung Lim ae9803137a
Add logging messages to notify when `FitLoop` stopping conditions are met (#9749)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-23 12:07:47 +00:00
Carlos Mocholí 4f53e7132f
Promote the CLI out of utilities (#13767) 2022-07-23 12:07:29 +00:00
Adrian Wälchli f6f06d4e42
Set default strategy to ddp_fork in interactive environments (#13746) 2022-07-22 19:34:30 +00:00
Carlos Mocholí 9f51c07604
Support setting the trainer reference recursively for ensembles (#13638)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-07-22 19:58:46 +02:00
Adrian Wälchli 596aa8400d
Lazy import check for wandb dependency (#13474) 2022-07-22 19:57:46 +02:00
Adrian Wälchli c3299d2c59
Add support for DDP fork (#13405)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-07-22 16:05:35 +00:00
Rohit Gupta 763fbf6b77
Fix to allow custom `CheckpointIO` with strategy classes (#13785) 2022-07-22 14:32:54 +00:00
Krishna Kalyan 238c9913a2
Do not force `sync_dist=True` on epoch end (#13364)
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-22 10:04:30 +00:00
Jerome Anand 9596fabe7b
Add auto_device_count and device name support (#13423)
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: manskx <mansy@lightning.ai>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: otaj <ota@lightning.ai>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Keiichi Kuroyanagi <kuroyanagi.keiichi@gmail.com>
Co-authored-by: Martino Sorbaro <martinosorb@users.noreply.github.com>
Co-authored-by: Wang Ran (汪然) <wangr@smail.nju.edu.cn>
Co-authored-by: Rhys Goodall <rhys.goodall@outlook.com>
Co-authored-by: Siyuan Li <siyuanli.s.c@gmail.com>
Co-authored-by: Ekagra Ranjan <ekagra.ranjan@gmail.com>
Co-authored-by: S. Kumano <54502860+s-kumano@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Gautier Dagan <gautierdagan2017@u.northwestern.edu>
Co-authored-by: Sherin Thomas <sherinct@live.com>
Co-authored-by: Cyprien Ricque <48893621+Cyprien-Ricque@users.noreply.github.com>
Co-authored-by: Masahiro Wada <argon.argon.argon@gmail.com>
Co-authored-by: nitinramvelraj <98356761+nitinramvelraj@users.noreply.github.com>
Co-authored-by: donlapark <10988155+donlapark@users.noreply.github.com>
Co-authored-by: Justin Goheen <26209687+JustinGoheen@users.noreply.github.com>
Co-authored-by: Shantam Gilra <64306405+shantam-8@users.noreply.github.com>
Co-authored-by: Bibhabasu Mohapatra <68384968+bibhabasumohapatra@users.noreply.github.com>
Co-authored-by: Jimmy Yao <jiahaoyao.math@gmail.com>
Co-authored-by: Nikhil Shenoy <nikhilshenoy98@gmail.com>
Co-authored-by: Sanjay Aradhyamath <57592361+samz5320@users.noreply.github.com>
2022-07-22 10:29:02 +05:30
Gautier Dagan 0e5312833f
fix mypy errors for loggers/wandb.py (#13483)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-21 01:07:24 +00:00
Rohit Gupta e451fa28d0
Fix main progress bar counter when `val_check_interval=int` and `check_val_every_n_epoch=None` (#12832) 2022-07-20 20:33:00 +00:00
Carlos Mocholí bbd364a041
Simplify fetching's loader types (#13111) 2022-07-20 12:15:24 +00:00
Sean Naren d78698528d
[FIX] Native FSDP precision + tests (#12985) 2022-07-20 11:32:35 +00:00
Rohit Gupta c67b075cf5
Use `global_step` while restoring logging step for old checkpoints (#13645)
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-19 18:53:22 +00:00
Justus Schock abf82b360a Add back GPUAccelerator and deprecate it 2022-07-19 13:06:30 -04:00
Justus Schock c75457da99 Rename GPUAccelerator to CUDAAccelerator 2022-07-19 13:06:30 -04:00
Carlos Mocholí 0e5a51f55c
Allow CUDA and IPU tests without the CI environment var (#13676) 2022-07-19 13:40:25 +09:00
Carlos Mocholí d058190b6d
Run standalone tests in batches (#13673) 2022-07-18 12:10:35 +00:00
George Stein 0449e861cc
Fix `trainer.predict(return_predictions=False)` does not track `batch_indices` (#13629)
* Pull request for fixing issue #13580
* chlog and test
* disable track for epoch

Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-07-18 08:26:15 +00:00
Jirka Borovec aa62fe36df
add testing PT 1.12 (#13386)
* add testing PT 1.12
* Fix quantization tests
* Fix another set of tests
* Fix check since https://github.com/pytorch/pytorch/pull/80139 is only going to be available for 1.13
* Skip this test for now for 1.12

Co-authored-by: SeanNaren <sean@grid.ai>
2022-07-15 19:41:23 +02:00
Adrian Wälchli d42711f22f
Remove deprecated `Strategy.post_dispatch` (#13461)
* Remove deprecated Strategy.post_dispatch

* changelog

* remove unused imports
2022-07-15 13:18:55 -04:00
thomas chaton 5e26840f94
Introduce ServableModuleValidator Callback (#13614)
* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* Update tests/tests_pytorch/serve/test_servable_module_validator.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Update tests/tests_pytorch/serve/test_servable_module_validator.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Update src/pytorch_lightning/serve/servable_module_validator.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Update src/pytorch_lightning/serve/servable_module_validator.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Update src/pytorch_lightning/serve/servable_module_validator.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* Typing improvements

* wip

* update doc

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update examples/pl_servable_module/production.py

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>

* update

* update

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-15 11:07:40 -04:00
Carlos Mocholí 8355ba1260
Run only CUDA tests on Azure GPU CI (#13651) 2022-07-15 13:51:23 +02:00
Akihiro Nitta 7ba0270552
Remove deprecated `max_steps=None` (#13591)
* Remove max_steps=None

* Update changelog

* Update docs

* Unused import

Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-14 12:28:38 +00:00
Akihiro Nitta c1cc112b52
Remove deprecated `LightningDistributed` (#13549)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 21:15:28 +00:00
Adrian Wälchli daf7cec01e
Remove deprecated ClusterEnvironment methods (#13458)
* Remove deprecated ClusterEnvironment methods

* update changelog

* ignore typing error

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-13 19:53:46 +00:00
Akihiro Nitta feb8e7d344
Remove deprecated `LightningModule.on_post_move_to_device` (#13548)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 18:06:27 +00:00
Adrian Wälchli 07e7d6dc3b
Remove deprecated `Trainer.slurm_job_id` (#13459) 2022-07-13 16:50:55 +00:00
Sanjay Aradhyamath 562467402d
Removed deprecated `pytorch_lightning.overrides.distributed.IndexBatchSamplerWrapper.batch_indices` (#13565)
* Removed the deprecated method

* Removed deprecated IndexBatchSamplerWrapper.batch_indices

* Update src/pytorch_lightning/CHANGELOG.md

* Missed code

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 00:53:18 +00:00
Nikhil Shenoy e034cd31d3
Remove `add_to_queue` and `remove_from_queue` from LightningModule (#13600)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-12 16:47:51 +02:00
Rohit Gupta dba65be911
Remove redundant GPU test (#13623)
Remove redundant test
2022-07-12 09:51:11 -04:00
Rohit Gupta df931e2486
Restore log step during restart (#13467)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-12 09:45:59 +00:00
Shantam Gilra bdb6e40392
Remove deprecated `pytorch_lightning.core.decorators.parameter_validation` (#13514)
* Removal of deprecated code from decorators

* Update CHANGELOG.md

* Removed imports
2022-07-11 23:03:54 +00:00
nitinramvelraj 61c28cb428
Remove deprecated `on_keyboard_interrupt` (#13438)
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-05 07:34:18 +00:00
Adrian Wälchli a80354e3ae
Move deepspeed summary test to correct folder (#13478) 2022-07-01 08:47:04 +00:00
Siyuan Li e0a0d1e4f9
Set timeout for DDPSpawnStrategy (#13383)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-29 20:55:06 -04:00
Adrian Wälchli 1f85b6d6a4
Fix validation when accelerator is a string (#13417)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-29 22:42:34 +00:00
Rhys Goodall 8c4d640bfc
Convert validation loop config warnings to `PossibleUserWarning` (#13377)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-06-29 22:34:25 +00:00
Adrian Wälchli 2dd332f9c7
Call `set_epoch` for distributed batch samplers (#13396)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-29 19:09:35 +00:00
Adrian Wälchli 43635a9a9b
Remove remaining old-style AcceleratorConnector properties (#13412)
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-29 18:51:48 +00:00
ananthsub 7fca126749
Update gather_all_tensors to handle tensors of different sizes (#12630)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-06-29 17:03:00 +00:00
Adrian Wälchli ddbf95516b
Remove support for DDP2 strategy (#12705)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-29 15:06:51 +00:00
Sean Naren f145acd2a3
Add model summary when using DeepSpeed Stage 3 (#13427) 2022-06-29 14:49:34 +00:00
Adrian Wälchli c71f32a490
Rename old references to training type plugin in tests (#13421) 2022-06-28 14:57:44 -04:00
Carlos Mocholí b1e38bfd79
Better errors for logging corner cases (#13164) 2022-06-28 16:59:31 +01:00
Carlos Mocholí a4750100cf
[CLI] Support custom trainers without callbacks (#13138) 2022-06-28 17:39:17 +02:00
Sean Naren 54f2d44fb8
Remove unnecessary endpoint logic, rename `collaborative` to `hivemind` (#13392)
* Remove endpoint after collaborate app/dht CLI

* Fix references, change filename

* Add CHANGELOG.md

* Address review

Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-28 08:41:08 -04:00
Jirka Borovec d2e4e7e003
create meta package [RFC] (#13327)
* placeholder

* move setup_tools & abstract about

* adjust lightning-app

* notes

* lightning about

* lightning init

* CI check

* ci

* install

* adjust manifest & mv chlog

* manifest

* pkg

* mv __setup__

* parse_requirements

* lit

* ci - pytorch

* wrap func

* ci

* cd draft

* generate lit

* pkg

* utf-8

* root pkg

* req.

* ver

* mypy

* try check

* meta pkg

* meta pkg - vars

* meta pkg - pruning

* meta pkg - fixing

* fix PL for meta

* multi-line wrapper

* hack manifest

* ci

* fix docstr

* fixing

* ci & mypy

* links
2022-06-27 09:34:18 -04:00
Justus Schock f54abc506f
Merge pull request #13123 from Lightning-AI/mps_accelerator
MPS Accelerator
2022-06-24 08:15:48 -04:00
Sean Naren 73e7a5d0c2
Rename `CollaborativeStrategy` to `HivemindStrategy` (#13388) 2022-06-23 15:44:48 +00:00
awaelchli 511f1a6515 Reroute profiler to profilers (#12308)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 20:55:39 -04:00
awaelchli fc1559e41c Rename profiler to profilers (#12308)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 20:55:39 -04:00
Patrick Haller 887dc0ff8c
DummyLogger can be called with unknown methods (#13224)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-22 17:51:42 +02:00
Atharva Phatak 63a9ab4ae2
Improved Deepspeed Imports (#13223)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-22 11:09:33 -04:00
otaj 33bd270845
Adds Sampler Wrappers for custom samplers in distributed environment (#12959)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-06-22 12:17:53 +02:00
Ray Schireman 8266300b29
Remove pytorch_lightning.callbacks.lr_monitor.learning_rate_monitor.lr_sch_names (#13353)
Co-authored-by: Raymond G Schireman <raymond.schireman@uvm.edu>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 02:03:17 +02:00
otaj 2e9cd72add
Improve support for custom `DataLoader`s when instantiated in `*_dataloader` hook (#12981)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 01:53:24 +02:00
Mauricio Villegas 6371d7c615
Fix LightningCLI signature parameter resolving for some lightning classes (#13283)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 21:58:41 +00:00
Sean Naren 89e2e69b01
[BUG] `estimated_stepping_batches` requires distributed comms in `configure_optimizers` for `DeepSpeedStrategy` (#13350) 2022-06-21 17:48:27 +01:00
Tianshu Wang 749709fb4f
Use run name for logging with WandbLogger (#12604)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-21 15:25:37 +00:00
Mauricio Villegas 0ae9627bf8
Deprecate CLI registries and update documentation (#13221)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 17:12:04 +02:00
Siyuan Li c600f987c2
Enable timeout for `DDPStrategy` (#13244) 2022-06-21 15:49:57 +02:00
Ekagra Ranjan 81b7000978
EarlyStopping logging on rank 0 only (#13233)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 09:37:41 -04:00
Adam J. Stewart d24178ec29
Fix torch.distributed._sharded_tensor DeprecationWarning (#13261) 2022-06-21 04:52:06 -04:00
Jerome Anand cd44512ab9
Added multi-optimizer tests with hpu (#13217)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-06-21 09:07:31 +02:00
Jirka Borovec ab59f308b1
Future 4/n: test & legacy in test/ folder (#13295)
* move: legacy >> test/

* move: tests >> test/

* rename unittests

* update CI

* tests4pl

* tests_pytorch

* proxi

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* ci

* link

* cli

* standalone

* fixing

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* .

* Apply suggestions from code review

Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* alone

* test -> tests

* Standalone fixes

* ci

* Update

* More fixes

* Fix coverage

* Fix mypy

* mypy

* Empty-Commit

* Fix

* mypy just for pl

* Fix standalone

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-15 18:10:49 -04:00