Adrian Wälchli
4008f9cd41
Convert subprocess test to standalone test ( #14101 )
2022-08-10 17:15:12 -04:00
otaj
f132d44821
Fix a bug that caused spurious `AttributeError` when multiple `DataLoader` classes are imported ( #14117 )
2022-08-10 16:09:50 +00:00
Carlos Mocholí
9b61b1c482
Remove duplicated test classes ( #14122 )
...
Remove duplicated classes
2022-08-10 17:21:05 +02:00
Adrian Wälchli
dc8ff5ed26
Fix device placement when `.cuda()` called without specifying index ( #14128 )
2022-08-10 05:23:20 -04:00
Adam Reeve
975a4fc2f1
Support checkpoint save and load with Stochastic Weight Averaging ( #9938 )
...
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Kushashwa Ravi Shrimali <kushashwaravishrimali@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-09 23:18:21 +00:00
Adrian Wälchli
06c255c5c1
Skip ddp fork tests on windows ( #14121 )
2022-08-09 22:54:10 +00:00
Carlos Mocholí
d85085479d
Reset all results on epoch end ( #14061 )
2022-08-09 23:01:11 +05:30
Rohit Gupta
ac369f5570
Fix incorrect `precision="mixed"` being used with `DeepSpeedStrategy` and `IPUStrategy` ( #14041 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-08-09 21:25:23 +05:30
Anton Shevtsov
c55fe7105b
Prefix seed_everything log messages with rank info ( #14031 )
...
Co-authored-by: Anton Shevtsov <aeshevtsov@avito.ru>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-08-09 15:40:30 +02:00
Adrian Wälchli
0cfc53d6b4
Fix regression on default value for `find_unused_parameters` ( #14095 )
2022-08-09 13:56:02 +05:30
Carlos Mocholí
d072e4451a
Fix dtype inference during gradient norm computation ( #14051 )
2022-08-08 11:35:06 +00:00
Carlos Mocholí
aaeff90254
Remove deprecated `DistributedType` and `DeviceType` enum classes ( #14045 )
2022-08-08 10:07:54 +02:00
Rohit Gupta
b25275ccc2
Cast to fp16 before moving to device with deepspeed ( #14000 )
2022-08-05 22:15:15 +00:00
Carlos Mocholí
91dd6a68fb
Remove meta device utilities in favor of torchdistx ( #13868 )
2022-08-05 12:20:27 +00:00
Adrian Wälchli
3d5c3d24f9
Remove unused auto_collect_arguments class method ( #14015 )
2022-08-05 08:49:00 +00:00
Rohit Gupta
a4e4cab7a6
Deprecate `amp_level` from `Trainer` ( #13898 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-05 08:31:19 +00:00
Carlos Mocholí
b88b700745
Remove the deprecated DDP2 strategy ( #14026 )
2022-08-04 20:27:35 +00:00
Rohit Gupta
f5bd6e6f5f
Cast only floating types with IPUs ( #13983 )
2022-08-04 19:46:07 +00:00
Adrian Wälchli
ef0623ec64
Remove deprecated training type plugins ( #14011 )
...
* Remove deprecated training type plugins
* update changelog
* DDP2Plugin
* Update src/pytorch_lightning/CHANGELOG.md
2022-08-04 18:00:00 +02:00
Rohit Gupta
e78bf2044b
Raise an error if batch transfer hooks are overridden with IPUAccelerator ( #13961 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-08-04 12:04:42 +00:00
Adam J. Stewart
d748dae548
Fix erroneous warning for unset `max_epochs` ( #13262 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-03 19:17:21 +00:00
Adrian Wälchli
e6a8283e9c
Organize accelerator tests ( #13986 )
2022-08-03 13:49:55 +00:00
Adrian Wälchli
4ce97f37a2
Validate the model input of trainer methods ( #13892 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-03 13:38:42 +00:00
Adrian Wälchli
ce025bf954
Lazy import check for hydra dependency ( #13812 )
2022-08-03 04:27:16 -04:00
Jerome Anand
b3203d93d0
Added support for HPU device stats monitor ( #13819 )
...
* Added support for HPU device stats monitor
Signed-off-by: Jerome <janand@habana.ai>
* Update changelog
Signed-off-by: Jerome <janand@habana.ai>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Apply suggestions from code review
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
* Update reference
Signed-off-by: Jerome <janand@habana.ai>
* Apply suggestions from code review
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
* fix alignment
* add descriptions
* Update hpu_intermediate.rst
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-08-02 13:31:31 +05:30
Adrian Wälchli
eb233ea12d
Snapshot selected globals and restore them in spawned process ( #13921 )
...
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-08-01 22:21:46 +00:00
Rohit Gupta
0f6caffa57
Fix deepspeed default precision plugin `amp_level` to O2 ( #13897 )
...
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-29 20:36:51 +00:00
Adrian Wälchli
caaf35689c
Improvements to standalone scripts ( #13840 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-28 23:33:22 +00:00
HMellor
07b39c257b
Cast on host instead of IPU when using `precision=16` ( #13880 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-28 19:26:41 +00:00
Adrian Wälchli
25203d4c81
Organize model summary utilities ( #13893 )
2022-07-28 19:23:29 +02:00
Carlos Mocholí
406cea7146
Support DeepSpeed <0.7.0 ( #13859 )
...
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-07-28 14:38:51 +00:00
Carlos Mocholí
1299e4f984
Run GPU tests with PyTorch 1.12 ( #13716 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-07-28 19:37:57 +05:30
Carlos Mocholí
511875e567
Support DeepSpeed >=0.6.0, <0.6.5 ( #13863 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-27 18:57:52 +02:00
Adrian Wälchli
fff62f0ae5
Fix TPU testing and collect all tests ( #11098 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
2022-07-27 15:40:40 +00:00
otaj
95f5f170f5
Allowed custom `BatchSampler`s when instantiated in `*_dataloader` hook ( #13640 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-27 15:32:50 +00:00
Adrian Wälchli
2a24b906ac
Add batch size script argument for standalone tests ( #13841 )
...
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-07-27 12:36:22 +00:00
otaj
4c7b9f0b11
Disallow batch sampler with multiple IPU devices ( #13854 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 15:20:43 +05:30
Anton Shevtsov
41f45b475e
Check if the scheduler already has `reduce_on_plateau` ( #13838 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-27 09:10:57 +00:00
Adrian Wälchli
c3911700d1
Fix error handling in learning rate finder ( #13845 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-27 04:32:39 -04:00
Rohit Gupta
faf7ff57c0
Add support for async checkpointing ( #13658 )
2022-07-26 21:13:19 +05:30
Adrian Wälchli
a8d7b4476c
Fix PyTorch spelling errors ( #13774 )
...
* Fix PyTorch spelling errors
* more
2022-07-25 12:51:16 -04:00
Justus Schock
227871982d
Merge different gpu backends with accelerator='gpu' ( #13642 )
...
* Rename GPUAccelerator to CUDAAccelerator
* Add back GPUAccelerator and deprecate it
* Remove temporary registration
* accelerator connector reroute
* accelerator_connector tests
* update enums
* lite support + tests
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* typo
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* move "gpu" support up before actual accelerator flag checks
* Stupid arguments
* fix tests
* change exception type
* fix registry test
* pre-commit
* CI: debug HPU flow (#13419 )
* Update the hpu-tests.yml to pull docker from vault
* fire & sudo
* habana-gaudi-hpus
* Check the driver status on gaudi server (#13718 )
Co-authored-by: arao <arao@habana.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akarsha Rao <94624926+raoakarsha@users.noreply.github.com>
* Update typing-extensions requirement from <4.2.1,>=4.0.0 to >=4.0.0,<4.3.1 in /requirements (#13529 )
Update typing-extensions requirement in /requirements
Updates the requirements on [typing-extensions](https://github.com/python/typing_extensions ) to permit the latest version.
- [Release notes](https://github.com/python/typing_extensions/releases )
- [Changelog](https://github.com/python/typing_extensions/blob/main/CHANGELOG.md )
- [Commits](https://github.com/python/typing_extensions/compare/4.0.0...4.3.0 )
---
updated-dependencies:
- dependency-name: typing-extensions
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* [pre-commit.ci] pre-commit suggestions (#13540 )
updates:
- [github.com/psf/black: 22.3.0 → 22.6.0](https://github.com/psf/black/compare/22.3.0...22.6.0 )
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [FIX] Native FSDP precision + tests (#12985 )
* Simplify fetching's loader types (#13111 )
* Include app templates to the lightning and app packages (#13731 )
* Include app templates to the package
Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
* Fix mypy typing errors in pytorch_lightning/callbacks/model_checkpoint.py (#13617 )
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Fix typos initialize in docs (#13557 )
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
* Fix main progress bar counter when `val_check_interval=int` and `check_val_every_n_epoch=None` (#12832 )
* Fix mypy errors attributed to `pytorch_lightning.loggers.tensorboard.py` (#13688 )
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Fix mypy errors attributed to `pytorch_lightning.loggers.mlflow` (#13691 )
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
* fix mypy errors for loggers/wandb.py (#13483 )
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
* Fix gatekeeper minimum check (#13769 )
* changelog
* changelog
* fix order
* move up again
* add missing test
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: arao <arao@habana.ai>
Co-authored-by: Akarsha Rao <94624926+raoakarsha@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Lee Jungwon <33821003+BongYang@users.noreply.github.com>
Co-authored-by: Nathaniel D'Amours <88633026+NathanielDamours@users.noreply.github.com>
Co-authored-by: Justin Goheen <26209687+JustinGoheen@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Gautier Dagan <s2234411@ed.ac.uk>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-25 14:46:45 +00:00
Mauricio Villegas
1b31039c58
Update LightningCLI test for new support in latest release of jsonargparse ( #13805 )
2022-07-25 09:25:42 +00:00
Adrian Wälchli
81f149e9d4
Rename spawn-based launchers ( #13743 )
2022-07-23 11:48:15 -04:00
Adrian Wälchli
fa886f2a58
Lazy import check for neptune dependency ( #13477 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-23 14:06:26 +00:00
Adrian Wälchli
d24978baa3
Add ddp_notebook alias for ddp_fork ( #13744 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-07-23 09:06:35 -04:00
Jinyoung Lim
ae9803137a
Add logging messages to notify when `FitLoop` stopping conditions are met ( #9749 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-23 12:07:47 +00:00
Carlos Mocholí
4f53e7132f
Promote the CLI out of utilities ( #13767 )
2022-07-23 12:07:29 +00:00
Adrian Wälchli
f6f06d4e42
Set default strategy to ddp_fork in interactive environments ( #13746 )
2022-07-22 19:34:30 +00:00
Carlos Mocholí
9f51c07604
Support setting the trainer reference recursively for ensembles ( #13638 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
2022-07-22 19:58:46 +02:00
Adrian Wälchli
596aa8400d
Lazy import check for wandb dependency ( #13474 )
2022-07-22 19:57:46 +02:00
Adrian Wälchli
c3299d2c59
Add support for DDP fork ( #13405 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-07-22 16:05:35 +00:00
Rohit Gupta
763fbf6b77
Fix to allow custom `CheckpointIO` with strategy classes ( #13785 )
2022-07-22 14:32:54 +00:00
Krishna Kalyan
238c9913a2
Do not force `sync_dist=True` on epoch end ( #13364 )
...
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-22 10:04:30 +00:00
Jerome Anand
9596fabe7b
Add auto_device_count and device name support ( #13423 )
...
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: mansy <mansy@lightning.ai>
Co-authored-by: manskx <mansy@lightning.ai>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Mansy <ahmed.mansy156@gmail.com>
Co-authored-by: otaj <ota@lightning.ai>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Keiichi Kuroyanagi <kuroyanagi.keiichi@gmail.com>
Co-authored-by: Martino Sorbaro <martinosorb@users.noreply.github.com>
Co-authored-by: Wang Ran (汪然) <wangr@smail.nju.edu.cn>
Co-authored-by: Rhys Goodall <rhys.goodall@outlook.com>
Co-authored-by: Siyuan Li <siyuanli.s.c@gmail.com>
Co-authored-by: Ekagra Ranjan <ekagra.ranjan@gmail.com>
Co-authored-by: S. Kumano <54502860+s-kumano@users.noreply.github.com>
Co-authored-by: otaj <6065855+otaj@users.noreply.github.com>
Co-authored-by: Gautier Dagan <gautierdagan2017@u.northwestern.edu>
Co-authored-by: Sherin Thomas <sherinct@live.com>
Co-authored-by: Cyprien Ricque <48893621+Cyprien-Ricque@users.noreply.github.com>
Co-authored-by: Masahiro Wada <argon.argon.argon@gmail.com>
Co-authored-by: nitinramvelraj <98356761+nitinramvelraj@users.noreply.github.com>
Co-authored-by: donlapark <10988155+donlapark@users.noreply.github.com>
Co-authored-by: Justin Goheen <26209687+JustinGoheen@users.noreply.github.com>
Co-authored-by: Shantam Gilra <64306405+shantam-8@users.noreply.github.com>
Co-authored-by: Bibhabasu Mohapatra <68384968+bibhabasumohapatra@users.noreply.github.com>
Co-authored-by: Jimmy Yao <jiahaoyao.math@gmail.com>
Co-authored-by: Nikhil Shenoy <nikhilshenoy98@gmail.com>
Co-authored-by: Sanjay Aradhyamath <57592361+samz5320@users.noreply.github.com>
2022-07-22 10:29:02 +05:30
Gautier Dagan
0e5312833f
fix mypy errors for loggers/wandb.py ( #13483 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-21 01:07:24 +00:00
Rohit Gupta
e451fa28d0
Fix main progress bar counter when `val_check_interval=int` and `check_val_every_n_epoch=None` ( #12832 )
2022-07-20 20:33:00 +00:00
Carlos Mocholí
bbd364a041
Simplify fetching's loader types ( #13111 )
2022-07-20 12:15:24 +00:00
Sean Naren
d78698528d
[FIX] Native FSDP precision + tests ( #12985 )
2022-07-20 11:32:35 +00:00
Rohit Gupta
c67b075cf5
Use `global_step` while restoring logging step for old checkpoints ( #13645 )
...
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-19 18:53:22 +00:00
Justus Schock
abf82b360a
Add back GPUAccelerator and deprecate it
2022-07-19 13:06:30 -04:00
Justus Schock
c75457da99
Rename GPUAccelerator to CUDAAccelerator
2022-07-19 13:06:30 -04:00
Carlos Mocholí
0e5a51f55c
Allow CUDA and IPU tests without the CI environment var ( #13676 )
2022-07-19 13:40:25 +09:00
Carlos Mocholí
d058190b6d
Run standalone tests in batches ( #13673 )
2022-07-18 12:10:35 +00:00
George Stein
0449e861cc
Fix `trainer.predict(return_predictions=False)` does not track `batch_indices` ( #13629 )
...
* Pull request for fixing issue #13580
* chlog and test
* disable track for epoch
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
2022-07-18 08:26:15 +00:00
Jirka Borovec
aa62fe36df
add testing PT 1.12 ( #13386 )
...
* add testing PT 1.12
* Fix quantization tests
* Fix another set of tests
* Fix check since https://github.com/pytorch/pytorch/pull/80139 is only going to be available for 1.13
* Skip this test for now for 1.12
Co-authored-by: SeanNaren <sean@grid.ai>
2022-07-15 19:41:23 +02:00
Adrian Wälchli
d42711f22f
Remove deprecated `Strategy.post_dispatch` ( #13461 )
...
* Remove deprecated Strategy.post_dispatch
* changelog
* remove unused imports
2022-07-15 13:18:55 -04:00
thomas chaton
5e26840f94
Introduce ServableModuleValidator Callback ( #13614 )
...
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* wip
* Update tests/tests_pytorch/serve/test_servable_module_validator.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Update tests/tests_pytorch/serve/test_servable_module_validator.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Update src/pytorch_lightning/serve/servable_module_validator.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Update src/pytorch_lightning/serve/servable_module_validator.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Update src/pytorch_lightning/serve/servable_module_validator.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Typing improvements
* wip
* update doc
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update examples/pl_servable_module/production.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* update
* update
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-15 11:07:40 -04:00
Carlos Mocholí
8355ba1260
Run only CUDA tests on Azure GPU CI ( #13651 )
2022-07-15 13:51:23 +02:00
Akihiro Nitta
7ba0270552
Remove deprecated `max_steps=None` ( #13591 )
...
* Remove max_steps=None
* Update changelog
* Update docs
* Unused import
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-14 12:28:38 +00:00
Akihiro Nitta
c1cc112b52
Remove deprecated `LightningDistributed` ( #13549 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 21:15:28 +00:00
Adrian Wälchli
daf7cec01e
Remove deprecated ClusterEnvironment methods ( #13458 )
...
* Remove deprecated ClusterEnvironment methods
* update changelog
* ignore typing error
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-07-13 19:53:46 +00:00
Akihiro Nitta
feb8e7d344
Remove deprecated `LightningModule.on_post_move_to_device` ( #13548 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 18:06:27 +00:00
Adrian Wälchli
07e7d6dc3b
Remove deprecated `Trainer.slurm_job_id` ( #13459 )
2022-07-13 16:50:55 +00:00
Sanjay Aradhyamath
562467402d
Removed deprecated `pytorch_lightning.overrides.distributed.IndexBatchSamplerWrapper.batch_indices` ( #13565 )
...
* Removed the deprecated method
* Removed deprecated IndexBatchSamplerWrapper.batch_indices
* Update src/pytorch_lightning/CHANGELOG.md
* Missed code
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-13 00:53:18 +00:00
Nikhil Shenoy
e034cd31d3
Remove `add_to_queue` and `remove_from_queue` from LightningModule ( #13600 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-12 16:47:51 +02:00
Rohit Gupta
dba65be911
Remove redundant GPU test ( #13623 )
...
Remove redundant test
2022-07-12 09:51:11 -04:00
Rohit Gupta
df931e2486
Restore log step during restart ( #13467 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-07-12 09:45:59 +00:00
Shantam Gilra
bdb6e40392
Remove deprecated `pytorch_lightning.core.decorators.parameter_validation` ( #13514 )
...
* Removal of deprecated code from decorators
* Update CHANGELOG.md
* Removed imports
2022-07-11 23:03:54 +00:00
nitinramvelraj
61c28cb428
Remove deprecated `on_keyboard_interrupt` ( #13438 )
...
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-07-05 07:34:18 +00:00
Adrian Wälchli
a80354e3ae
Move deepspeed summary test to correct folder ( #13478 )
2022-07-01 08:47:04 +00:00
Siyuan Li
e0a0d1e4f9
Set timeout for DDPSpawnStrategy ( #13383 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-29 20:55:06 -04:00
Adrian Wälchli
1f85b6d6a4
Fix validation when accelerator is a string ( #13417 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-29 22:42:34 +00:00
Rhys Goodall
8c4d640bfc
Convert validation loop config warnings to `PossibleUserWarning` ( #13377 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-06-29 22:34:25 +00:00
Adrian Wälchli
2dd332f9c7
Call `set_epoch` for distributed batch samplers ( #13396 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-29 19:09:35 +00:00
Adrian Wälchli
43635a9a9b
Remove remaining old-style AcceleratorConnector properties ( #13412 )
...
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-29 18:51:48 +00:00
ananthsub
7fca126749
Update gather_all_tensors to handle tensors of different sizes ( #12630 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: awaelchli <aedu.waelchli@gmail.com>
2022-06-29 17:03:00 +00:00
Adrian Wälchli
ddbf95516b
Remove support for DDP2 strategy ( #12705 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-29 15:06:51 +00:00
Sean Naren
f145acd2a3
Add model summary when using DeepSpeed Stage 3 ( #13427 )
2022-06-29 14:49:34 +00:00
Adrian Wälchli
c71f32a490
Rename old references to training type plugin in tests ( #13421 )
2022-06-28 14:57:44 -04:00
Carlos Mocholí
b1e38bfd79
Better errors for logging corner cases ( #13164 )
2022-06-28 16:59:31 +01:00
Carlos Mocholí
a4750100cf
[CLI] Support custom trainers without callbacks ( #13138 )
2022-06-28 17:39:17 +02:00
Sean Naren
54f2d44fb8
Remove unnecessary endpoint logic, rename `collaborative` to `hivemind` ( #13392 )
...
* Remove endpoint after collaborate app/dht CLI
* Fix references, change filename
* Add CHANGELOG.md
* Address review
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-28 08:41:08 -04:00
Jirka Borovec
d2e4e7e003
create meta package [RFC] ( #13327 )
...
* placeholder
* move setup_tools & abstract about
* adjust lightning-app
* notes
* lightning about
* lightning init
* CI check
* ci
* install
* adjust manifest & mv chlog
* manifest
* pkg
* mv __setup__
* parse_requirements
* lit
* ci - pytorch
* wrap func
* ci
* cd draft
* generate lit
* pkg
* utf-8
* root pkg
* req.
* ver
* mypy
* try check
* meta pkg
* meta pkg - vars
* meta pkg - pruning
* meta pkg - fixing
* fix PL for meta
* multi-line wrapper
* hack manifest
* ci
* fix docstr
* fixing
* ci & mypy
* links
2022-06-27 09:34:18 -04:00
Justus Schock
f54abc506f
Merge pull request #13123 from Lightning-AI/mps_accelerator
...
MPS Accelerator
2022-06-24 08:15:48 -04:00
Sean Naren
73e7a5d0c2
Rename `CollaborativeStrategy` to `HivemindStrategy` ( #13388 )
2022-06-23 15:44:48 +00:00
awaelchli
511f1a6515
Reroute profiler to profilers ( #12308 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 20:55:39 -04:00
awaelchli
fc1559e41c
Rename profiler to profilers ( #12308 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 20:55:39 -04:00
Patrick Haller
887dc0ff8c
DummyLogger can be called with unknown methods ( #13224 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-06-22 17:51:42 +02:00
Atharva Phatak
63a9ab4ae2
Improved DeepSpeed Imports ( #13223 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2022-06-22 11:09:33 -04:00
otaj
33bd270845
Adds Sampler Wrappers for custom samplers in distributed environment ( #12959 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
2022-06-22 12:17:53 +02:00
Ray Schireman
8266300b29
Remove `pytorch_lightning.callbacks.lr_monitor.LearningRateMonitor.lr_sch_names` ( #13353 )
...
Co-authored-by: Raymond G Schireman <raymond.schireman@uvm.edu>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 02:03:17 +02:00
otaj
2e9cd72add
Improve support for custom `DataLoader`s when instantiated in `*_dataloader` hook ( #12981 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-22 01:53:24 +02:00
Mauricio Villegas
6371d7c615
Fix LightningCLI signature parameter resolving for some lightning classes ( #13283 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 21:58:41 +00:00
Sean Naren
89e2e69b01
[BUG] `estimated_stepping_batches` requires distributed comms in `configure_optimizers` for `DeepSpeedStrategy` ( #13350 )
2022-06-21 17:48:27 +01:00
Tianshu Wang
749709fb4f
Use run name for logging with WandbLogger ( #12604 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
2022-06-21 15:25:37 +00:00
Mauricio Villegas
0ae9627bf8
Deprecate CLI registries and update documentation ( #13221 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 17:12:04 +02:00
Siyuan Li
c600f987c2
Enable timeout for `DDPStrategy` ( #13244 )
2022-06-21 15:49:57 +02:00
Ekagra Ranjan
81b7000978
EarlyStopping logging on rank 0 only ( #13233 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-21 09:37:41 -04:00
Adam J. Stewart
d24178ec29
Fix torch.distributed._sharded_tensor DeprecationWarning ( #13261 )
2022-06-21 04:52:06 -04:00
Jerome Anand
cd44512ab9
Added multi-optimizer tests with hpu ( #13217 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-06-21 09:07:31 +02:00
Jirka Borovec
ab59f308b1
Future 4/n: test & legacy in test/ folder ( #13295 )
...
* move: legacy >> test/
* move: tests >> test/
* rename unittests
* update CI
* tests4pl
* tests_pytorch
* proxi
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* ci
* link
* cli
* standalone
* fixing
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* .
* Apply suggestions from code review
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* alone
* test -> tests
* Standalone fixes
* ci
* Update
* More fixes
* Fix coverage
* Fix mypy
* mypy
* Empty-Commit
* Fix
* mypy just for pl
* Fix standalone
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-06-15 18:10:49 -04:00