lightning/pytorch_lightning/trainer
shuyingsunshine21 8538c1f61e
Accelerator model state dict (#7474)
* Fix some test errors
Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* checkpoint consolidation

* Update ddp_spawn.py

* Update test_metric_result_integration.py

* Update test_results.py

* Update utils.py

* Update utils.py

* Update test_all_gather_grad.py

* Update test_all_gather_grad.py

* Update test_results.py

* Revert "Update test_results.py"

This reverts commit 9d4a2b891d.

* Revert "Merge pull request #1 from shuyingsunshine21/shuyingsunshine21-checkpoint_consolidate"

This reverts commit c5053da789, reversing
changes made to 0d23d75bc9.

* Revert "Update test_all_gather_grad.py"

This reverts commit 0d23d75bc9.

* Revert "Update utils.py"

This reverts commit 70fe5da9c6.

* Revert "Update utils.py"

This reverts commit a9aae99f6e.

* Revert "Update test_results.py"

This reverts commit ea74906878.

* Revert "Update test_metric_result_integration.py"

This reverts commit bf70e431b3.

* Revert "Update ddp_spawn.py"

This reverts commit f17210183b.

* Revert "checkpoint consolidation"

This reverts commit 536c1323b0.

* Revert "Revert "checkpoint consolidation""

This reverts commit 3a9fde915a.

* Revert "Revert "Revert "checkpoint consolidation"""

This reverts commit 7a369f47e1.

* Revert "Revert "Update ddp_spawn.py""

This reverts commit 8222dc98ea.

* Revert "Revert "Update test_metric_result_integration.py""

This reverts commit 6c095b2370.

* Revert "Revert "Update test_results.py""

This reverts commit 250d0aaaa2.

* Revert "Revert "Update utils.py""

This reverts commit 8651d54d79.

* Revert "Revert "Update test_all_gather_grad.py""

This reverts commit dcdcd29731.

* modify distributed environment to make test pass

* modify model state dict to training type plugin

* remove changes

* add changelog

* fixing isort for pre-commit failure

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Address code review

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: SeanNaren <sean@grid.ai>
2021-05-11 16:39:04 +01:00
..
connectors Accelerator model state dict (#7474) 2021-05-11 16:39:04 +01:00
__init__.py added trainer api docs (#4569) 2020-11-07 14:18:45 -05:00
callback_hook.py [2/2] Remove outputs from evaluation epoch end hooks (#7338) 2021-05-05 19:50:58 +00:00
configuration_validator.py `TrainerState` refactor [5/5] (#7173) 2021-05-04 12:50:56 +02:00
data_loading.py `TrainerState` refactor [5/5] (#7173) 2021-05-04 12:50:56 +02:00
deprecated_api.py Simplify deprecations (#6620) 2021-03-25 15:26:38 +01:00
evaluation_loop.py Improve val step logging (#7351) 2021-05-07 22:58:03 +00:00
logging.py Move metrics_to_scalars to a dedicated utilities file (#7180) 2021-04-24 10:25:33 +01:00
model_hooks.py Deprecate TrainerModelHooksMixin (#7422) 2021-05-07 13:19:36 -07:00
optimizers.py Update LR schedulers only when their corresponding Optimizer is being… (#4868) 2021-05-04 09:37:40 +00:00
predict_loop.py Code cleaning in preparation for #7258 [3/n] (#7262) 2021-04-29 14:40:51 +02:00
properties.py remove trainer hidden state | sanity refactor [1 / n] (#7437) 2021-05-11 11:09:08 +02:00
states.py `TrainerState` refactor [5/5] (#7173) 2021-05-04 12:50:56 +02:00
supporters.py Bugfix/Multiple dataloaders (#7433) 2021-05-11 16:33:29 +02:00
trainer.py Mark certain Trainer APIs as protected (#7420) 2021-05-11 11:53:41 +02:00
training_loop.py Mark certain Trainer APIs as protected (#7420) 2021-05-11 11:53:41 +02:00
training_tricks.py Use `torch.nn.utils.clip_grad_norm_` and add `clip_grad_by_value` support for TPU (#7025) 2021-05-07 16:41:39 +00:00