Kaushik B
|
4f19a4dabc
|
Update Trainer flag docs for strategy (#10042)
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
|
2021-10-20 21:02:21 +05:30 |
kingyiusuen
|
2ed92ecabb
|
Rerun flaky profiler tests on failure (#10035)
|
2021-10-20 18:57:04 +05:30 |
Carlos Mocholí
|
f0b3e0f4de
|
Default to `precision=bf16` on CPU when `precision=16` is passed (#10033)
|
2021-10-20 13:25:13 +00:00 |
Adrian Wälchli
|
2c16f1d6b9
|
remove dataloader patching on the LightningModule (#9764)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: rohitgr7 <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
|
2021-10-20 15:23:20 +02:00 |
Rohit Gupta
|
6701526478
|
Add typing to callbacks (#10001)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-20 10:22:03 +00:00 |
jjenniferdai
|
f0e6f1b58a
|
Support serialized checkpoint loading (#9605)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-20 09:38:35 +01:00 |
Carlos Mocholí
|
53c62f63e8
|
Constrain IPU precision choices (#10030)
|
2021-10-20 00:52:01 +00:00 |
Carlos Mocholí
|
ad8d6c83da
|
[CLI] Shorthand notation to instantiate datamodules (#10011)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-20 00:49:48 +00:00 |
Adrian Wälchli
|
e0c83ee6df
|
Update `TPUSpawnPlugin` spawn methods (#10022)
|
2021-10-20 01:59:11 +02:00 |
Carlos Mocholí
|
e44921ee21
|
Fix `self.log(on_epoch=True, reduce_fx=sum)` on_batch_start (#9791)
|
2021-10-20 01:56:37 +02:00 |
Carlos Mocholí
|
d45897d522
|
Rename `TPUHalfPrecisionPlugin` to `TPUBf16PrecisionPlugin` (#10026)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-19 21:09:37 +00:00 |
Ning
|
0b68f2abf8
|
Remove `reset_train_val_dataloaders` from Trainer and move data reloading logic to loop (#9671)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
|
2021-10-19 21:45:52 +02:00 |
Adrian Wälchli
|
3ea534754e
|
Update setup logic in training type plugins (deepspeed) [2 / n] (#10009)
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2021-10-19 18:23:11 +00:00 |
Carlos Mocholí
|
e8beceb631
|
Add `TPUPrecisionPlugin` (#10020)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-19 17:48:57 +00:00 |
Adrian Wälchli
|
4aaca17fce
|
Update setup logic in training type plugins (data-parallel) [3 / n] (#10010)
Co-authored-by: thomas chaton <thomas@grid.ai>
|
2021-10-19 19:47:36 +02:00 |
Adrian Wälchli
|
854bdc042d
|
Update setup logic in training type plugins [1 / n] (#9994)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2021-10-19 17:45:36 +02:00 |
Carlos Mocholí
|
e95f9b71c1
|
Set the optimization output result class as a class attribute (#9977)
|
2021-10-19 16:33:08 +01:00 |
Rohit Gupta
|
eb7f2e0b90
|
Update kfold example to avoid ci failures (#10019)
|
2021-10-19 17:09:42 +02:00 |
thomas chaton
|
1759403c8d
|
Add check for callable with datamodule len (#10003)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
|
2021-10-19 14:51:08 +00:00 |
Danielle Pintz
|
f4083d58d9
|
Fix `_compare_version` and add `_TORCH_GREATER_EQUAL_DEV_1_10` (#9960)
|
2021-10-19 14:42:46 +00:00 |
Adrian Wälchli
|
bcb94de90e
|
Add `DDPSpawnPlugin.spawn()` (#10018)
|
2021-10-19 14:34:47 +00:00 |
Rohit Gupta
|
0aa220b46b
|
Remove deprecated `distributed_backend` from `Trainer` (#10017)
* rm distributed_backend from Trainer
* unused
* chlog
* internal distributed_backend
* Docstring
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
|
2021-10-19 13:54:37 +00:00 |
Carlos Mocholí
|
bb2dc68792
|
Simplify track grad norm condition (#9992)
|
2021-10-19 15:00:16 +02:00 |
Adrian Wälchli
|
321502fe31
|
Update backward hook for `PrecisionPlugin` (#10008)
Co-authored-by: thomas chaton <thomas@grid.ai>
|
2021-10-19 10:51:45 +00:00 |
Aki Nitta
|
8f14e77d76
|
Make pytest not run .github/* (#10012)
|
2021-10-19 09:57:29 +02:00 |
Danielle Pintz
|
83ce1bf515
|
Make `verify_loop_configurations` a utility function (#9976)
|
2021-10-18 23:52:45 +00:00 |
Danielle Pintz
|
203737bfce
|
Don't raise DeprecationWarning for `LoggerConnector.gpus_metrics` (#9959)
|
2021-10-18 22:51:09 +00:00 |
Adrian Wälchli
|
65150cdb42
|
Update docs for base Loop class with examples (#9993)
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
|
2021-10-18 15:37:23 +00:00 |
Carlos Mocholí
|
efc8727314
|
Add typing for `LightningOptimizer` (#9990)
|
2021-10-18 17:29:51 +02:00 |
thomas chaton
|
86df7dcee7
|
Add KFold Loop example (#9965)
|
2021-10-18 16:27:12 +01:00 |
Adrian Wälchli
|
a99b7440b5
|
Add unit tests for `pl.utilities.grads` (#9765)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
|
2021-10-18 18:58:51 +05:30 |
Rohit Gupta
|
4dc32ad7db
|
Fix logic to check for spawn in worker_check (#9902)
* fix
* update tests
* chlog
* skip windows
|
2021-10-18 13:02:46 +00:00 |
Adrian Wälchli
|
10d0b41977
|
Introduce `PrecisionPlugin.forward_context()` (#9988)
Co-authored-by: thomas chaton <thomas@grid.ai>
|
2021-10-18 12:58:19 +00:00 |
Carlos Mocholí
|
3f355d0eb7
|
Remove manual tracking of optimizer steps (#9957)
|
2021-10-18 12:43:06 +00:00 |
Carlos Mocholí
|
e0470cc244
|
Update `resume_from_checkpoint` docs (#9952)
|
2021-10-18 17:40:47 +05:30 |
Carlos Mocholí
|
0684e5295f
|
Remove deprecated `DataModule.dims` usage in tests (#9948)
|
2021-10-18 17:35:41 +05:30 |
Carlos Mocholí
|
c69a79c86f
|
Fix `self.log(on_epoch=True)` on_batch_start (#9780)
|
2021-10-18 14:02:16 +02:00 |
Elad Segal
|
8c76cf5ae1
|
reset val dataloader for binsearch (#9975)
|
2021-10-18 12:54:26 +02:00 |
Adrian Wälchli
|
7a9151637c
|
loop customization docs (#9609)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
Co-authored-by: edenlightning <66261195+edenlightning@users.noreply.github.com>
|
2021-10-18 09:43:11 +00:00 |
Carlos Mocholí
|
01b304ec57
|
Update accelerator connector messages after the addition of strategy (#9937)
|
2021-10-18 01:10:48 +00:00 |
Carlos Mocholí
|
788f6864d9
|
Fix `LightningOptimizer` step and toggling logic (#9958)
|
2021-10-18 00:23:51 +00:00 |
ronif
|
7b4df7bf91
|
Fix issue with no-init dataclass fields in move_to_device (#9963)
Co-authored-by: ronif <ronif@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
|
2021-10-17 07:10:47 +00:00 |
Carlos Mocholí
|
e5dfdf34f9
|
Avoid deprecation warning after #9901 (#9951)
|
2021-10-16 17:36:25 +01:00 |
Mauricio Villegas
|
1f09cf2432
|
Fixed use of LightningCLI in computer_vision_fine_tuning.py example (#9934)
|
2021-10-16 17:04:02 +01:00 |
Kaushik B
|
5e8829b97d
|
(1/n) tests: Use strategy flag instead of accelerator for training strategies (#9931)
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
|
2021-10-16 20:40:25 +05:30 |
Carlos Mocholí
|
e973bcb76a
|
Use non-deprecated options in tests (#9949)
|
2021-10-15 16:58:07 -07:00 |
Carlos Mocholí
|
db4e770004
|
Validate the precision input earlier (#9763)
|
2021-10-15 17:30:00 +00:00 |
kingyiusuen
|
6429de8944
|
Add support for `len(datamodule)` (#9895)
Co-authored-by: tchaton <thomas@grid.ai>
|
2021-10-15 14:19:50 +02:00 |
Danielle Pintz
|
16213b1635
|
Deprecate `log_gpu_memory`, `gpu_metrics`, and util funcs in favor of `DeviceStatsMonitor` callback (#9921)
Co-authored-by: Kaushik B <45285388+kaushikb11@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
|
2021-10-14 22:45:44 +02:00 |
Oliver Borchert
|
afbf703684
|
Single-process multi-node CPU training (#9603)
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
|
2021-10-14 22:21:41 +02:00 |