Jirka Borovec
783ec43a85
parse strategies as own extras ( #12975 )
...
* parse strategies as own extras
* prune devel
* Update Makefile
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* revert parse_requirements
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2022-05-09 09:25:53 -04:00
Jirka Borovec
7ce948edb6
Unpin CUDA docker image for GPU CI ( #12373 )
...
* unpin CUDA docker image for GPU CI
* Apply suggestions from code review
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Akihiro Nitta <akihiro@pytorchlightning.ai>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-05-06 02:56:57 +00:00
Jirka Borovec
bb51e2a55b
Merge pull request #12723 from PyTorchLightning/req/strategies
...
Separate strategies' requirements
2022-05-04 10:06:02 -04:00
Akihiro Nitta
bb81802bff
Update `deepspeed` and `fairscale` versions ( #12860 )
...
* Fix deepspeed installation
* Adapt to deepspeed>=0.5.9
* Fix fairscale installation
Co-authored-by: Akihiro Nitta <akihiro@pytorchlightning.ai>
2022-04-26 01:40:25 +02:00
Jirka Borovec
f9b69ce5b0
CI: check docker requires ( #12677 )
...
* check docker requires
* ci update
* bagua
* conda
* cuda
2022-04-12 00:29:54 +09:00
Rohit Gupta
02dc7751e9
Add `bfloat16` support to `DeepSpeedStrategy` ( #12508 )
...
Co-authored-by: carmocca <carlossmocholi@gmail.com>
2022-04-07 22:18:51 +00:00
Kaushik B
7b0d1183db
Update `gpus` flag with `accelerator` and `devices` flag ( #12156 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
2022-03-23 19:52:12 +00:00
four4fish
1eff3b53c1
Update fairscale version ( #11567 )
...
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2022-03-21 11:38:55 +00:00
Aki Nitta
b8b855d411
Pin Docker image for testing on GPUs ( #12368 )
...
* Pin docker image sha
2022-03-18 01:16:54 +00:00
Jirka Borovec
1144673cd9
CI: sanity check for req. pkgs ( #11819 )
...
* CI: sanity check for req. pkgs
* scripts
* rename
* gcsfs ?
* rich !
* install extra
* move
* set -e
Co-authored-by: Aki Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-03-11 09:20:47 +00:00
Jirka Borovec
7bc87015ea
Unblock GPU CI ( #11934 )
...
Co-authored-by: Carlos Mocholi <carlossmocholi@gmail.com>
2022-02-16 21:15:44 +01:00
wangraying
8c07d8bf90
Add `Trainer(strategy="bagua")` ( #11146 )
...
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: Sean Naren <sean@grid.ai>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>
Co-authored-by: thomas chaton <thomas@grid.ai>
2022-02-04 17:02:09 +00:00
Sean Naren
c66cd12445
Remove partitioning of model in ZeRO 3 ( #10655 )
2021-12-17 12:36:53 +00:00
Carlos Mocholí
152eb57def
Rename special to standalone ( #10779 )
2021-11-26 17:13:14 +00:00
Carlos Mocholí
5788789f01
Move benchmarks into the test directory ( #10614 )
2021-11-19 03:07:33 +01:00
thomas chaton
9e844d9db6
Lite Docs and Example Improvements ( #10303 )
...
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-11-02 16:13:01 +01:00
Jirka Borovec
edea0d4bc3
switch azure pool ( #10266 )
2021-11-01 11:42:11 +00:00
Carlos Mocholí
3a4e9970d6
Pin fairscale version ( #10200 )
2021-10-27 23:24:17 +00:00
Kaushik B
5e8829b97d
(1/n) tests: Use strategy flag instead of accelerator for training strategies ( #9931 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-10-16 20:40:25 +05:30
Sean Naren
83acb8671d
Update DeepSpeed version, fix failing tests ( #9898 )
2021-10-11 22:35:33 +00:00
Carlos Mocholí
0dfc6a18bd
Call any trainer function from the `LightningCLI` ( #7508 )
2021-08-28 04:43:14 +00:00
Adrian Wälchli
de22e40095
restrict deepspeed version in CI ( #8951 )
2021-08-17 14:02:27 +01:00
thomas chaton
9e61de2063
Torch Elastic DDP DeadLock bug fix ( #8655 )
...
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-08-02 21:48:43 +02:00
thomas chaton
85bba06529
update ( #8674 )
2021-08-02 11:56:09 +02:00
Jirka Borovec
470842f5c8
CI: validate JSON & fix benchmark ( #8567 )
...
* CI: validate JSON
* as GHA
* PT1.8
* 32g
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-07-28 18:09:15 +02:00
Adrian Wälchli
96729fc45a
update links for collect_env_details.py script ( #8436 )
2021-07-19 11:26:09 +00:00
Carlos Mocholí
ae1fd6a201
Unblock GPU CI ( #8456 )
...
* Debug
* Increase SHM size
* Debug
* Refactor MNIST imports
* Undo debugging
* Prints
2021-07-19 09:41:18 +02:00
Carlos Mocholí
4184d7e738
Refactor GPU examples tests ( #8294 )
2021-07-06 13:14:04 +01:00
Carlos Mocholí
2c43bfc5ef
GPU CI - run torch 1.8 (LTS) ( #8116 )
2021-06-24 16:56:43 +00:00
Sean Naren
f7459f5328
DeepSpeed Infinity Update ( #7234 )
...
* Update configs to match latest API
* Ensure we move the entire model to device before configure optimizer is called
* Add missing param
* Expose parameters
* Update references, drop local rank as it's now infered from the environment variable
* Fix ref
* Force install deepspeed 0.3.16
* Add guard for init
* Update pytorch_lightning/plugins/training_type/deepspeed.py
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
* Revert type checking
* Install master for CI for testing purposes
* Update CI
* Fix tests
* Add check
* Update versions
* Set precision
* Fix
* See if i can force upgrade
* Attempt to fix
* Drop
* Add changelog
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
2021-06-14 16:38:28 +00:00
Carlos Mocholí
e16d4fbdee
CI code cleaning ( #7615 )
2021-05-21 11:35:12 +00:00
Louis Taylor
b64aea637c
CI: move azure-pipelines config to separate directory ( #7276 )
...
* CI: move azure pipelines to separate directory
This removes some extra clutter in the top level as we add more
pipelines.
* rename
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
2021-05-04 10:50:16 -04:00