History

wangraying a0655611de Add bagua installation in dockerfile (#11283 ) Co-authored-by: Aki Nitta <nitta@akihironitta.com> Co-authored-by: Jirka <jirka.borovec@seznam.cz>		2022-02-24 15:17:31 +01:00
..
base-conda	Fix horovod installation `base-cuda` Dockerfile (#11811 )	2022-02-10 16:48:33 +09:00
base-cuda	Add bagua installation in dockerfile (#11283 )	2022-02-24 15:17:31 +01:00
base-ipu	Update Python testing (#10269 )	2021-11-04 18:26:24 +01:00
base-xla	Drop Python 3.6 support (#11117 )	2021-12-21 17:06:15 +00:00
ipu-ci-runner	Update Python testing (#10269 )	2021-11-04 18:26:24 +01:00
nvidia	update NGC (#10770 )	2021-11-29 14:14:37 +00:00
release	Upgrade CI after the 1.10 release (#10075 )	2021-11-10 17:59:10 +01:00
tpu-tests	Rename training plugin test files & names to strategy (#11303 )	2022-01-04 14:32:45 +01:00
README.md	Unblock GPU CI (#11934 )	2022-02-16 21:15:44 +01:00

README.md

Docker images

Builds images form attached Dockerfiles

You can build it on your own, note it takes lots of time, be prepared.

git clone <git-repository>
docker image build -t pytorch-lightning:latest -f dockers/conda/Dockerfile .

or with specific arguments

git clone <git-repository>
docker image build \
    -t pytorch-lightning:base-cuda-py3.9-pt1.8 \
    -f dockers/base-cuda/Dockerfile \
    --build-arg PYTHON_VERSION=3.9 \
    --build-arg PYTORCH_VERSION=1.8 \
    .

or nightly version from Conda

git clone <git-repository>
docker image build \
    -t pytorch-lightning:base-conda-py3.8-pt1.9 \
    -f dockers/base-conda/Dockerfile \
    --build-arg PYTHON_VERSION=3.8 \
    --build-arg PYTORCH_VERSION=1.9 \
    .

To run your docker use

docker image list
docker run --rm -it pytorch-lightning:latest bash

and if you do not need it anymore, just clean it:

docker image list
docker image rm pytorch-lightning:latest

Run docker image with GPUs

To run docker image with access to you GPUs you need to install

# Add the package repositories
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

and later run the docker image with --gpus all so for example

docker run --rm -it --gpus all pytorchlightning/pytorch_lightning:base-cuda-py3.7-torch1.6

Run Jupyter server

Inspiration comes from https://u.group/thinking/how-to-put-jupyter-notebooks-in-a-dockerfile

Build the docker image:

docker image build \
    -t pytorch-lightning:v1.3.1 \
    -f dockers/nvidia/Dockerfile \
    --build-arg LIGHTNING_VERSION=1.3.1 \
    .

start the server and map ports:

docker run --rm -it --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=all -p 8888:8888 pytorch-lightning:v1.3.1

Connect in local browser:
- copy the generated path e.g. http://hostname:8888/?token=0719fa7e1729778b0cec363541a608d5003e26d4910983c6
- replace the hostname by localhost