lightning/dockers
Justus Schock b4e29e0c8f
PL: Test PyTorch 2.0 pre-release on CPU and CUDA (#16764)
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com>
Co-authored-by: Jirka <jirka.borovec@seznam.cz>
2023-02-28 18:38:40 +01:00
..
base-cuda PL: Test PyTorch 2.0 pre-release on CPU and CUDA (#16764) 2023-02-28 18:38:40 +01:00
base-ipu copyright Lightning AI team (#16647) 2023-02-06 15:26:51 +01:00
base-xla CI: Update colossalai version (#16747) 2023-02-14 14:07:29 -05:00
ci-runner-hpu Upgrade to HPU release 1.8.0 (#16621) 2023-02-03 10:33:57 +01:00
ci-runner-ipu copyright Lightning AI team (#16647) 2023-02-06 15:26:51 +01:00
docs copyright Lightning AI team (#16647) 2023-02-06 15:26:51 +01:00
nvidia Update Colossal AI docs and integration (#16778) 2023-02-16 16:14:24 +00:00
release copyright Lightning AI team (#16647) 2023-02-06 15:26:51 +01:00
README.md Update CI to CUDA 11.7.1 (#16123) 2023-01-04 22:44:23 +00:00

README.md

Docker images

Build images from Dockerfiles

You can build it on your own, note it takes lots of time, be prepared.

git clone https://github.com/Lightning-AI/lightning.git

# build with the default arguments
docker image build -t pytorch-lightning:latest -f dockers/base-cuda/Dockerfile .

# build with specific arguments
docker image build -t pytorch-lightning:base-cuda-py3.9-torch1.13-cuda11.7.1 -f dockers/base-cuda/Dockerfile --build-arg PYTHON_VERSION=3.9 --build-arg PYTORCH_VERSION=1.13 --build-arg CUDA_VERSION=11.7.1 .

To run your docker use

docker image list
docker run --rm -it pytorch-lightning:latest bash

and if you do not need it anymore, just clean it:

docker image list
docker image rm pytorch-lightning:latest

Run docker image with GPUs

To run docker image with access to your GPUs, you need to install

# Add the package repositories
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

and later run the docker image with --gpus all. For example,

docker run --rm -it --gpus all pytorchlightning/pytorch_lightning:base-cuda-py3.9-torch1.12-cuda11.7.1

Run Jupyter server

  1. Build the docker image:
    docker image build -t pytorch-lightning:v1.6.5 -f dockers/nvidia/Dockerfile --build-arg LIGHTNING_VERSION=1.6.5 .
    
  2. start the server and map ports:
    docker run --rm -it --gpus=all -p 8888:8888 pytorch-lightning:v1.6.5
    
  3. Connect in local browser:
    • copy the generated path e.g. http://hostname:8888/?token=0719fa7e1729778b0cec363541a608d5003e26d4910983c6
    • replace the hostname by localhost