From 5b2351cbb9dec443e588340661abea72b6fb9243 Mon Sep 17 00:00:00 2001 From: William Falcon Date: Wed, 8 Apr 2020 14:43:50 -0400 Subject: [PATCH] added slurm doc (#1418) * added slurm doc * added slurm doc --- README.md | 1 + docs/source/slurm.rst | 7 ++++++- 2 files changed, 7 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index c419a3b608..ab2f454258 100644 --- a/README.md +++ b/README.md @@ -44,6 +44,7 @@ pip install pytorch-lightning ## Docs - [master](https://pytorch-lightning.readthedocs.io/en/latest) +- [0.7.2](https://pytorch-lightning.readthedocs.io/en/0.7.2/) - [0.7.1](https://pytorch-lightning.readthedocs.io/en/0.7.1/) - [0.6.0](https://pytorch-lightning.readthedocs.io/en/0.6.0/) - [0.5.3.2](https://pytorch-lightning.readthedocs.io/en/0.5.3.2/) diff --git a/docs/source/slurm.rst b/docs/source/slurm.rst index 57d2a6b39a..db49aaa38c 100644 --- a/docs/source/slurm.rst +++ b/docs/source/slurm.rst @@ -88,4 +88,9 @@ to run into the walltime, and it does the following: 2. Requeues the job. 3. When the job starts, it loads the temporary checkpoint. -.. note:: To get this behavior you have to do nothing. +To get this behavior make sure to add the correct signal to your SLURM script + +.. code-block:: + + # 90 seconds before training ends + #SBATCH --signal=SIGUSR1@90