"""
Template model definition
-------------------------

In 99% of cases you want to just copy `one of the examples
<https://github.com/PyTorchLightning/pytorch-lightning/tree/master/pl_examples>`_
to start a new LightningModule and change the core of what your model is actually trying to do.

.. code-block:: bash

    # get a copy of the module template
    wget https://raw.githubusercontent.com/PyTorchLightning/pytorch-lightning/master/pl_examples/new_project_templates/lightning_module_template.py  # noqa: E501

Trainer Example
---------------

**`__main__` function**

Normally, we want the `__main__` function to start the training.
Inside it, we parse the training arguments for whatever hyperparameters we want.
Your LightningModule will have a chance to add its own hyperparameters.

.. code-block:: python

    import os
    import sys

    from test_tube import HyperOptArgumentParser

    if __name__ == '__main__':

        # use default args given by lightning
        root_dir = os.path.split(os.path.dirname(sys.modules['__main__'].__file__))[0]
        parent_parser = HyperOptArgumentParser(strategy='random_search', add_help=False)
        add_default_args(parent_parser, root_dir)

        # allow model to overwrite or extend args
        parser = ExampleModel.add_model_specific_args(parent_parser)
        hyperparams = parser.parse_args()

        # train model
        main(hyperparams)
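
The ``ExampleModel.add_model_specific_args`` call above assumes your LightningModule exposes a
static method that extends the parent parser with model-specific hyperparameters. A minimal
sketch of that method, assuming test_tube's parser can be chained through ``parents`` (the
hyperparameter names below are purely illustrative, not part of any fixed API):

.. code-block:: python

    from pytorch_lightning import LightningModule
    from test_tube import HyperOptArgumentParser

    class ExampleModel(LightningModule):

        @staticmethod
        def add_model_specific_args(parent_parser):
            # build a child parser that inherits every argument already on the parent
            parser = HyperOptArgumentParser(strategy=parent_parser.strategy,
                                            parents=[parent_parser])

            # model-specific hyperparameters (illustrative names)
            parser.add_argument('--learning_rate', default=0.001, type=float)
            parser.add_argument('--hidden_dim', default=128, type=int)
            return parser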

**Main Function**

The main function is your entry point into the program. This is where you init your model and
checkpoint directory, and launch the training. The main function should have 3 arguments:

- hparams: a configuration of hyperparameters.
- cluster: SLURM cluster manager object (can be None)
- results_dict: a dict for you to return any values you want (useful in meta-learning; otherwise it can be ignored)

.. code-block:: python

    def main(hparams, cluster, results_dict):
        # build model
        model = MyLightningModule(hparams)

        # configure trainer
        trainer = Trainer()

        # train model
        trainer.fit(model)
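
The text above mentions a checkpoint directory, which the snippet does not show. Here is one way
you might wire it in, assuming the era-appropriate ``ModelCheckpoint(filepath=...)`` signature and
a hypothetical ``hparams.checkpoint_dir`` argument (these names have changed across releases, so
check your installed version):

.. code-block:: python

    from pytorch_lightning import Trainer
    from pytorch_lightning.callbacks import ModelCheckpoint

    def main(hparams, cluster, results_dict):
        # build model
        model = MyLightningModule(hparams)

        # save the best weights under a user-chosen directory
        # (``hparams.checkpoint_dir`` is a hypothetical argument added by the parser)
        checkpoint = ModelCheckpoint(filepath=hparams.checkpoint_dir)

        # configure trainer to use the checkpoint callback
        trainer = Trainer(checkpoint_callback=checkpoint)

        # train model
        trainer.fit(model)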

The `__main__` function will start training by calling your **main** function.
If you use the HyperOptArgumentParser in hyperparameter optimization mode,
this main function will receive one set of hyperparameters. If you use it as a simple
argument parser, you get the default arguments from the argument parser.

So, calling main(hyperparams) runs the model with the default argparse arguments::

    main(hyperparams)

CPU hyperparameter search
-------------------------

.. code-block:: python

    # run a grid search over 20 hyperparameter combinations.
    hyperparams.optimize_parallel_cpu(
        main_local,
        nb_trials=20,
        nb_workers=1
    )
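
``main_local`` is not defined in these docs; presumably it is a local, CPU-only wrapper around
the ``main`` function defined earlier. A minimal sketch under that assumption:

.. code-block:: python

    def main_local(hparams):
        # hypothetical wrapper: run ``main`` with no cluster manager and no results dict
        main(hparams, cluster=None, results_dict=None)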

Hyperparameter search on a single or multiple GPUs
--------------------------------------------------

.. code-block:: python

    # run a grid search over 20 hyperparameter combinations.
    hyperparams.optimize_parallel_gpu(
        main_local,
        nb_trials=20,
        nb_workers=1,
        gpus=[0, 1, 2, 3]
    )

Hyperparameter search on a SLURM HPC cluster
--------------------------------------------

.. code-block:: python

    import logging

    from test_tube.hpc import SlurmCluster

    def optimize_on_cluster(hyperparams):
        # enable cluster training
        cluster = SlurmCluster(
            hyperparam_optimizer=hyperparams,
            log_path=hyperparams.tt_save_path,
            test_tube_exp_name=hyperparams.tt_name
        )

        # email for cluster coms
        cluster.notify_job_status(email='add_email_here', on_done=True, on_fail=True)

        # configure cluster
        cluster.per_experiment_nb_gpus = hyperparams.per_experiment_nb_gpus
        cluster.job_time = '48:00:00'
        cluster.gpu_type = '1080ti'
        cluster.memory_mb_per_node = 48000

        # any modules for code to run in env
        cluster.add_command('source activate pytorch_lightning')

        # name of exp
        job_display_name = hyperparams.tt_name.split('_')[0]
        job_display_name = job_display_name[0:3]

        # run hopt
        logging.info('submitting jobs...')
        cluster.optimize_parallel_cluster_gpu(
            main,
            nb_trials=hyperparams.nb_hopt_trials,
            job_name=job_display_name
        )

    # run cluster hyperparameter search
    optimize_on_cluster(hyperparams)
"""

from .basic_examples.lightning_module_template import LightningTemplateModel

__all__ = [
    'LightningTemplateModel'
]