Lightning offers options for logging information about the model, GPU usage, etc., via several different logging frameworks. It also offers printing options for monitoring training.
---
### Setting up logging
Initialize your logger, which should inherit from `LightningLoggerBase`, and pass
it to `Trainer`.
```{.python}
my_logger = MyLightningLogger(...)
trainer = Trainer(logger=my_logger)
```
Lightning supports several common experiment tracking frameworks out of the box.
---
#### Test tube
Log using [test tube](https://williamfalcon.github.io/test-tube/).
```{.python}
from pytorch_lightning.logging import TestTubeLogger
tt_logger = TestTubeLogger(
    save_dir=".",
    name="default",
    debug=False,
    create_git_tag=False
)
trainer = Trainer(logger=tt_logger)
```
---
#### MLFlow
Log using [mlflow](https://mlflow.org).
```{.python}
from pytorch_lightning.logging import MLFlowLogger
mlf_logger = MLFlowLogger(
    experiment_name="default",
    tracking_uri="file:/."
)
trainer = Trainer(logger=mlf_logger)
```
---
#### Custom logger
You can implement your own logger by writing a class that inherits from
`LightningLoggerBase`. Use the `rank_zero_only` decorator to make sure that
only the first process in DDP training logs data.
```{.python}
from pytorch_lightning.logging import LightningLoggerBase, rank_zero_only

class MyLogger(LightningLoggerBase):

    @rank_zero_only
    def log_hyperparams(self, params):
        # params is an argparse.Namespace
        # your code to record hyperparameters goes here
        pass

    @rank_zero_only
    def log_metrics(self, metrics, step_num):
        # metrics is a dictionary of metric names and values
        # your code to record metrics goes here
        pass

    def save(self):
        # Optional. Any code necessary to save logger data goes here
        pass

    @rank_zero_only
    def finalize(self, status):
        # Optional. Any code that needs to be run after training
        # finishes goes here
        pass
```
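As with the built-in loggers, an instance of the custom class is simply passed to the `Trainer`. A minimal sketch, using the `MyLogger` class defined above:
```{.python}
# instantiate the custom logger and hand it to the Trainer
my_logger = MyLogger()
trainer = Trainer(logger=my_logger)
```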
If you write a logger that may be useful to others, please send
a pull request to add it to Lightning!
---
### Using loggers
#### Display metrics in progress bar
``` {.python}
# DEFAULT
trainer = Trainer(show_progress_bar=True)
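
# to turn the progress bar off
trainer = Trainer(show_progress_bar=False)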
```
---
#### Log metric row every k batches
Every k batches, Lightning will make an entry in the metrics log.
``` {.python}
# DEFAULT (ie: save a .csv log file every 10 batches)
trainer = Trainer(row_log_interval=10)
```
---
#### Log GPU memory
Logs GPU memory when metrics are logged.
``` {.python}
# DEFAULT
trainer = Trainer(log_gpu_memory=False)
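
# to log GPU memory alongside other metrics (assuming the flag is a simple boolean)
trainer = Trainer(log_gpu_memory=True)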
```
---
#### Process position
When running multiple models on the same machine, this value decides where each model's progress bar is shown.
Lightning will stack progress bars according to this value.
``` {.python}
# DEFAULT
trainer = Trainer(process_position=0)
# if this is the second model on the node, show the second progress bar below
trainer = Trainer(process_position=1)
```
---
#### Save a snapshot of all hyperparameters
Log hyperparameters using the logger.
``` {.python}
logger = TestTubeLogger(...)
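# args here stands for your hyperparameters, e.g. an argparse.Namespace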
logger.log_hyperparams(args)
trainer = Trainer(logger=logger)
```
---
#### Write logs to CSV every k batches
Every k batches, Lightning will write the new logs to disk.
``` {.python}
# DEFAULT (ie: save a .csv log file every 100 batches)
trainer = Trainer(log_save_interval=100)
```