lightning

Commit Graph

Author	SHA1	Message	Date
Jirka Borovec	c09317e68f	cleaning (#2030 ) * cleaning * optim imports * fix * typo * on * mergify	2020-06-04 11:25:07 -04:00
Wah Loon Keng	6e993c608b	correct trainer.fit production example (#2068 ) trainer.fit uses the parameter `val_dataloaders` but in the documentation it is `val_dataloader`, which is invalid.	2020-06-04 11:24:12 -04:00
Daniel Li	1ad81570e6	Update the documentation of configure_optimizers() (#2071 ) * Explain the default value for scheduler Co-authored-by: Qinru Li <q4li@eng.ucsd.edu>	2020-06-04 11:23:44 -04:00
William Falcon	d96df75d6a	testing new speed (#1587 ) * fixed new amp bugs * fixed new amp bugs * fixed new amp bugs * try exit * larger dataset * full mnist * full mnist * trainer * assert * .05 * .10, #4 * #5 * #5 * #5 * refactor * abs diff * speed * speed * speed * speed Co-authored-by: J. Borovec <jirka.borovec@seznam.cz> Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-06-04 11:20:12 -04:00
Adrian Wälchli	4234992302	Fix local variables being collected into module_arguments dict (#2048 ) * do not include local vars in auto collection * add test * add test for model with "self" renamed to "obj" * skip decorator * changelog * changelog * update docs * remove obsolete child collection * generalize args, kwargs names * docs * also update varargs passed in * Revert "also update varargs passed in" This reverts commit 3d7a30dbee07a513ee13e1cc3e08ca5ccdb85734. * update test	2020-06-04 08:35:50 -04:00
kumuji	fd7814d287	Added black formater for the code with code-checker on pull (#1610 ) * black Added throught black.toml other options are hard so far No caching for black github action Moved from black.toml to pyproject.toml Exclude not only yml but also yaml Update pyproject.toml Co-authored-by: Thomas Johansen <thomasjo@gmail.com> Update .github/workflows/code-formatting-check.yml mergify Remove formating check E231 error ignoring because of black formating Updated CONTRIBUTING to the master * Update .github/workflows/code-formatting-check.yml * Bump black to 19.10b0 version * resolved incorrect merge of CONTRIBUTING, Black skipping string normalization * Minor fixes in CONTRIBUTING, two typos * Update setup.cfg * chlog Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-06-03 18:23:14 +02:00
Jirka Borovec	5d93d57573	Tests/drop macos py38 (#2061 ) * tests drop macOS py38 * ignore single test * try freeze env * drop * drop * drop * drop * drop skips * imports * fix	2020-06-03 08:38:56 -04:00
Jirka Borovec	c438d0dd90	increase acc (#2039 ) * increase acc * try 0.45 * @pytest * @pytest * try .50 * duration * pytest	2020-06-03 08:28:19 -04:00
Jirka Borovec	b4eb6ef5a1	tests drop macOS py38 (#2054 ) * tests drop macOS py38 * ignore single test * try freeze env * drop * drop * drop * drop * drop skips * drop macOS py38 * imports	2020-06-03 06:48:20 -04:00
Adrian Wälchli	8211256c46	data transfer model hook (+ refactor) (#1756 ) * refactor and added hook variant a variant b add test revert rename add changelog docs * resolve merge duplication * overridden typo * fix test * tpu id * raise if TPU not available * re-use apply_to_collection function for parsing collections * comment * make utility function available to user * documentation * move changelog entry to top * fix tpu transfer call * fix call * remove hardcoded string * improve test * call model hook by default * Apply suggestions from code review * rename utility function Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 21:45:19 -04:00
Devashish Shankar	ade3f36b7a	Raise an error when lightning replaces an existing sampler (#2020 ) * Raise an error when lightning replaces an existing sampler Currently, Trainer replaces the existing sampler with DistributedSampler if running distributing training and `replace_sampler_ddp=True` (default behaviour). If a user has configured an existing sampler, this would lead to widely different results if running a distributed vs non-distributed training. This PR fixes this by raising an Error if user has configured a sampler and uses `replace_sampler_ddp=True`. The recommended behavior from now on is to either remove the sampler or set `replace_sampler_ddp=False` * Fix tests * Simpler fix * Fix tests * Make inner method protected * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:52:04 -04:00
Ivan Nazarov	e85a646a41	Mistake in parameters' grad norm tracking (#2012 ) * fix grad norm formula * grad-norm tracker test * fixed seed and explicit rtol in grad norm tracking test * a docstring for grad-norms and forced cast to float of norm_type * support for inf-norm * renamed the grad norm test * docs * fixed language in docstring * Apply suggestions from code review Co-authored-by: Jirka <jirka@pytorchlightning.ai> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:51:09 -04:00
Adrian Wälchli	a699003e67	Update/merge multi-gpu docs (#2021 ) * merge multi-gpu docs * extend slurm docs * update links to elastic * format docs and type hints in distrib parts * reference multi-gpu/slurm in trainer args docs * fix doctest * typo * doctest * Apply suggestions from code review Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com> * wall time * Update docs/source/slurm.rst Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com> * fix title * update docs for weights summary * update changelog Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com>	2020-06-02 18:50:08 -04:00
Udit Arora	26b69917b4	Add Open MPI installation details for horovod (#2050 )	2020-06-02 18:48:26 -04:00
Lezwon Castelino	943c4b20af	slow tpu train (#2033 ) * use parallel loader * Revert "use parallel loader" This reverts commit ed6e7583 * select tpu id for pl * condition if tpu_id is None * added info to changelog * Revert "condition if tpu_id is None" This reverts commit `1fb6e586` * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:48:05 -04:00
Rohit Gupta	fa696ce512	fix bug_report template (#2052 ) * fix bug_report template * article	2020-06-02 18:47:21 -04:00
Jirka Borovec	69575204f2	notes on Bug fixing (#2053 ) * import * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-06-02 18:47:03 -04:00
Boris Dayma	00f1ac11e6	fix(wandb): use same logger on multiple training loops (#2055 ) * fix(wandb): use same logger on multiple training loops New training loops reset step to 0 which would previously try to overwrite logs fix #2015 * docs(changelog.md): add reference to PR 2055	2020-06-02 18:46:02 -04:00
Rohit Gupta	0914873bc2	Fix domain_template scripts (#2014 ) * Fix domain_templates * Fix type of fake labels * type * args	2020-06-01 11:38:52 -04:00
William Falcon	82a20296e3	Replaces ddp .spawn with subprocess (#2029 ) * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix	2020-06-01 11:00:32 -04:00
Jirka Borovec	fd38f52e55	sooner CI testing (#2037 )	2020-06-01 10:21:52 -04:00
William Falcon	0be530a427	Revert "Fixes EarlyStopping With Precision=16 (#1996 )" (#2032 ) This reverts commit `bf39cb26c5`.	2020-05-31 15:20:18 -04:00
authman	bf39cb26c5	Fixes EarlyStopping With Precision=16 (#1996 ) * Patch for issue 1815, which will allow EarlyStopping to work on precision=16 * Added a whitespace to the end of the line so CICD can rerun. No reason for the latest macos test to have been cancelled. * Format.	2020-05-31 15:02:19 -04:00
Fabio Natanael Kepler	8b9b923ca8	Keep track of the best model's path saved by ModelCheckpoint (#1799 ) * Add an additional attribute to ModelCheckpoint to keep track of the best model's path Currently, only the best metric value is directly tracked. This new attribute will help in uses cases where the trained model needs to be used or tracked right after training. * Add small description and usage example to docs * Fix PEP8 issues * Fix doctest example * Fix expected output in doctest * Apply suggestions from code review * Show example as code block instead of doctest * Apply suggestions from code review * Update CHANGELOG.md * Rename `ModelCheckpoint.best` to `ModelCheckpoint.best_model_score` Also rename `ModelCheckpoint.best_model` (added in this PR) to `ModelCheckpoint.best_model_path`, for consistency, and `kth_best_model` to `kth_best_model_path`. * Update pytorch_lightning/trainer/training_io.py Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> * Add warning when loading checkpoint from an old version Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-05-31 08:47:13 -04:00
Artem Lobantsev	55fdfe3845	Bugfix/fix gan example (#2019 ) * 🐛 fixed fake example type assigning and hparams arg * fixed GAN example to work with dp, ddp., ddp_cpu * Update generative_adversarial_net.py Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-05-31 08:31:21 -04:00
William Falcon	0e37e8c4d2	hotfix to unblock hparams and OmniConf - removes auto_register_init_args by default (#2025 ) * ogc install * cleaned up tests * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix	2020-05-31 08:29:51 -04:00
Jirka Borovec	9893681859	fix changelog (#1864 ) * fix chlog * test for #1729 * hist * update * Document use case of passing test dataloaders to Trainer.test() (#1992) * Issue 1990 Doc patch. * Codeblock directive. * Update to reflect current state of pytorch-lightning * Final grammar cleaning. I hope these commits are squashed. * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: authman <uapatira@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-31 00:48:05 -04:00
Yassine Alouini	d8dc0a7228	Few typo correction (#2011 )	2020-05-31 00:39:56 -04:00
edenafek	fdbbe96825	docs/fix_CONTRIBUTING.md (#1984 ) * Update CONTRIBUTING.md typos * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-29 23:05:43 -04:00
Justus Schock	ceecf1cea9	Graceful shutdown on python interpreter exit (#1631 ) * Fraceful shutdown on python interpreter exit * Update CHANGELOG.md * Update training_loop.py * Update training_loop.py * Update CHANGELOG.md Co-Authored-By: Jirka Borovec <Borda@users.noreply.github.com> * pep8, move to constant * Update training_loop.py * Update training_loop.py * Update training_loop.py * pep8, move to constant * pep8 * timeout Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Jirka <jirka.borovec@seznam.cz>	2020-05-29 16:20:04 +02:00
Andreas Kirsch	3af3f37d43	Add toma comments to auto_scale_batch_size (#1994 ) * Add source comments * Update training_tricks.rst	2020-05-29 05:57:50 +00:00
Tejasvi S Tomar	cd3fed03a2	Minor typo (#1987 )	2020-05-28 20:06:15 +00:00
Jirka Borovec	8ee6d91d0e	code guideline (#1949 ) * code rule * Apply suggestions from code review Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com> * chlog Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com> Co-authored-by: Nicki Skafte <skaftenicki@gmail.com>	2020-05-28 14:40:49 +00:00
Loïc Grobol	c3cf33d1de	Fix root node resolution (#1954 )	2020-05-27 22:50:37 -04:00
Jirka Borovec	df78e84060	unify tests (#1940 ) * unify tests * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-27 22:45:23 -04:00
Ivan Nazarov	7c19c373ac	LearningRateLogger in multi-scheduler setting (#1944 ) * fixed undesired behaviour due to dict.fromkeys * a test for log length consistency * runtime-warn if no schedulers are configured * chlog * move Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-05-27 22:44:46 -04:00
Mateusz Pieniak	3af4994d5a	Removing unecessary early stopping calls (#1863 ) * Removing unecessary early stopping calls * Update CHANGELOG.md Co-authored-by: Mateusz Pieniak <mateusz.pieniak@evidenceprime.com> Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-05-26 19:06:06 -04:00
Jirka Borovec	5e8c5abf63	fix default arg (#1927 ) * fix default * formatting errors * update * flake8	2020-05-26 19:04:42 -04:00
Jirka Borovec	ca815698f5	Revert "Remove unused param tpu_core_idx (#1948 )" (#1963 ) This reverts commit `d0ec11b9d6`.	2020-05-26 19:02:51 -04:00
William Falcon	460ab5485e	Gen ddp support (#1961 ) * updated docs * added mixed * added mixed	2020-05-26 19:02:30 -04:00
Nand Dalal	c967b88fc8	Update unet.py (#1955 )	2020-05-26 13:28:17 +00:00
Rohit Gupta	d0ec11b9d6	Remove unused param tpu_core_idx (#1948 )	2020-05-25 16:04:53 -04:00
Adrian Wälchli	34237cfcaf	handle unknown args passed to Trainer.from_argparse_args (#1932 ) * filter valid args * error on unknown manual args * added test * changelog * update docs and doctest * simplify * doctest * doctest * doctest * better test with mock check for init call * fstring * extend test * skip test on 3.6 not working Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-05-25 16:01:29 -04:00
William Falcon	f46a7bae77	updated docs (#1941 )	2020-05-25 15:59:32 -04:00
Federico Baldassarre	65b4352930	early stopping checks on_validation_end (#1458 ) * Fixes PyTorchLightning/pytorch-lightning#490 `EarlyStopping` should check the metric of interest `on_validation_end` rather than `on_epoch_end`. In a normal scenario, this does not cause a problem, but in combination with `check_val_every_n_epoch>1` in the `Trainer` it results in a warning or in a `RuntimeError` depending on `strict`. * Highlighted that ES callback runs on val epochs in docstring * Updated EarlyStopping in rst doc * Update early_stopping.py * Update early_stopping.rst * Update early_stopping.rst * Update early_stopping.rst * Update early_stopping.rst * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> * Update docs/source/early_stopping.rst * fix doctest indentation warning * Train loop calls early_stop.on_validation_end * chlog Co-authored-by: William Falcon <waf2107@columbia.edu> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-05-25 17:33:00 +00:00
Adrian Wälchli	8ca8336ce5	protect progress bar callback (#1855 ) * wip protected progress bar settings * remove callback attr from LRfinder * whitespace * changelog	2020-05-25 07:49:23 -04:00
Lucas Vazquez	112dd5c4f6	Adds the option of saving the last model on checkpoint (#1908 ) * saves model every epoch * implement test for save_last * Update CHANGELOG.md * Update CHANGELOG.md * changes test description Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com> Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com>	2020-05-25 07:47:44 -04:00
Nicki Skafte	a34eb9e169	Fix logger bug and prepare data bug (#1933 ) * tests, fix logger bug and prepare data bug * add CHANGELOG.md Co-authored-by: Nicki Skafte <nugginea@gmail.com>	2020-05-25 07:43:56 -04:00
Jirka Borovec	033ddc0c29	update min req (#1934 )	2020-05-25 07:43:17 -04:00
Justus Schock	6456247287	Re-Enable Import Errors (#1938 ) * update logger imports * pep8 fixes * pep8	2020-05-25 07:31:35 -04:00

1 2 3 4 5 ...

2456 Commits All Branches Search

2456 Commits

All Branches