lightning

Commit Graph

Author	SHA1	Message	Date
William Falcon	db0c94e4a4	Update README.md	2020-06-09 07:30:10 -04:00
Pattarawat Chormai	3be557dc5b	document: fix callback signature (#2113 )	2020-06-09 07:10:44 -04:00
Tushar Jain	8d3d471f03	Update README.md (#2117 )	2020-06-09 07:09:43 -04:00
Udit Arora	a1658ea63d	Add docs about example dependencies (#2122 ) * Add torchvision and gym dependencies * Add pl_examples/requirements.txt to the list of dependencies for running local tests	2020-06-09 07:09:03 -04:00
Tullie Murrell	6537642f6a	Remove explicit flush from tensorboard logger (#2126 ) * Remove explicit flush from tensorboard logger * Update changelog	2020-06-09 07:08:12 -04:00
William Falcon	3f28a8ef32	Update __init__.py	2020-06-08 19:28:50 -04:00
William Falcon	479ab49d03	temporarily fixes early stopping bug (#2119 ) * fixes early stopping bug * fixes early stopping bug * fixes early stopping bug * fixes early stopping bug * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * fixe docs * added test	2020-06-08 19:28:26 -04:00
William Falcon	73a6a957fd	fixe docs	2020-06-08 18:00:24 -04:00
William Falcon	3260e59b27	Adds back the slow spawn ddp implementation that people want (#2115 ) * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * training batch clean up * adding spawn * adding spawn * adding spawn * adding spawn * adding spawn * adding spawn * adding spawn * adding spawn	2020-06-08 17:55:25 -04:00
William Falcon	0bd7780adc	Fixes CPU and hanging GPU crash (#2118 ) * training batch clean up * training batch clean up * training batch clean up	2020-06-08 16:30:20 -04:00
edenlightning	9e8716afe8	Update Readme with tunning overhead time (#2082 )	2020-06-08 07:26:58 -04:00
Adrian Wälchli	1f95fb9af7	update readme with conda installation instruction (#2099 ) * update readme with conda installation instruction * fix team header * bibtex spelling * Update README.md Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-06-08 07:22:54 -04:00
Jirka Borovec	d2967d9305	update hparams, allow OmegaConf (#2047 ) * DictConf * inits * Apply suggestions from code review Co-authored-by: Omry Yadan <omry@fb.com> * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * atrib * wip * wip * wip * added hparams test * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * Update test_hparams.py * added hparams test * added hparams test * pep8 * pep8 * pep8 * docs * wip * wip * clean * review @omry * Update docs/source/hyperparameters.rst Co-authored-by: Omry Yadan <omry@fb.com> Co-authored-by: Omry Yadan <omry@fb.com> Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-06-08 07:19:34 -04:00
Jirka Borovec	c09317e68f	cleaning (#2030 ) * cleaning * optim imports * fix * typo * on * mergify	2020-06-04 11:25:07 -04:00
Wah Loon Keng	6e993c608b	correct trainer.fit production example (#2068 ) trainer.fit uses the parameter `val_dataloaders` but in the documentation it is `val_dataloader`, which is invalid.	2020-06-04 11:24:12 -04:00
Daniel Li	1ad81570e6	Update the documentation of configure_optimizers() (#2071 ) * Explain the default value for scheduler Co-authored-by: Qinru Li <q4li@eng.ucsd.edu>	2020-06-04 11:23:44 -04:00
William Falcon	d96df75d6a	testing new speed (#1587 ) * fixed new amp bugs * fixed new amp bugs * fixed new amp bugs * try exit * larger dataset * full mnist * full mnist * trainer * assert * .05 * .10, #4 * #5 * #5 * #5 * refactor * abs diff * speed * speed * speed * speed Co-authored-by: J. Borovec <jirka.borovec@seznam.cz> Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-06-04 11:20:12 -04:00
Adrian Wälchli	4234992302	Fix local variables being collected into module_arguments dict (#2048 ) * do not include local vars in auto collection * add test * add test for model with "self" renamed to "obj" * skip decorator * changelog * changelog * update docs * remove obsolete child collection * generalize args, kwargs names * docs * also update varargs passed in * Revert "also update varargs passed in" This reverts commit 3d7a30dbee07a513ee13e1cc3e08ca5ccdb85734. * update test	2020-06-04 08:35:50 -04:00
kumuji	fd7814d287	Added black formater for the code with code-checker on pull (#1610 ) * black Added throught black.toml other options are hard so far No caching for black github action Moved from black.toml to pyproject.toml Exclude not only yml but also yaml Update pyproject.toml Co-authored-by: Thomas Johansen <thomasjo@gmail.com> Update .github/workflows/code-formatting-check.yml mergify Remove formating check E231 error ignoring because of black formating Updated CONTRIBUTING to the master * Update .github/workflows/code-formatting-check.yml * Bump black to 19.10b0 version * resolved incorrect merge of CONTRIBUTING, Black skipping string normalization * Minor fixes in CONTRIBUTING, two typos * Update setup.cfg * chlog Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-06-03 18:23:14 +02:00
Jirka Borovec	5d93d57573	Tests/drop macos py38 (#2061 ) * tests drop macOS py38 * ignore single test * try freeze env * drop * drop * drop * drop * drop skips * imports * fix	2020-06-03 08:38:56 -04:00
Jirka Borovec	c438d0dd90	increase acc (#2039 ) * increase acc * try 0.45 * @pytest * @pytest * try .50 * duration * pytest	2020-06-03 08:28:19 -04:00
Jirka Borovec	b4eb6ef5a1	tests drop macOS py38 (#2054 ) * tests drop macOS py38 * ignore single test * try freeze env * drop * drop * drop * drop * drop skips * drop macOS py38 * imports	2020-06-03 06:48:20 -04:00
Adrian Wälchli	8211256c46	data transfer model hook (+ refactor) (#1756 ) * refactor and added hook variant a variant b add test revert rename add changelog docs * resolve merge duplication * overridden typo * fix test * tpu id * raise if TPU not available * re-use apply_to_collection function for parsing collections * comment * make utility function available to user * documentation * move changelog entry to top * fix tpu transfer call * fix call * remove hardcoded string * improve test * call model hook by default * Apply suggestions from code review * rename utility function Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 21:45:19 -04:00
Devashish Shankar	ade3f36b7a	Raise an error when lightning replaces an existing sampler (#2020 ) * Raise an error when lightning replaces an existing sampler Currently, Trainer replaces the existing sampler with DistributedSampler if running distributing training and `replace_sampler_ddp=True` (default behaviour). If a user has configured an existing sampler, this would lead to widely different results if running a distributed vs non-distributed training. This PR fixes this by raising an Error if user has configured a sampler and uses `replace_sampler_ddp=True`. The recommended behavior from now on is to either remove the sampler or set `replace_sampler_ddp=False` * Fix tests * Simpler fix * Fix tests * Make inner method protected * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:52:04 -04:00
Ivan Nazarov	e85a646a41	Mistake in parameters' grad norm tracking (#2012 ) * fix grad norm formula * grad-norm tracker test * fixed seed and explicit rtol in grad norm tracking test * a docstring for grad-norms and forced cast to float of norm_type * support for inf-norm * renamed the grad norm test * docs * fixed language in docstring * Apply suggestions from code review Co-authored-by: Jirka <jirka@pytorchlightning.ai> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:51:09 -04:00
Adrian Wälchli	a699003e67	Update/merge multi-gpu docs (#2021 ) * merge multi-gpu docs * extend slurm docs * update links to elastic * format docs and type hints in distrib parts * reference multi-gpu/slurm in trainer args docs * fix doctest * typo * doctest * Apply suggestions from code review Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com> * wall time * Update docs/source/slurm.rst Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com> * fix title * update docs for weights summary * update changelog Co-authored-by: Lucas Vazquez <lucasgouvaz@gmail.com>	2020-06-02 18:50:08 -04:00
Udit Arora	26b69917b4	Add Open MPI installation details for horovod (#2050 )	2020-06-02 18:48:26 -04:00
Lezwon Castelino	943c4b20af	slow tpu train (#2033 ) * use parallel loader * Revert "use parallel loader" This reverts commit ed6e7583 * select tpu id for pl * condition if tpu_id is None * added info to changelog * Revert "condition if tpu_id is None" This reverts commit `1fb6e586` * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-06-02 18:48:05 -04:00
Rohit Gupta	fa696ce512	fix bug_report template (#2052 ) * fix bug_report template * article	2020-06-02 18:47:21 -04:00
Jirka Borovec	69575204f2	notes on Bug fixing (#2053 ) * import * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-06-02 18:47:03 -04:00
Boris Dayma	00f1ac11e6	fix(wandb): use same logger on multiple training loops (#2055 ) * fix(wandb): use same logger on multiple training loops New training loops reset step to 0 which would previously try to overwrite logs fix #2015 * docs(changelog.md): add reference to PR 2055	2020-06-02 18:46:02 -04:00
Rohit Gupta	0914873bc2	Fix domain_template scripts (#2014 ) * Fix domain_templates * Fix type of fake labels * type * args	2020-06-01 11:38:52 -04:00
William Falcon	82a20296e3	Replaces ddp .spawn with subprocess (#2029 ) * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * replace ddp spawn with subprocess * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix	2020-06-01 11:00:32 -04:00
Jirka Borovec	fd38f52e55	sooner CI testing (#2037 )	2020-06-01 10:21:52 -04:00
William Falcon	0be530a427	Revert "Fixes EarlyStopping With Precision=16 (#1996 )" (#2032 ) This reverts commit `bf39cb26c5`.	2020-05-31 15:20:18 -04:00
authman	bf39cb26c5	Fixes EarlyStopping With Precision=16 (#1996 ) * Patch for issue 1815, which will allow EarlyStopping to work on precision=16 * Added a whitespace to the end of the line so CICD can rerun. No reason for the latest macos test to have been cancelled. * Format.	2020-05-31 15:02:19 -04:00
Fabio Natanael Kepler	8b9b923ca8	Keep track of the best model's path saved by ModelCheckpoint (#1799 ) * Add an additional attribute to ModelCheckpoint to keep track of the best model's path Currently, only the best metric value is directly tracked. This new attribute will help in uses cases where the trained model needs to be used or tracked right after training. * Add small description and usage example to docs * Fix PEP8 issues * Fix doctest example * Fix expected output in doctest * Apply suggestions from code review * Show example as code block instead of doctest * Apply suggestions from code review * Update CHANGELOG.md * Rename `ModelCheckpoint.best` to `ModelCheckpoint.best_model_score` Also rename `ModelCheckpoint.best_model` (added in this PR) to `ModelCheckpoint.best_model_path`, for consistency, and `kth_best_model` to `kth_best_model_path`. * Update pytorch_lightning/trainer/training_io.py Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> * Add warning when loading checkpoint from an old version Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>	2020-05-31 08:47:13 -04:00
Artem Lobantsev	55fdfe3845	Bugfix/fix gan example (#2019 ) * 🐛 fixed fake example type assigning and hparams arg * fixed GAN example to work with dp, ddp., ddp_cpu * Update generative_adversarial_net.py Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-05-31 08:31:21 -04:00
William Falcon	0e37e8c4d2	hotfix to unblock hparams and OmniConf - removes auto_register_init_args by default (#2025 ) * ogc install * cleaned up tests * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix * hot fix	2020-05-31 08:29:51 -04:00
Jirka Borovec	9893681859	fix changelog (#1864 ) * fix chlog * test for #1729 * hist * update * Document use case of passing test dataloaders to Trainer.test() (#1992) * Issue 1990 Doc patch. * Codeblock directive. * Update to reflect current state of pytorch-lightning * Final grammar cleaning. I hope these commits are squashed. * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: authman <uapatira@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-31 00:48:05 -04:00
Yassine Alouini	d8dc0a7228	Few typo correction (#2011 )	2020-05-31 00:39:56 -04:00
edenafek	fdbbe96825	docs/fix_CONTRIBUTING.md (#1984 ) * Update CONTRIBUTING.md typos * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-29 23:05:43 -04:00
Justus Schock	ceecf1cea9	Graceful shutdown on python interpreter exit (#1631 ) * Fraceful shutdown on python interpreter exit * Update CHANGELOG.md * Update training_loop.py * Update training_loop.py * Update CHANGELOG.md Co-Authored-By: Jirka Borovec <Borda@users.noreply.github.com> * pep8, move to constant * Update training_loop.py * Update training_loop.py * Update training_loop.py * pep8, move to constant * pep8 * timeout Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Jirka <jirka.borovec@seznam.cz>	2020-05-29 16:20:04 +02:00
Andreas Kirsch	3af3f37d43	Add toma comments to auto_scale_batch_size (#1994 ) * Add source comments * Update training_tricks.rst	2020-05-29 05:57:50 +00:00
Tejasvi S Tomar	cd3fed03a2	Minor typo (#1987 )	2020-05-28 20:06:15 +00:00
Jirka Borovec	8ee6d91d0e	code guideline (#1949 ) * code rule * Apply suggestions from code review Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com> * chlog Co-authored-by: Jeremy Jordan <13970565+jeremyjordan@users.noreply.github.com> Co-authored-by: Nicki Skafte <skaftenicki@gmail.com>	2020-05-28 14:40:49 +00:00
Loïc Grobol	c3cf33d1de	Fix root node resolution (#1954 )	2020-05-27 22:50:37 -04:00
Jirka Borovec	df78e84060	unify tests (#1940 ) * unify tests * Apply suggestions from code review Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-05-27 22:45:23 -04:00
Ivan Nazarov	7c19c373ac	LearningRateLogger in multi-scheduler setting (#1944 ) * fixed undesired behaviour due to dict.fromkeys * a test for log length consistency * runtime-warn if no schedulers are configured * chlog * move Co-authored-by: Jirka <jirka@pytorchlightning.ai>	2020-05-27 22:44:46 -04:00
Mateusz Pieniak	3af4994d5a	Removing unecessary early stopping calls (#1863 ) * Removing unecessary early stopping calls * Update CHANGELOG.md Co-authored-by: Mateusz Pieniak <mateusz.pieniak@evidenceprime.com> Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-05-26 19:06:06 -04:00

1 2 3 4 5 ...

2469 Commits All Branches Search

2469 Commits

All Branches