lightning

Commit Graph

Author	SHA1	Message	Date
Rohit Gupta	4c7ebdc32b	Add dirpath and filename parameter in ModelCheckpoint (#4213 ) * Add dirpath and filename parameter in ModelCheckpoint * remove old function * chlog * codefactor * update tests * docs * fix doctest and added tests * pathlib dirpath * dep version and docs * try fix doctest * pep * suggestions Co-authored-by: carmocca <carlossmocholi@gmail.com> * suggestions * fix test * pep * trigger tests * Apply suggestions from code review Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com> * suggestions * try fix windows test * add and update some tests * trigger tests * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: Carlos Mocholí <carlossmocholi@gmail.com> Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-10-23 09:59:12 +05:30
William Falcon	45d05ff68d	Fixes #4141 (#4169 ) * fix val epoch agg * fix val agg metrics * fix val agg metrics * fix val agg metrics	2020-10-15 09:12:05 -04:00
William Falcon	09c2020a93	notices (#4118 )	2020-10-13 07:18:07 -04:00
William Falcon	7ffe05a3d1	ref: accelerator names (#4066 ) * ref: accelerator names * docs	2020-10-11 01:05:14 -04:00
Ananya Harsh Jha	ae8772490d	classification metrics (#4043 ) * docs + precision + recall + f_beta + refactor Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * rebase Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * fixes Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * added missing file * docs * docs * extra import Co-authored-by: Teddy Koker <teddy.koker@gmail.com>	2020-10-10 12:31:00 -04:00
Nrupatunga	fcfa587492	Bugfix/update trainer properties (#3975 ) * make current_epoch and global_step to be same as trainer, after model restore. * remove assignment here * test * minor modification * merge with parent's master * [bug-fix]: update trainer properties * minor comment fix * minor comment fix * reset train loader in `on_train_epoch_start` hook * makes sure the changes work * minor chane * update changelog * adding unit test for reload_dataloaders_every_epoch arg * modified changelog, to add PR number * revert imports * changes to unit test Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-10-08 10:20:55 -04:00
Ananya Harsh Jha	6f1a2ce517	integrate metrics API with self.log (#3961 ) * metrics integration into self.log Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * ddp and regualr test for self.log + metrics Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * pep8 * fix log tests Co-authored-by: Teddy Koker <teddy.koker@gmail.com> * docs Co-authored-by: Teddy Koker <teddy.koker@gmail.com> Co-authored-by: Teddy Koker <teddy.koker@gmail.com>	2020-10-07 22:54:32 -04:00
William Falcon	838940eee7	removing this troubling test that has random behavior (#3941 ) * threshold * threshold	2020-10-07 12:01:51 -04:00
William Falcon	71a4c61f6e	fixes #3871 (#3919 ) * fixes #3871 * ✅ tests * ✅ tests * ✅ tests * ✅ tests * ✅ tests * ✅ tests * ✅ tests * moves sync bn to each backend * moves sync bn to each backend Co-authored-by: nateraw <nxr9266@g.rit.edu>	2020-10-06 22:56:34 -04:00
Nathan Raw	1954d7c87a	Write predictions in LightningModule instead of EvalResult (#3882 ) * ✨ add self.write_prediction * ✨ add self.write_prediction_dict to lightning module	2020-10-05 18:04:02 -04:00
William Falcon	0fb8c54fda	remove deprecated test (#3820 )	2020-10-03 13:21:10 -04:00
William Falcon	d9bc95f83e	ref: bug fix with logging val epoch end + monitor (#3812 ) * ref: fix metric err * ref: fix metric err * ref: fix metric err * ref: merge * ref: merge * ref: merge * ref: merge * ref: decoupled ddp2 * ref: decoupled ddp2 * ref: decoupled ddp2 * ref: decoupled ddp2 * ref: decoupled ddp2 * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix * ref: clean up ddp before final fix	2020-10-03 12:33:29 -04:00
William Falcon	a38d108a68	add dist lib to enable syncing anything across devices (#3762 ) * add dist lib to enable syncing anything across devices	2020-10-01 01:21:38 -04:00
ananthsub	3dcf7130c5	Support checkpoint hooks on data module (#3563 ) * Split out changes from #3563 to make that PR easier to review. This formats the file according to the Black formatter * Store a reference to the trainer on the datamodule Fixes #3682 * Update data_connector.py * Update data_connector.py * Update test_datamodules.py * Split out changes from #3563 to make that PR easier to review. This formats the file according to the Black formatter * support checkpoint hooks for datamodule refactor on_{save/load}_checkpoint to a separate hook class that both the lightning module and data module inherit add spots in callback connector to call new datamodule hooks if available * hooks formatting * Update hooks.py * Update checkpoint_connector.py * Update lightning.py * update based on upstream/master checkout upstream/master * Update checkpoint_connector.py * add tests * undo format revert * Updated CHANGELOG.md * add checkpoint hooks * add Dict type * import CheckpointHooks	2020-09-29 19:51:44 +02:00
William Falcon	931995b55b	remove flake 8 (#3687 )	2020-09-27 20:40:02 -04:00
ananthsub	94c79bb3ba	Add a reference to the Trainer on the LightningDataModule (#3684 ) * Split out changes from #3563 to make that PR easier to review. This formats the file according to the Black formatter * Store a reference to the trainer on the datamodule Fixes #3682 * Update data_connector.py * Update data_connector.py * Update test_datamodules.py	2020-09-27 19:48:01 -04:00
William Falcon	d79bce1dff	enable None model checkpoint default (#3669 ) * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default * enable None model checkpoint default	2020-09-26 23:14:04 -04:00
Antoine Broyelle	17c8c95fbc	Wrap prepare_data and setup only once inside DataModule (#3654 ) Fix #3652	2020-09-25 07:09:50 -04:00
Adrian Wälchli	3affa0e49a	use tmpdir in tests when writing predictions to disk (#3561 ) * save to tmpdir * path	2020-09-23 07:44:15 -04:00
William Falcon	21cfdf6874	ref: result 1/n (make monitor default to checkpoint_on to simplify re… (#3571 ) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * Update pytorch_lightning/callbacks/model_checkpoint.py Co-authored-by: ananthsub <ananth.subramaniam@gmail.com> * ref: result 1/n (make monitor default to checkpoint_on to simplify result syntax) * force crash when max_epochs < epochs in a checkpoint Co-authored-by: ananthsub <ananth.subramaniam@gmail.com>	2020-09-20 22:58:43 -04:00
William Falcon	722c44c7d0	ref: device to gpus (#3405 ) * ref: device to gpus * ref: device to gpus * ref: device to gpus * ref: device to gpus * ref: device to gpus	2020-09-08 22:14:17 -04:00
William Falcon	0b5b70d6c9	ref: inner train loop (intermediate step) 17/n (#3376 ) * ref: inner train loop (intermediate step) 17/n * ref: inner train loop (intermediate step) 17/n * ref: inner train loop (intermediate step) 17/n	2020-09-07 09:31:42 -04:00
William Falcon	7073de8a95	ref: inner train loop (intermediate step) 14/n (#3373 ) * ref: inner train loop (intermediate step) 14/n * ref: inner train loop (intermediate step) 14/n	2020-09-06 19:55:18 -04:00
William Falcon	7d57f8d407	ref: move prepare_data to data connector (#3307 ) * ref: moved argparse code to central class * ref: moved argparse code to central class * ref: moved argparse code to central class	2020-09-01 14:59:09 -04:00
William Falcon	caf7893f27	ref: modular is_overridden (#3290 ) * ref: modular is_overridden * ref: modular is_overridden * ref: modular is_overridden * ref: modular is_overridden	2020-08-31 12:12:02 -04:00
William Falcon	8d7ca5cd2c	ref: refactored gpu backend __step (#3120 ) * refactored gpu backend __step * refactored gpu backend __step * refactored gpu backend __step * refactored gpu backend __step	2020-08-24 09:22:05 -04:00
Nathan Raw	bab89b8d21	Add transfer_batch_to_device hook to DataModule (#3038 ) * ✨ add dm to_device logic in trainer * 🔥 remove unnecessary comment * ✨ add to_device logic to datamodule * ✅ add test * updated docs Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-08-20 08:47:11 -04:00
Justus Schock	7358d456f3	Retrieve last logged val from result by key (#3049 ) * return last logged value * Update test_results.py * Update step_result.py * Update step_result.py * pep8 * pep8	2020-08-19 18:59:14 -04:00
Adrian Wälchli	9031dc3b81	Fix result gathering with varying tensor shapes (#3020 ) * test for gethering results * fix gather * document tests * changelog * assert dtype * default to concat * additional test	2020-08-18 20:27:48 -04:00
Nathan Raw	b9695237f1	Save test predictions on multiple GPUs (#2926 ) * Save test predictions on multiple GPUs	2020-08-14 17:52:43 -04:00
Jirka Borovec	665c1507f0	deterministic=True (#2944 )	2020-08-13 06:29:27 -04:00
William Falcon	d13e5c9e53	document lightiningmodule better (#2920 ) * updated docs	2020-08-11 19:39:43 -04:00
Jirka Borovec	f8c058215f	simplify tests & cleaning (#2588 ) * simplify * tmpdir * revert * clean * accel * types * test * edit test acc Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com> * Update test acc Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>	2020-08-07 23:22:05 +02:00
Nicki Skafte	9a402461da	Bugfix: Lr finder and hparams compatibility (#2821 ) * fix hparams lr finder bug * add tests for new functions * better tests * fix codefactor * fix styling * fix tests * fix codefactor * Apply suggestions from code review * modified hook Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: William Falcon <waf2107@columbia.edu> Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>	2020-08-07 00:34:48 +02:00
Jirka Borovec	ed3ee982b3	clean tests imports (#2834 )	2020-08-06 16:58:51 +02:00
Justus Schock	fe29c53ab5	add ddp sync for logging in result step (#2822 ) * add ddp sync for logging in result step * pep8 * pep8 * make ddp tests run also on cpu (except windowws) * create class instance in ddp test * revert automated formatting * pep8	2020-08-05 20:42:09 -04:00
William Falcon	b507c42c47	clarify batch hooks (#2842 ) * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook * modified hook	2020-08-05 20:01:30 -04:00
Nathan Raw	036bcea499	Call DataModule hooks implicitly in trainer (#2755 ) * ✨ call dm hooks in trainer implicitly * ✅ update tests * 📝 remove unused stage arg from dm docs * ✅ update tests * ✅ update tests * 🚧 include stage in datamodule.setup * 📝 docs * 📝 docs * added more dm tests * added more dm tests * 🐛 call dm.setup everywhere * 🔥 pickle tests now implied by accelerator tests * 🎨 set dm as attr of trainer * 🐛 . * 🚧 wip * add can prepare test * add can prepare test * verified setup in fit * fixed setup call * fixed setup call * fixed setup call Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-08-01 20:17:57 -04:00
Nathan Raw	9076551aec	Enable val/test loop disabling + datamodule tests (#2692 ) * 🎨 warn instead of error out on loaders * 🐛 test misconfiguration should still fail * 🚧 . * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj * updated docs with new result obj Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-07-25 12:57:40 -04:00
Adrian Wälchli	6bfcfa8671	fix dtype conversion of example_input_array in model summary (#2510 ) * fix dtype conversion * changelog	2020-07-05 07:17:22 -04:00
Adrian Wälchli	f972ab3a82	Fix summary hook handles not getting removed (#2298 ) * detach hooks after completion * detach hook * update docs * add test * docs * changelog	2020-06-20 07:38:47 -04:00
Jirka Borovec	db7bb4c348	cleaning tests (#2201 )	2020-06-15 22:03:40 -04:00
Adrian Wälchli	7dc58bd286	Refactor model summary + generalize example input array (#1773 ) * squash variant a variant b add test revert rename add changelog docs move changelog entry to top use hooks wip wipp layer summary clean up, refactor type hints rename remove obsolete code rename unused imports simplify formatting of table and increase readability doctest superclass object update examples print unknown sizes more docs and doctest testing unknown layers add rnn test remove main restore train mode test device wip device constant simplify model forward transfer return summary object in method extend tests fix summary for empty module extend tests refactor and added hook variant a variant b add test revert rename add changelog docs move changelog entry to top remove hardcoded string simplify test unknown shapes and all others comments for tests fix hparams attribute * update default * unused import * clean up * replace hardcoded strings * fix doctest * fix top/full * black * fix rnn test * fix rnn * update debugging docs update docs typo update docs update docs * add changelog * extract constant * setter and getter * move parity models to test folder * parameterize mode	2020-06-15 17:05:58 -04:00
Adrian Wälchli	22d9464e56	HenryJia: auto-move data decorator (#1905 ) * First attempt at auto-moving data for inference * Correct my copypaste errors * Correct for if device is CPU * Get rid of the WIP code I accidentally added * Add tests * Make tests more foolproof * Make sure we stick with pep8 formatting * Clarify docs a little * Apply suggestions from code review * Get everything working again hopefully * refactor and added hook variant a variant b add test revert rename add changelog docs * move changelog entry to top * Move data transfer to utilities * Add back in warnings for autotransfer * Get rid of the test code I ended up accidentally commiting again * Add docs any changelog * Correct PR number in Changelog * Correct changelog * Update data.py * Update test_cpu.py * make a decorator * type hint * changelog * changelog * remove old function * import * test for decorator * fix test * remove old test * doctest * apply decorator directly * convert doctest to code block * prevent side effects in tests * fix merge * update forward docs * update docs * added docs in section "deployment / prediction" * update changelog Co-authored-by: Hengjian Jia <henryjia18@gmail.com> Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com> Co-authored-by: William Falcon <waf2107@columbia.edu>	2020-06-15 17:04:32 -04:00

44 Commits