genienlp

Commit Graph

Author	SHA1	Message	Date
mehrad	f3545ad9d3	add max-margin loss	2019-02-19 13:48:52 -08:00
Mehrad Moradshahi	f2335e456b	Merge pull request #1 from Stanford-Mobisocial-IoT-Lab/wip/field Remove hacks needed to use a different tokenizer for Almond	2019-01-24 10:44:33 -08:00
mehrad	e4afb21928	minor fix	2019-01-23 19:13:26 -08:00
mehrad	8c54dc1391	updates and fixes	2019-01-23 16:41:37 -08:00
Giovanni Campagna	fdfdd154c4	Remove hacks needed to use a different tokenizer for Almond Almond data is always pretokenized (because we must preprocess numbers/quoted strings/etc.). Previously, we used a bad HACK of hardcoding almond in the generic Field subclass. Now, we instead thread through the tokenizer/detokenizer arguments from the right places. In the future, we will probably want task classes, to clean up the mess of hacks and hardcoded task-specific tweaks everywhere.	2019-01-23 10:48:49 -08:00
Giovanni Campagna	91ad08211f	Fix duplicate elmo option	2019-01-23 10:45:30 -08:00
Giovanni Campagna	9ec8d7cf2d	Merge remote-tracking branch 'upstream/master'	2019-01-23 10:44:20 -08:00
Giovanni Campagna	09bcae5dd5	Merge branch 'master' into mehrad/master	2019-01-23 10:34:17 -08:00
Bryan McCann	cd997f257f	prepend root before save	2019-01-10 21:24:43 +00:00
Bryan McCann	eed545bbf7	log_dir needs root prefix	2019-01-09 18:06:04 +00:00
Bryan McCann	81b7ea7e72	options for sgd	2019-01-09 01:13:46 +00:00
Bryan McCann	27bb192249	unique val tasks	2019-01-09 00:28:11 +00:00
Bryan McCann	287b911d9c	separate out root dir; add option for experiment name	2019-01-08 02:05:55 +00:00
mehrad	873236860a	fix cuda error	2018-12-18 14:56:29 -08:00
mehrad	b4bb1ad661	working version of differentiable BLEU loss	2018-12-17 16:43:06 -08:00
Bryan McCann	07dd886f9a	Update README.md	2018-12-12 12:31:44 -08:00
Bryan Marcus McCann	c096a1f5ba	bugs in overwrite; new best mqan model	2018-12-12 20:29:46 +00:00
Bryan Marcus McCann	1dc5f7d28e	moving schema raw files to s3; updated pretrained models	2018-12-10 21:52:34 +00:00
Bryan Marcus McCann	1bd922b5bc	back compat predict.py	2018-12-10 19:41:27 +00:00
Bryan Marcus McCann	9a4493b655	more dynamic elmo use cases	2018-12-10 19:41:27 +00:00
Bryan Marcus McCann	37003d2a0a	elmo works better w/o dropout	2018-12-10 19:41:27 +00:00
Bryan Marcus McCann	6260c9f527	adding elmo naively as a replacement	2018-12-10 19:41:27 +00:00
Bryan Marcus McCann	3aecae06d9	better to have tensorboard logging on as default	2018-12-10 19:41:27 +00:00
Mehrad0711	a3ede17b3e	bug fixes	2018-12-06 08:29:11 +00:00
mehrad	e8d8ae0eab	add ELMo embeddings	2018-12-05 17:07:07 -08:00
mehrad	a9b48d2b97	update error analysis	2018-11-29 14:48:45 -08:00
mehrad	787b0887dc	adding device accuracy	2018-11-29 10:17:25 -08:00
mehrad	cd4782574d	more updates	2018-11-28 18:04:04 -08:00
mehrad	827e27d472	updates on post_processing	2018-11-28 18:00:05 -08:00
mehrad	616b2833d3	updates	2018-11-28 16:36:23 -08:00
Damini Satya	6b31d8f4cb	Remove CUDA from the command to build docker image	2018-11-28 07:40:44 -08:00
mehrad	17c09d977b	Add differentiable BLEU loss -Using differentiable BLEU loss instead of cross_entropy loss -it helps decreasing train-test evaluation gap	2018-11-27 15:22:38 -08:00
Bryan McCann	39720d0713	Update README.md	2018-11-16 18:31:36 -08:00
Bryan McCann	42f1bad639	Update README.md	2018-11-16 18:23:23 -08:00
Bryan McCann	ae8d64b898	CoVe results	2018-11-16 18:20:57 -08:00
Bryan Marcus McCann	fe580047ce	better predict; better MQAN+CoVe	2018-11-17 02:17:29 +00:00
mehrad	358ed621fd	bug fixes	2018-11-08 13:52:48 -08:00
mehrad	f9cba2b427	Updates 1) Use CPU automatically if cuda is not available 2) Use a different directory to store eval results	2018-11-08 13:23:12 -08:00
mehrad	ca5818dcf1	Getting the best from both sides The branches are getting messy! This branch is a clean updated version of the codebase which is actually compatible with 1) new version of pytorch 2) Almond project	2018-11-07 15:06:41 -08:00
mehrad	ccca17c071	Merge branch 'master' of https://github.com/Stanford-Mobisocial-IoT-Lab/decaNLP	2018-11-07 11:23:27 -08:00
mehrad	dce20da5d2	updates	2018-11-07 11:23:06 -08:00
mehrad	9c5e7c31db	updates	2018-11-06 14:13:27 -08:00
Bryan McCann	9447f41075	repairing docker commands in readme	2018-10-26 11:13:13 -07:00
Bryan McCann	72aeb19ec3	Update README.md	2018-10-25 16:31:35 -07:00
Victoria X Lin	c0e4f7083a	Fix embedding layer implementation Embedding dropout parameter is not used	2018-10-25 16:04:40 -07:00
Bryan Marcus McCann	4523bcabe8	getting rid of .data in models/common.py	2018-10-25 16:02:48 -07:00
Bryan Marcus McCann	2c837eaf6e	ptr-gen mem efficient dist; only trainable params	2018-10-25 16:02:48 -07:00
Bryan Marcus McCann	754d76f112	pg self_attended_context in greedy	2018-10-25 16:02:48 -07:00
Bryan McCann	f835558916	Reducing train_batch_tokens; 10k OOMs on 0.4.1	2018-10-25 16:02:48 -07:00
Bryan Marcus McCann	38ce8ac161	mv save out of val; add arg for no tensorboard logging	2018-10-25 16:02:48 -07:00

... 37 38 39 40 41 ...

2055 Commits All Branches Search

2055 Commits

All Branches