Commit Graph

2055 Commits

Author SHA1 Message Date
mehrad f3545ad9d3 add max-margin loss 2019-02-19 13:48:52 -08:00
Mehrad Moradshahi f2335e456b Merge pull request #1 from Stanford-Mobisocial-IoT-Lab/wip/field
Remove hacks needed to use a different tokenizer for Almond
2019-01-24 10:44:33 -08:00
mehrad e4afb21928 minor fix 2019-01-23 19:13:26 -08:00
mehrad 8c54dc1391 updates and fixes 2019-01-23 16:41:37 -08:00
Giovanni Campagna fdfdd154c4 Remove hacks needed to use a different tokenizer for Almond
Almond data is always pretokenized (because we must preprocess
numbers/quoted strings/etc.).

Previously, we used a bad HACK of hardcoding almond in the generic
Field subclass. Now, we instead thread through the tokenizer/detokenizer
arguments from the right places.

In the future, we will probably want task classes, to clean up
the mess of hacks and hardcoded task-specific tweaks everywhere.
2019-01-23 10:48:49 -08:00
Giovanni Campagna 91ad08211f Fix duplicate elmo option 2019-01-23 10:45:30 -08:00
Giovanni Campagna 9ec8d7cf2d Merge remote-tracking branch 'upstream/master' 2019-01-23 10:44:20 -08:00
Giovanni Campagna 09bcae5dd5 Merge branch 'master' into mehrad/master 2019-01-23 10:34:17 -08:00
Bryan McCann cd997f257f prepend root before save 2019-01-10 21:24:43 +00:00
Bryan McCann eed545bbf7 log_dir needs root prefix 2019-01-09 18:06:04 +00:00
Bryan McCann 81b7ea7e72 options for sgd 2019-01-09 01:13:46 +00:00
Bryan McCann 27bb192249 unique val tasks 2019-01-09 00:28:11 +00:00
Bryan McCann 287b911d9c separate out root dir; add option for experiment name 2019-01-08 02:05:55 +00:00
mehrad 873236860a fix cuda error 2018-12-18 14:56:29 -08:00
mehrad b4bb1ad661 working version of differentiable BLEU loss 2018-12-17 16:43:06 -08:00
Bryan McCann 07dd886f9a Update README.md 2018-12-12 12:31:44 -08:00
Bryan Marcus McCann c096a1f5ba bugs in overwrite; new best mqan model 2018-12-12 20:29:46 +00:00
Bryan Marcus McCann 1dc5f7d28e moving schema raw files to s3; updated pretrained models 2018-12-10 21:52:34 +00:00
Bryan Marcus McCann 1bd922b5bc back compat predict.py 2018-12-10 19:41:27 +00:00
Bryan Marcus McCann 9a4493b655 more dynamic elmo use cases 2018-12-10 19:41:27 +00:00
Bryan Marcus McCann 37003d2a0a elmo works better w/o dropout 2018-12-10 19:41:27 +00:00
Bryan Marcus McCann 6260c9f527 adding elmo naively as a replacement 2018-12-10 19:41:27 +00:00
Bryan Marcus McCann 3aecae06d9 better to have tensorboard logging on as default 2018-12-10 19:41:27 +00:00
Mehrad0711 a3ede17b3e bug fixes 2018-12-06 08:29:11 +00:00
mehrad e8d8ae0eab add ELMo embeddings 2018-12-05 17:07:07 -08:00
mehrad a9b48d2b97 update error analysis 2018-11-29 14:48:45 -08:00
mehrad 787b0887dc adding device accuracy 2018-11-29 10:17:25 -08:00
mehrad cd4782574d more updates 2018-11-28 18:04:04 -08:00
mehrad 827e27d472 updates on post_processing 2018-11-28 18:00:05 -08:00
mehrad 616b2833d3 updates 2018-11-28 16:36:23 -08:00
Damini Satya 6b31d8f4cb Remove CUDA from the command to build docker image 2018-11-28 07:40:44 -08:00
mehrad 17c09d977b Add differentiable BLEU loss
- Using differentiable BLEU loss instead of cross_entropy loss
- It helps decrease the train-test evaluation gap
2018-11-27 15:22:38 -08:00
Bryan McCann 39720d0713 Update README.md 2018-11-16 18:31:36 -08:00
Bryan McCann 42f1bad639 Update README.md 2018-11-16 18:23:23 -08:00
Bryan McCann ae8d64b898 CoVe results 2018-11-16 18:20:57 -08:00
Bryan Marcus McCann fe580047ce better predict; better MQAN+CoVe 2018-11-17 02:17:29 +00:00
mehrad 358ed621fd bug fixes 2018-11-08 13:52:48 -08:00
mehrad f9cba2b427 Updates
1) Use CPU automatically if cuda is not available
2) Use a different directory to store eval results
2018-11-08 13:23:12 -08:00
mehrad ca5818dcf1 Getting the best from both sides
The branches are getting messy! This branch is a clean, updated version of the codebase that is compatible with 1) the new version of PyTorch and 2) the Almond project
2018-11-07 15:06:41 -08:00
mehrad ccca17c071 Merge branch 'master' of https://github.com/Stanford-Mobisocial-IoT-Lab/decaNLP 2018-11-07 11:23:27 -08:00
mehrad dce20da5d2 updates 2018-11-07 11:23:06 -08:00
mehrad 9c5e7c31db updates 2018-11-06 14:13:27 -08:00
Bryan McCann 9447f41075 repairing docker commands in readme 2018-10-26 11:13:13 -07:00
Bryan McCann 72aeb19ec3 Update README.md 2018-10-25 16:31:35 -07:00
Victoria X Lin c0e4f7083a Fix embedding layer implementation
Embedding dropout parameter is not used
2018-10-25 16:04:40 -07:00
Bryan Marcus McCann 4523bcabe8 getting rid of .data in models/common.py 2018-10-25 16:02:48 -07:00
Bryan Marcus McCann 2c837eaf6e ptr-gen mem efficient dist; only trainable params 2018-10-25 16:02:48 -07:00
Bryan Marcus McCann 754d76f112 pg self_attended_context in greedy 2018-10-25 16:02:48 -07:00
Bryan McCann f835558916 Reducing train_batch_tokens; 10k OOMs on 0.4.1 2018-10-25 16:02:48 -07:00
Bryan Marcus McCann 38ce8ac161 mv save out of val; add arg for no tensorboard logging 2018-10-25 16:02:48 -07:00