mehrad
f3545ad9d3
add max-margin loss
2019-02-19 13:48:52 -08:00
Mehrad Moradshahi
f2335e456b
Merge pull request #1 from Stanford-Mobisocial-IoT-Lab/wip/field
...
Remove hacks needed to use a different tokenizer for Almond
2019-01-24 10:44:33 -08:00
mehrad
e4afb21928
minor fix
2019-01-23 19:13:26 -08:00
mehrad
8c54dc1391
updates and fixes
2019-01-23 16:41:37 -08:00
Giovanni Campagna
fdfdd154c4
Remove hacks needed to use a different tokenizer for Almond
...
Almond data is always pretokenized (because we must preprocess
numbers/quoted strings/etc.).
Previously, we used a bad HACK of hardcoding almond in the generic
Field subclass. Now, we instead thread through the tokenizer/detokenizer
arguments from the right places.
In the future, we will probably want task classes, to clean up
the mess of hacks and hardcoded task-specific tweaks everywhere.
2019-01-23 10:48:49 -08:00
Giovanni Campagna
91ad08211f
Fix duplicate elmo option
2019-01-23 10:45:30 -08:00
Giovanni Campagna
9ec8d7cf2d
Merge remote-tracking branch 'upstream/master'
2019-01-23 10:44:20 -08:00
Giovanni Campagna
09bcae5dd5
Merge branch 'master' into mehrad/master
2019-01-23 10:34:17 -08:00
Bryan McCann
cd997f257f
prepend root before save
2019-01-10 21:24:43 +00:00
Bryan McCann
eed545bbf7
log_dir needs root prefix
2019-01-09 18:06:04 +00:00
Bryan McCann
81b7ea7e72
options for sgd
2019-01-09 01:13:46 +00:00
Bryan McCann
27bb192249
unique val tasks
2019-01-09 00:28:11 +00:00
Bryan McCann
287b911d9c
separate out root dir; add option for experiment name
2019-01-08 02:05:55 +00:00
mehrad
873236860a
fix cuda error
2018-12-18 14:56:29 -08:00
mehrad
b4bb1ad661
working version of differentiable BLEU loss
2018-12-17 16:43:06 -08:00
Bryan McCann
07dd886f9a
Update README.md
2018-12-12 12:31:44 -08:00
Bryan Marcus McCann
c096a1f5ba
bugs in overwrite; new best mqan model
2018-12-12 20:29:46 +00:00
Bryan Marcus McCann
1dc5f7d28e
moving schema raw files to s3; updated pretrained models
2018-12-10 21:52:34 +00:00
Bryan Marcus McCann
1bd922b5bc
back compat predict.py
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
9a4493b655
more dynamic elmo use cases
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
37003d2a0a
elmo works better w/o dropout
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
6260c9f527
adding elmo naively as a replacement
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
3aecae06d9
better to have tensorboard logging on as default
2018-12-10 19:41:27 +00:00
Mehrad0711
a3ede17b3e
bug fixes
2018-12-06 08:29:11 +00:00
mehrad
e8d8ae0eab
add ELMo embeddings
2018-12-05 17:07:07 -08:00
mehrad
a9b48d2b97
update error analysis
2018-11-29 14:48:45 -08:00
mehrad
787b0887dc
adding device accuracy
2018-11-29 10:17:25 -08:00
mehrad
cd4782574d
more updates
2018-11-28 18:04:04 -08:00
mehrad
827e27d472
updates on post_processing
2018-11-28 18:00:05 -08:00
mehrad
616b2833d3
updates
2018-11-28 16:36:23 -08:00
Damini Satya
6b31d8f4cb
Remove CUDA from the command to build docker image
2018-11-28 07:40:44 -08:00
mehrad
17c09d977b
Add differentiable BLEU loss
...
-Using differentiable BLEU loss instead of cross_entropy loss
-it helps decreasing train-test evaluation gap
2018-11-27 15:22:38 -08:00
Bryan McCann
39720d0713
Update README.md
2018-11-16 18:31:36 -08:00
Bryan McCann
42f1bad639
Update README.md
2018-11-16 18:23:23 -08:00
Bryan McCann
ae8d64b898
CoVe results
2018-11-16 18:20:57 -08:00
Bryan Marcus McCann
fe580047ce
better predict; better MQAN+CoVe
2018-11-17 02:17:29 +00:00
mehrad
358ed621fd
bug fixes
2018-11-08 13:52:48 -08:00
mehrad
f9cba2b427
Updates
...
1) Use CPU automatically if cuda is not available
2) Use a different directory to store eval results
2018-11-08 13:23:12 -08:00
mehrad
ca5818dcf1
Getting the best from both sides
...
The branches are getting messy! This branch is a clean updated version of the codebase which is actually compatible with 1) new version of pytorch 2) Almond project
2018-11-07 15:06:41 -08:00
mehrad
ccca17c071
Merge branch 'master' of https://github.com/Stanford-Mobisocial-IoT-Lab/decaNLP
2018-11-07 11:23:27 -08:00
mehrad
dce20da5d2
updates
2018-11-07 11:23:06 -08:00
mehrad
9c5e7c31db
updates
2018-11-06 14:13:27 -08:00
Bryan McCann
9447f41075
repairing docker commands in readme
2018-10-26 11:13:13 -07:00
Bryan McCann
72aeb19ec3
Update README.md
2018-10-25 16:31:35 -07:00
Victoria X Lin
c0e4f7083a
Fix embedding layer implementation
...
Embedding dropout parameter is not used
2018-10-25 16:04:40 -07:00
Bryan Marcus McCann
4523bcabe8
getting rid of .data in models/common.py
2018-10-25 16:02:48 -07:00
Bryan Marcus McCann
2c837eaf6e
ptr-gen mem efficient dist; only trainable params
2018-10-25 16:02:48 -07:00
Bryan Marcus McCann
754d76f112
pg self_attended_context in greedy
2018-10-25 16:02:48 -07:00
Bryan McCann
f835558916
Reducing train_batch_tokens; 10k OOMs on 0.4.1
2018-10-25 16:02:48 -07:00
Bryan Marcus McCann
38ce8ac161
mv save out of val; add arg for no tensorboard logging
2018-10-25 16:02:48 -07:00