Giovanni Campagna
9a9b77ec17
Almond: load data from the TSV files directly
...
Almond is not a translation task, it's a semantic parsing task.
And it has its own established on-disk file format, and we should
respect that, instead of messing with .tt files
2019-03-01 09:36:14 -08:00
Giovanni Campagna
73d94f187b
SummaryWriter: replace add_scalars with add_scalar
...
tensorboardX's add_scalars is bonkers, and will create a different
file for each tag, which makes the tensorboard completely unreadable
(See https://github.com/lanpa/tensorboardX/issues/366 )
Instead, use add_scalar(), which puts everything in the same file,
as tensorboard is designed to do.
2019-03-01 08:44:06 -08:00
Giovanni Campagna
ae4bbfea3a
arguments: remove bad ugly log dir messing
...
Stick with what the user says as the --save directory. If the user
needs, they can provide the timestamp on the command line.
2019-03-01 08:30:35 -08:00
mehrad
7738b66e72
updates
2019-02-28 19:52:13 -08:00
mehrad
d2656bb53a
update post_process_decoded_results.py
2019-02-27 12:01:42 -08:00
mehrad
26f57a3059
don't shuffle dataset during prediction
2019-02-27 11:17:38 -08:00
mehrad
eff58ba1eb
merge updates
2019-02-27 10:54:01 -08:00
mehrad
2ef63c660e
export results to csv files
2019-02-27 10:52:38 -08:00
mehrad
fede42e471
add cached option for predict
2019-02-20 11:22:32 -08:00
mehrad
b8f4c1b0e5
fix max-margin loss implementation
2019-02-20 11:06:49 -08:00
mehrad
f47da57330
fix bug for joining paths
2019-02-19 16:21:34 -08:00
mehrad
ad5c9c6dc1
save cached files to an assigned path
...
useful when you don't have write permissions to dataset directory
2019-02-19 15:55:20 -08:00
mehrad
f3545ad9d3
add max-margin loss
2019-02-19 13:48:52 -08:00
Mehrad Moradshahi
f2335e456b
Merge pull request #1 from Stanford-Mobisocial-IoT-Lab/wip/field
...
Remove hacks needed to use a different tokenizer for Almond
2019-01-24 10:44:33 -08:00
mehrad
e4afb21928
minor fix
2019-01-23 19:13:26 -08:00
mehrad
8c54dc1391
updates and fixes
2019-01-23 16:41:37 -08:00
Giovanni Campagna
fdfdd154c4
Remove hacks needed to use a different tokenizer for Almond
...
Almond data is always pretokenized (because we must preprocess
numbers/quoted strings/etc.).
Previously, we used a bad HACK of hardcoding almond in the generic
Field subclass. Now, we instead thread through the tokenizer/detokenizer
arguments from the right places.
In the future, we will probably want task classes, to clean up
the mess of hacks and hardcoded task-specific tweaks everywhere.
2019-01-23 10:48:49 -08:00
Giovanni Campagna
91ad08211f
Fix duplicate elmo option
2019-01-23 10:45:30 -08:00
Giovanni Campagna
9ec8d7cf2d
Merge remote-tracking branch 'upstream/master'
2019-01-23 10:44:20 -08:00
Giovanni Campagna
09bcae5dd5
Merge branch 'master' into mehrad/master
2019-01-23 10:34:17 -08:00
Bryan McCann
cd997f257f
prepend root before save
2019-01-10 21:24:43 +00:00
Bryan McCann
eed545bbf7
log_dir needs root prefix
2019-01-09 18:06:04 +00:00
Bryan McCann
81b7ea7e72
options for sgd
2019-01-09 01:13:46 +00:00
Bryan McCann
27bb192249
unique val tasks
2019-01-09 00:28:11 +00:00
Bryan McCann
287b911d9c
separate out root dir; add option for experiment name
2019-01-08 02:05:55 +00:00
mehrad
873236860a
fix cuda error
2018-12-18 14:56:29 -08:00
mehrad
b4bb1ad661
working version of differentiable BLEU loss
2018-12-17 16:43:06 -08:00
Bryan McCann
07dd886f9a
Update README.md
2018-12-12 12:31:44 -08:00
Bryan Marcus McCann
c096a1f5ba
bugs in overwrite; new best mqan model
2018-12-12 20:29:46 +00:00
Bryan Marcus McCann
1dc5f7d28e
moving schema raw files to s3; updated pretrained models
2018-12-10 21:52:34 +00:00
Bryan Marcus McCann
1bd922b5bc
back compat predict.py
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
9a4493b655
more dynamic elmo use cases
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
37003d2a0a
elmo works better w/o dropout
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
6260c9f527
adding elmo naively as a replacement
2018-12-10 19:41:27 +00:00
Bryan Marcus McCann
3aecae06d9
better to have tensorboard logging on as default
2018-12-10 19:41:27 +00:00
Mehrad0711
a3ede17b3e
bug fixes
2018-12-06 08:29:11 +00:00
mehrad
e8d8ae0eab
add ELMo embeddings
2018-12-05 17:07:07 -08:00
mehrad
a9b48d2b97
update error analysis
2018-11-29 14:48:45 -08:00
mehrad
787b0887dc
adding device accuracy
2018-11-29 10:17:25 -08:00
mehrad
cd4782574d
more updates
2018-11-28 18:04:04 -08:00
mehrad
827e27d472
updates on post_processing
2018-11-28 18:00:05 -08:00
mehrad
616b2833d3
updates
2018-11-28 16:36:23 -08:00
Damini Satya
6b31d8f4cb
Remove CUDA from the command to build docker image
2018-11-28 07:40:44 -08:00
mehrad
17c09d977b
Add differentiable BLEU loss
...
-Using differentiable BLEU loss instead of cross_entropy loss
-it helps decreasing train-test evaluation gap
2018-11-27 15:22:38 -08:00
Bryan McCann
39720d0713
Update README.md
2018-11-16 18:31:36 -08:00
Bryan McCann
42f1bad639
Update README.md
2018-11-16 18:23:23 -08:00
Bryan McCann
ae8d64b898
CoVe results
2018-11-16 18:20:57 -08:00
Bryan Marcus McCann
fe580047ce
better predict; better MQAN+CoVe
2018-11-17 02:17:29 +00:00
mehrad
358ed621fd
bug fixes
2018-11-08 13:52:48 -08:00
mehrad
f9cba2b427
Updates
...
1) Use CPU automatically if cuda is not available
2) Use a different directory to store eval results
2018-11-08 13:23:12 -08:00