mehrad
eb10a788b0
updates
2019-05-28 18:12:31 -07:00
mehrad
7db36a90af
fixes
2019-05-20 14:29:25 -07:00
mehrad
19067c71ba
option to retrain encoder embeddings
2019-04-15 16:11:10 -07:00
mehrad
4905ad6ce8
Fixes
...
Apparently layer norm implementation can't be tampered with!
Reverting the change for now and switching to a new branch for truly fixing this.
2019-04-08 17:24:02 -07:00
mehrad
a7a2d752d2
Fixes
...
std() in layer normalization is the culprit for generating NAN.
It happens in the backward pass for values with zero variance.
Just update the mean for these batches.
2019-04-08 14:48:23 -07:00
mehrad
4acdba6c22
fix for NAN loss
2019-04-05 10:26:35 -07:00
Giovanni Campagna
6a97970b13
fix typo
2019-03-27 12:46:20 -07:00
Giovanni Campagna
9814d6bf4f
Implement using a pretrained language model for the decoder embedding
...
Let's see if it makes a difference
2019-03-27 11:40:59 -07:00
Giovanni Campagna
cea6092f90
Fix evaluating
...
- fix loading old config.json files that are missing some parameters
- fix expanding the trained embedding
- add a default context for "almond_with_thingpedia_as_context"
(to include thingpedia)
- fix handling empty sentences
2019-03-23 17:28:22 -07:00
Giovanni Campagna
799d8c4993
fix syntax
2019-03-19 20:40:01 -07:00
Giovanni Campagna
a3cf02cbe7
Add a way to disable glove embeddings on the decoder side
...
With grammar, they just add noise and overfit badly
2019-03-19 20:36:20 -07:00
Giovanni Campagna
d67ef67fb8
Fix
2019-03-19 17:49:42 -07:00
Giovanni Campagna
2769cc96e3
Add the option to train a portion of decoder embeddings
...
This will be needed because GloVe/char embeddings are meaningless
for tokens that encode grammar productions (which are of the form
"R<id>")
2019-03-19 17:31:53 -07:00
Giovanni Campagna
1a2a4a9ea9
One more stanford copyright
2019-03-01 16:18:10 -08:00
Giovanni Campagna
8e2b519ac3
Add copyright notices to all files
...
Makes the license clear and explicit
2019-03-01 15:51:45 -08:00
Giovanni Campagna
41b80bb4f4
Move all python files to a decanlp/ package
...
As per python conventions
2019-03-01 15:43:02 -08:00