mehrad
16e7cac99d
Add tests for sts filtering
2021-01-03 11:54:07 -08:00
Giovanni Campagna
f283774bb9
Merge pull request #66 from stanford-oval/wip/version_bumps
...
Wip/version bumps
2020-12-30 09:03:13 -08:00
Giovanni Campagna
10b51e4623
Fix exporting models ( #67 )
...
Initialize the numericalizer correctly so we don't try to load files
that don't exist
2020-12-28 19:13:50 -08:00
Sina
aa9927a100
Add support and test for diverse beam search
2020-12-20 00:08:10 -08:00
Giovanni Campagna
234180487b
tests: reduce batch size to make travis happy
2020-12-19 10:54:29 -08:00
Giovanni Campagna
826d54ccd7
tests: temporarily remove mt5 test
...
Too big
2020-12-19 10:39:50 -08:00
Giovanni Campagna
ebc09794f2
tests: ensure we run T5 with --preprocess-special-tokens
...
T5 models don't know how to extend their vocabulary properly, so
they cannot deal with new special tokens and must preprocess all
special tokens into regular words.
2020-12-14 23:11:17 -08:00
Giovanni Campagna
db8e9fb1ee
Merge remote-tracking branch 'origin/master' into wip/thingtalk2
2020-12-14 21:46:51 -08:00
Giovanni Campagna
ae93e290ec
Add automatic preprocessing of special ThingTalk tokens
...
Automatically compute a replacement for all @ tokens, ^^ tokens,
and entity tokens, into natural-language-like words. The replacement
is based on the longest unambiguous suffix after splitting on
delimiters.
2020-12-10 18:41:26 -08:00
mehrad
84ada6eedb
Update test data for t5
2020-12-10 18:03:49 -08:00
mehrad
8f1c080936
Add tests for mbart and mt5
2020-12-10 11:54:45 -08:00
Giovanni Campagna
8b34f7bed8
Fix tests, address review comments
2020-12-07 22:46:25 -08:00
Giovanni Campagna
1ebc6a6a95
Cleanup embedding code and command-line flags
...
Remove all flags that are always disabled (like the Transformer
layers on the decoder side) and set the dimension based on
the pretrained model size.
2020-12-07 08:01:45 -08:00
Giovanni Campagna
e930a6a0b5
Remove GloVe embeddings
...
Not useful in a modern NLP pipeline
2020-12-05 11:36:34 -08:00
mehrad
69c0d5f801
Fix and update tests
2020-12-03 17:24:55 -08:00
mehrad
8b644cca02
bump transformers version to 3.5.1
2020-12-03 12:50:34 -08:00
s-jse
653424214c
Merge pull request #58 from stanford-oval/wip/lower-memory
...
Submission work
2020-12-03 00:23:03 -08:00
mehrad
f0a77fc172
Fix and add test for paired encoding
2020-12-02 21:55:35 -08:00
mehrad
2384ae9b1e
Use bart-tiny-random instead of bart-large for tests
2020-11-28 14:57:34 -08:00
mehrad
c3bc5835ad
Add translation test files
2020-11-15 14:33:45 -08:00
mehrad
dbfdfb20d3
run_lm_finetuning: fix attention_mask
2020-11-15 14:22:28 -08:00
mehrad
744a7a8483
run_generation.py: fix and update denoising functions
2020-11-15 13:46:22 -08:00
mehrad
b84a6548a6
Changes to make run_lm_finetuning compatible with mbart and marian
2020-11-15 13:46:17 -08:00
Sina
2f94fc2d35
Fix bug
2020-11-11 17:21:12 -08:00
Sina
e2ad021972
Clarify test case
2020-11-07 23:42:27 -08:00
Sina
9a847b4b1c
Add Bart test
2020-11-07 15:19:44 -08:00
Sina
cdcfa97ce0
Fix bugs
2020-07-24 16:32:54 -07:00
Sina
44021e2ba0
Add test for server mode
2020-07-24 16:32:54 -07:00
mehrad
5c0453a4ec
Add multilingual dialog almond tasks
2020-07-22 22:04:27 -07:00
mehrad
813e28b8fa
Update tests
2020-07-20 22:49:31 -07:00
Sina
a943b2a1b5
Postpone BART tests until a later transformers version
2020-07-03 18:46:29 -07:00
Sina
04104f0dc3
Free up space during tests
2020-07-03 17:59:10 -07:00
Sina
f8976ac6d7
Tests are less resource-intensive to run on Travis
2020-07-03 17:14:24 -07:00
Sina
31fca52dde
Fix BART tests
2020-07-03 16:44:29 -07:00
Sina
eedb4b6b5e
Merge branch 'master' into paraphrase-merge
2020-07-03 15:09:51 -07:00
mehrad
6de6bd98a5
Added option to use language in the question
2020-06-23 14:58:43 -07:00
mehrad
0489d19f2e
Option to pool encoded values by choosing cls token representation
2020-06-23 14:58:43 -07:00
mehrad
a93dfc9099
Merge BART code with the rest of paraphrasing codebase
2020-05-04 15:54:09 -07:00
mehrad
50c4de9d60
merge master branch
2020-05-04 10:06:35 -07:00
mehrad
8ba1f4c490
code refactoring
2020-05-04 10:03:08 -07:00
mehrad
5f871512a6
bart_evaluation is integrated with run_generation script
2020-04-29 21:14:03 -07:00
mehrad
9d5945376f
move files to test dir
2020-04-27 16:36:36 -07:00
mehrad
2eb78d7080
add tests
2020-04-27 16:30:19 -07:00
mehrad
36b0fb5317
fix bug in iter
2020-04-19 18:03:11 -07:00
Sina
e24e426a3c
test datasets should not be ignored
2020-04-18 19:54:48 -07:00
Sina
a122c05062
paraphrase training is now done with tsv files
2020-04-18 18:46:45 -07:00
mehrad
891bee8632
merge master branch
2020-04-09 22:10:35 -07:00
mehrad
31190c3f18
update tests
2020-04-06 16:03:05 -07:00
mehrad
f3bf392231
adding tests + fixing bugs
2020-03-30 14:43:03 -07:00
mehrad
703edc4fdc
Option to do sentence batching
...
Batches can have multiple minibatches; each containing same sentences but in different languages
2020-03-30 12:50:02 -07:00