utility improvements to seq2seq #845

alexholdenmiller · 2018-06-11T22:31:55Z

still testing the oom code to see if it helps reduce memory spikes; it's based on fairseq's code and was suggested by myle
changed the vector caches to reference the parlai data path instead of parlai_home

alexholdenmiller · 2018-06-11T22:36:20Z

looks like the oom code is working! whenever training hits an oom during the forward or backward pass (not while the network weights are being updated, and not during validation), this clears out pytorch's GPU memory cache, logs the oom to the metrics, prints a warning, and just moves on to the next batch
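a rough sketch of the skip-on-oom pattern described above, for readers who don't want to open the diff. this is not the code from this PR: `train_step`, `forward_backward`, `update_params`, `clear_gpu_cache` and the `total_skipped_batches` metric key are all hypothetical names chosen for illustration, and torch is imported optionally so the sketch also runs on a machine without it.

```python
try:
    import torch  # optional: only needed to actually free the CUDA cache
except ImportError:
    torch = None


def clear_gpu_cache():
    """Release PyTorch's cached GPU memory when torch and CUDA are available."""
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()


def train_step(batch, forward_backward, update_params, metrics):
    """Run one training step; skip the batch if forward/backward hits an OOM.

    forward_backward and update_params are hypothetical callables standing in
    for the model's real methods. Returns True if the weights were updated.
    """
    try:
        forward_backward(batch)
    except RuntimeError as e:
        # PyTorch raises CUDA OOM as a RuntimeError mentioning "out of memory";
        # anything else is a real bug and should propagate.
        if 'out of memory' not in str(e):
            raise
        print('| WARNING: ran out of memory, skipping batch')
        metrics['total_skipped_batches'] += 1
        clear_gpu_cache()
        return False
    # The weight update sits outside the try block, so an OOM there is not caught.
    update_params()
    return True
```

the counter is only ever incremented, never reset, so it reads as a running total of skipped batches over the whole run.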

alexholdenmiller · 2018-06-11T22:36:47Z

data change: #844
oom: #843 and others

alexholdenmiller · 2018-06-11T22:38:46Z

trained with batchsize 350 for a bit and was able to catch some spikes during training and continue

jaseweston · 2018-06-11T22:39:59Z

parlai/agents/seq2seq/seq2seq.py

@@ -248,23 +249,15 @@ def __init__(self, opt, shared=None):
                    embs = vocab.GloVe(
                        name='840B',
                        dim=300,
-                        cache=os.path.join(


just checking: is this the same as
https://github.com/facebookresearch/ParlAI/blob/master/parlai/zoo/glove_vectors/build.py ?

    opt = {'datapath': datapath}
    fnames = ['glove.840B.300d.zip']
    download_models(opt, fnames, 'glove_vectors', use_model_type=False,
                    path="http://nlp.stanford.edu/data")

not clear that it is.

we should probably just remove parlai/zoo/glove_vectors right? torchtext has its own code for downloading its vectors.

it is being used by drqa; if you can make drqa work with the other one then yes, fine! it would be good to get drqa to work with fasttext as well, anyway.

but yes, it is the same: download_models uses os.path.join(opt['datapath'], 'models', model_folder), where model_folder is the 'glove_vectors' string in that call

i thought one might be a binary file and one a text file or something? i guess just check drqa still works please
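for reference, a tiny sketch of the directory that the os.path.join expression quoted in this thread produces. `model_dir` is a hypothetical helper and the datapath value is made up; only the join itself comes from the comment above. the cache path on the seq2seq side is cut off in the diff excerpt, so it is not reproduced here.

```python
import os


def model_dir(opt, model_folder):
    """Evaluate the join expression quoted in the thread for download_models."""
    return os.path.join(opt['datapath'], 'models', model_folder)


opt = {'datapath': os.path.join('ParlAI', 'data')}  # hypothetical datapath
print(model_dir(opt, 'glove_vectors'))
```

this only shows where the files would land; it says nothing about whether the two downloads have the same file format, which is the open question for drqa.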

jaseweston · 2018-06-12T15:59:15Z

see comment

utility improvements to seq2seq

0054998

facebook-github-bot added the CLA Signed label Jun 11, 2018

do not reset total skipped batches

f2d2b88

oops add back self.update_params()

9148d54

jaseweston reviewed Jun 11, 2018

jaseweston approved these changes Jun 12, 2018

updates

7391445

jaseweston approved these changes Jun 12, 2018

alexholdenmiller merged commit 0446621 into master Jun 12, 2018

alexholdenmiller deleted the seq2seq_util branch June 12, 2018 17:56
