Seq2Seq model #96

alexholdenmiller · 2017-05-22T20:52:25Z

First draft of simple seq2seq model

jaseweston · 2017-05-24T16:27:10Z

examples/rnn_baselines/train.py

@@ -0,0 +1,97 @@
+# Copyright (c) 2017-present, Facebook, Inc.


I'm confused: the directory is called rnn_baselines (plural), but it only contains train.py and agents.py. I guess they could be moved if we add more RNN baselines? Or we could just give them more specific names now.

alexholdenmiller · 2017-05-26

I'll update these! I haven't yet.

jaseweston · 2017-05-24T16:27:56Z

examples/rnn_baselines/train.py

@@ -0,0 +1,97 @@
+# Copyright (c) 2017-present, Facebook, Inc.
+# All rights reserved.


We need to start making a general training program now. Let's do it after you get this PR in.

jaseweston · 2017-05-24T16:28:26Z

parlai/agents/rnn_baselines/agents.py

+
+
+class Seq2SeqAgent(Agent):
+    """Simple agent which uses an LSTM to process incoming text observations."""


Please explain the architecture a bit more (GRUs, number of layers, etc.).

jaseweston · 2017-05-24T16:29:09Z

parlai/core/dict.py

-        self.freq = SharedTable(self.freq)
-        self.tok2ind = SharedTable(self.tok2ind)
-        self.ind2tok = SharedTable(self.ind2tok)
+        # self.freq = SharedTable(self.freq)


What happened here? Why is the SharedTable wrapping commented out?

jaseweston · 2017-05-24T16:29:28Z

parlai/core/dict.py

@@ -294,8 +306,10 @@ def txt2vec(self, text, vec_type=np.ndarray):
                (self[token] for token in self.tokenize(str(text))),
                np.int
            )
-        else:
+        elif vec_type == list:


We need to make sure this doesn't break anything else that uses the dict, e.g. the IR baseline.

alexholdenmiller · 2017-05-26

Actually, it looks like nothing was using this function except the remote agent.
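
To illustrate the concern about callers, here is a minimal sketch of a `txt2vec`-style function that dispatches on `vec_type` and fails loudly on unsupported types instead of falling through a bare `else`. It is not the code from this PR: the name `txt2vec_sketch`, the `tok2ind` and `unk_index` parameters, and the whitespace tokenizer are all assumptions made for the example.

```python
def txt2vec_sketch(text, tok2ind, vec_type=list, unk_index=0):
    """Map whitespace-separated tokens to indices, returned as vec_type.

    Hypothetical stand-in for a dictionary's txt2vec; tok2ind is a plain
    dict from token to index, and unknown tokens map to unk_index.
    """
    indices = (tok2ind.get(token, unk_index) for token in str(text).split())
    if vec_type in (list, tuple, set):
        # These built-in containers can all be built directly from an iterable.
        return vec_type(indices)
    # An explicit error makes an unexpected caller visible immediately.
    raise RuntimeError('Unsupported vec_type: {}'.format(vec_type))


vocab = {'hello': 1, 'world': 2}
print(txt2vec_sketch('hello world again', vocab))  # [1, 2, 0]
```

With an explicit `elif` per supported type and a final error, any existing caller that passes an unexpected `vec_type` surfaces as an exception rather than as silently different output.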

jaseweston · 2017-05-24T16:29:58Z

parlai/agents/rnn_baselines/agents.py

+        argparser.add_arg('--gpu', type=int, default=-1,
+            help='which GPU device to use')
+
+    def __init__(self, opt, shared=None):


It would be cool if this model also ranked candidates, which we need for many of the ParlAI tasks. I guess it doesn't yet?

alexholdenmiller · 2017-05-26

Yeah, this doesn't do that yet! I'd rather check it in before adding that functionality.
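
For reference, candidate ranking can be sketched as sorting the candidate responses by a score. The sketch below is not part of this PR, and every name in it is hypothetical. A seq2seq model would typically score each candidate by the decoder's likelihood of producing it; here a simple token-overlap score stands in so that the example runs without a trained model.

```python
def rank_candidates(candidates, score_fn):
    """Return the candidates sorted from highest to lowest score."""
    return sorted(candidates, key=score_fn, reverse=True)


def overlap_score(context):
    """Build a stand-in scorer: number of distinct tokens shared with context.

    A real model would replace this with its own score for each candidate.
    """
    context_tokens = set(context.lower().split())

    def score(candidate):
        return len(context_tokens & set(candidate.lower().split()))

    return score


ranked = rank_candidates(
    ['the kitchen', 'in the garden', 'office'],
    overlap_score('john is in the garden'),
)
print(ranked)  # ['in the garden', 'the kitchen', 'office']
```

Keeping the scorer as a separate function means the ranking step stays the same if a model-based score is swapped in later.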

jaseweston · 2017-05-31T17:20:43Z

parlai/core/params.py

        default_downloads_path = os.path.join(self.parlai_home, 'downloads')
        self.parser.add_argument(
            '-t', '--task',
            help='ParlAI task(s), e.g. "babi:Task1" or "babi,cbt"')
        self.parser.add_argument(
+            '--logpath', default=default_log_path,


Is the --logpath argument still here?

alexholdenmiller added 5 commits May 18, 2017 15:38

s2s

2994e5b

minor change

1167f0e

fix dict, s2s is word based now

4301a3b

switch to optim and gru

735dbaf

split files into example train and agents

1d7593a

alexholdenmiller added the Enhancement label May 22, 2017

alexholdenmiller requested a review from ajfisch May 22, 2017 20:52

facebook-github-bot added the CLA Signed label May 22, 2017

alexholdenmiller added 2 commits May 22, 2017 17:24

fix compile error

9102341

small fixes

4666e33

jaseweston reviewed May 24, 2017

alexholdenmiller added 2 commits May 25, 2017 16:51

updated seq2seq model, bug fixes

f774dd0

Merge branch 'master' into first_learner

9993e88

jaseweston reviewed May 31, 2017

alexholdenmiller added 3 commits May 31, 2017 13:53

clean up

f38f468

last fix

4b4df5a

fix merge bug

ba64920

jaseweston approved these changes May 31, 2017

alexholdenmiller merged commit 6fe19ce into master May 31, 2017

alexholdenmiller deleted the first_learner branch May 31, 2017 17:57
