Convolutional Sequence to Sequence Learning
aka Fairseq
https://arxiv.org/pdf/1705.03122.pdf
3. A Convolutional Architecture
3.1. Position Embeddings
p for the absolute position embedding vector
e for the word embedding
The model feeds their sum, e + p, to the encoder and decoder (see the sketch below).
See also
"Positional Encoding" in "Attention Is All You Need", which uses fixed sinusoidal encodings instead of learned position embeddings
3.2. Convolutional Block Structure
(image from https://norman3.github.io/papers/docs/fairseq.html)
In the image above, the kernel width is 3 and the convolutional block stack size is 1.
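A rough sketch of a single convolutional block with a gated linear unit (GLU) and a residual connection, matching the kernel width of 3 above (PyTorch; the padding choice and the decoder-side causal masking are simplified here).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

dim, kernel_width = 256, 3  # kernel width 3, stack size 1, as in the image

# The convolution maps dim -> 2*dim channels so GLU can split them
# into a value half A and a gate half B
conv = nn.Conv1d(dim, 2 * dim, kernel_width, padding=kernel_width // 2)

x = torch.randn(1, dim, 7)  # (batch, channels, seq_len)

# GLU: v([A; B]) = A * sigmoid(B); the gate controls what passes through
h = F.glu(conv(x), dim=1)

# Residual connection around the block
out = h + x
```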
3.3. Multi-step Attention
d_i is the decoder state summary, combining the current decoder state with a residual connection from the previous target embedding g_i
Attention weights come from the dot product of the encoder outputs z and the decoder state summary d_i, followed by a softmax (see the sketch below)
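A sketch of one attention step under these definitions (PyTorch, single layer, batch dimension omitted); the projection W_d and the shapes are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

dim, src_len, tgt_len = 256, 9, 7  # hypothetical sizes

h = torch.randn(tgt_len, dim)  # current decoder layer states
g = torch.randn(tgt_len, dim)  # embeddings of the previous target elements
z = torch.randn(src_len, dim)  # final encoder outputs
e = torch.randn(src_len, dim)  # encoder input embeddings (word + position)

W_d = nn.Linear(dim, dim)

# Decoder state summary: d_i = W_d h_i + b_d + g_i (residual from g_i)
d = W_d(h) + g

# Attention weights: softmax over the dot products d_i . z_j
a = F.softmax(d @ z.t(), dim=-1)  # (tgt_len, src_len)

# Context: weighted sum over z_j + e_j, so attention also sees the inputs
c = a @ (z + e)
```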
3.4. Normalization Strategy
This keeps the variance of activations roughly constant throughout the network, which helps stabilize learning (see the sketch below).
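For example, the paper multiplies the sum of a residual block's input and output by √0.5, which halves the variance of the sum (assuming the two summands have equal variance); a minimal sketch:

```python
import math
import torch

def scaled_residual(x: torch.Tensor, block_out: torch.Tensor) -> torch.Tensor:
    # sqrt(0.5) halves the variance of the sum, assuming both summands
    # have the same variance and are independent
    return (x + block_out) * math.sqrt(0.5)
```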