RNN vs LSTM? #498
Comments
Generally LSTMs out-perform vanilla RNNs. Also, we don't do low-level parallelization as Baidu did, with forward and backward passes of the bi-directional RNN occurring on different GPUs, then having the GPUs exchange roles. However, we do have an open issue #362 to explore the difference in LSTM WER vs RNN WER. If you have access to training hardware and want to tackle issue #362, feel free to explore on the TED data set. However, as a warning, it will take some compute power to tune all the hyperparameters.
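For readers unfamiliar with the difference being discussed, the two recurrences can be sketched for a single scalar hidden unit. This is an illustrative toy, not DeepSpeech's actual implementation; the function names and the per-gate parameter layout are assumptions made for the sketch:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def rnn_step(x, h, w_x, w_h, b):
    # Vanilla RNN: a single tanh update. Cheap and easy to parallelize,
    # but gradients tend to vanish over long sequences.
    return math.tanh(w_x * x + w_h * h + b)

def lstm_step(x, h, c, p):
    # LSTM: input (i), forget (f), and output (o) gates plus a candidate (g).
    # p is a dict of per-gate weights, e.g. p["i"] = (w_x, w_h, b)  (illustrative layout).
    z = {k: p[k][0] * x + p[k][1] * h + p[k][2] for k in ("i", "f", "o", "g")}
    i, f, o = sigmoid(z["i"]), sigmoid(z["f"]), sigmoid(z["o"])
    g = math.tanh(z["g"])
    c_new = f * c + i * g            # additive cell-state path: long-range memory
    h_new = o * math.tanh(c_new)
    return h_new, c_new
```

The extra gates roughly quadruple the parameters and compute per step, which is the trade-off behind both Baidu's choice to skip LSTM circuits and the open question in #362.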
Thanks for the fast reply. May I ask what your current WER with LSTM is?
It depends on the data set. For example, on the full librivox test set the WER was about 22%, and on the clean subset of the librivox test data it was about 12%. However, we haven't really had time to tune on the librivox data set, as we're waiting on new hardware that would allow a quicker turn-around for training, and we need to tune the language model too. So these numbers are only a first pass on the data set.
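For reference, WER is conventionally the word-level Levenshtein distance (substitutions + insertions + deletions) between hypothesis and reference, normalized by the reference length. A minimal sketch; the function name is illustrative, not the project's actual scoring code:

```python
def wer(reference, hypothesis):
    # Word error rate via word-level Levenshtein distance (dynamic programming).
    ref, hyp = reference.split(), hypothesis.split()
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i                       # delete all remaining reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j                       # insert all remaining hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution (or match)
    return d[len(ref)][len(hyp)] / len(ref)
```

For example, `wer("a b c d", "a x c")` is 0.5: one substitution and one deletion over a four-word reference.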
@joyousrabbit I'm going to close this as it seems the associated question has been answered.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Hello, Mozilla,
Why does your project use LSTM instead of a plain RNN?
The paper says: "we have limited ourselves to a single recurrent layer (which is the hardest to parallelize) and we do not use Long-Short-Term-Memory (LSTM) circuits."