
Add pretrained models #435

Closed
breandan opened this issue Mar 11, 2017 · 41 comments

@breandan

Is it possible to add links to some pretrained models? I would like to test the performance on some real world speech, but could not find any reference to these in the docs. Thanks!

@kdavis-mozilla
Contributor

We're planning on adding links to some pre-trained models as soon as we're satisfied with their word error rate (WER).

We're targeting a WER of at most 10% on the TED test set and hope that, within the next few weeks, we can release a model.
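For reference, WER is the word-level edit distance (insertions, deletions, substitutions) between the hypothesis and the reference transcript, divided by the number of reference words. A minimal sketch of the metric, not the project's own evaluation code:

```python
def wer(reference, hypothesis):
    """Word error rate: word-level Levenshtein distance / reference length."""
    r, h = reference.split(), hypothesis.split()
    # dp[i][j]: edit distance between first i reference words and first j hypothesis words
    dp = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        dp[i][0] = i
    for j in range(len(h) + 1):
        dp[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            cost = 0 if r[i - 1] == h[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,        # deletion
                           dp[i][j - 1] + 1,        # insertion
                           dp[i - 1][j - 1] + cost)  # substitution
    return dp[-1][-1] / len(r)

# Two deleted words out of six reference words:
print(round(wer("the cat sat on the mat", "the cat sat mat"), 3))  # → 0.333
```

So "at most 10%" means at most one word error per ten reference words on average.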

@gvoysey
Contributor

gvoysey commented Apr 4, 2017

@kdavis-mozilla is there any update on this line of work?

@kdavis-mozilla
Contributor

@gvoysey I wish I could give you the models right now. But I can't. Sorry. The WER still needs to be tuned.

We're still waiting on our new hardware, which will allow us to tune hyperparameters with a quicker turnaround. But it's not here yet.

Unfortunately the ETA for the hardware looks to be about 4 weeks out now. Then, once we have the hardware, we'll still need to tune for about a week or two.

Again, sorry.

@gvoysey
Contributor

gvoysey commented Apr 5, 2017

@kdavis-mozilla no worries. I have been thrashing some GPUs myself (on LibriVox); hopefully they'll finish soon!

@jacobjennings

I'm curious how large a trained model from a large dataset like TED is on disk, as a rough estimate?

@gvoysey
Contributor

gvoysey commented Apr 14, 2017

I have a trained LibriVox model which is ~800 MB. I've found DeepSpeech 2 implementations in other frameworks (Torch) that are ~600 MB. These facts are anecdata. :)
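Those sizes are consistent with a simple rule of thumb: a float32 checkpoint weighs roughly 4 bytes per parameter (more if optimizer state is saved alongside the weights). A quick sketch of the arithmetic, with illustrative numbers only:

```python
def model_size_mb(n_params, bytes_per_param=4):
    """Approximate on-disk size in MB for raw float32 weights."""
    return n_params * bytes_per_param / 1e6

# An ~800 MB float32 model implies on the order of 200 million parameters:
implied_params = 800e6 / 4
print(implied_params / 1e6)      # → 200.0 (millions of parameters)
print(model_size_mb(implied_params))  # → 800.0
```

Checkpoints that also store optimizer state (e.g. Adam's two moment vectors per weight) can be roughly three times larger than the exported inference graph.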

@phasnox

phasnox commented May 3, 2017

Do you know how long it will take to train on the TED dataset if I have a GTX 1080?

@kdavis-mozilla
Contributor

@phasnox We use 4 Titan X's and it takes about 5 days. So, I'd guess on the order of 20 days.
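That estimate is simple GPU-count scaling (4 GPUs × 5 days ≈ 20 GPU-days). A back-of-envelope sketch, assuming near-linear multi-GPU scaling; the per-card FP32 throughput figures below are rough assumptions, not numbers from this thread:

```python
# Scale "5 days on 4x Titan X (Pascal)" to a single GTX 1080.
titan_x_pascal_tflops = 11.0  # approximate FP32 TFLOPS (assumption)
gtx_1080_tflops = 9.0         # approximate FP32 TFLOPS (assumption)

days_on_four_titans = 5.0
est_days_single_1080 = days_on_four_titans * 4 * (titan_x_pascal_tflops / gtx_1080_tflops)
print(round(est_days_single_1080, 1))  # → 24.4
```

Pure GPU-count scaling gives 20 days; accounting for the GTX 1080's somewhat lower throughput nudges that toward ~24 days, so "on the order of 20 days" is a fair summary.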

@getnamo

getnamo commented May 15, 2017

@kdavis-mozilla Maxwell Titan X, Titan X (Pascal), or Titan Xp? I hate that we need the distinction...

@kdavis-mozilla
Contributor

@getnamo Titan X (Pascal)

There are only two hard things in Computer Science:
cache invalidation and naming things.
                                -- Phil Karlton

But I don't think NVIDIA even tried on this one.

@gvoysey
Contributor

gvoysey commented May 16, 2017 via email

@striki70

Any news on links to trained models?

@striki70

@phasnox 20 days for how many epochs? WER achieved?

@phasnox

phasnox commented May 16, 2017

@striki70 Yep, the WER achieved would be nice to know. But I haven't tried yet; I need to get a better cooling system first.

@gvoysey
Contributor

gvoysey commented May 22, 2017

@kdavis-mozilla just checking in on any updates you have on trained models.

@kdavis-mozilla
Contributor

Just got the hardware and installed it last week. Now we're working through a few OOM issues which appear in the cluster setting and not in the single node setting. So, it's a work in progress.

@pythonmobile

Waiting for it :) Thanks.

@AnimeshKoratana

Are there any updates regarding the pretrained models?

@reuben
Contributor

reuben commented Jun 29, 2017

We're training a bunch of models and getting ready to share them. It's coming soon! :)

@ThejanW

ThejanW commented Jul 3, 2017

Are they ready now?

@reuben
Contributor

reuben commented Jul 3, 2017

We'll comment here when they're available.

@gardenia22
Contributor

I tested the TED dataset against the Google API using the autosub tool with a little modification. The Google API achieved a 27.3162% WER. Is 10% WER aiming too high?

@mozilla mozilla locked and limited conversation to collaborators Jul 3, 2017
@reuben
Contributor

reuben commented Jul 3, 2017

That's off topic for this issue. Please join our IRC channel for questions and discussions. I'm locking this issue to try to get people to stick to the proper communication channels.

@reuben
Contributor

reuben commented Nov 26, 2017

We have a first release of an American English model available on our releases page: https://github.com/mozilla/DeepSpeech/releases/latest

Please check it out and experiment; we're excited to see what you can do with it. We also set up discussion forums on Discourse; check the release notes for links.

@reuben reuben closed this as completed Nov 26, 2017
@saikishor

@reuben We really appreciate the work and effort you have put in to provide us with the model. It works amazingly well. It would be even more helpful if you could provide the checkpoints for the above model.

@nicolaspanel
Contributor

@reuben @kdavis-mozilla and others, thanks for the great work (amazing, really!)
Do you plan to share pre-trained models for other languages (e.g. French) in future releases as well?

@kdavis-mozilla
Contributor

@nicolaspanel Thanks, and yes we want to share models for as many languages as we can!

@nicolaspanel
Contributor

@kdavis-mozilla great 👍 Do you have any ETA (none mentioned in current projects)?
PS: I would be happy to help clean up/prepare datasets if needed.

@kdavis-mozilla
Contributor

@nicolaspanel For other languages the timing depends upon the rate at which Common Voice collects data for the language.

For example, a couple of weeks ago data donation started for French. The faster Common Voice collects data for a particular language, e.g. French, the faster we can bring you a model for that language.

So to a very large extent the timings will be determined by the Common Voice community.

@lissyx
Collaborator

lissyx commented Jun 13, 2018

@nicolaspanel For French we also deeply need to diversify our sources. If you're interested, you can find information at https://github.com/mozfr/besogne/wiki/Common-Voice-Fr

@Sorkanius

Is there any news on the next trained model release?

Thanks a lot, your work is amazing!

@kdavis-mozilla
Contributor

No real updates yet, but as we just started collecting in other languages, we need to give it a bit more time before we have enough data to train on.

@kdavis-mozilla
Contributor

PS: And thanks for the compliment!

@Sorkanius

Are you working towards increasing the dataset or trying new networks? Or both maybe?

@kdavis-mozilla
Contributor

Right now we're trying to get more data through Common Voice.

@Sorkanius

Have you thought about data mining movies/series with their subtitles?

@kdavis-mozilla
Contributor

Licensing issues prevent this.

@Sorkanius

Oh, I understand. Thanks again for your time, and keep up the great work!

@akshat9425

Is there any pretrained model for Indian English? Please share it if it exists.

@b-ak
Contributor

b-ak commented Sep 17, 2018

@akshat9425 A couple of us are working towards something along similar lines. We could explore the possibility of collaborating. Write to me: bak0@protonmail.com

@lock

lock bot commented Jan 2, 2019

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked and limited conversation to collaborators Jan 2, 2019