
contigen/speak-notes


Speak-Notes

A voice note transcription app built on the JavaScript Web Speech API. The app transcribes recognised speech into text and shows the result in two forms, a preview and a note, with real-time editing. The preview shows the transcript while speech is still being recognised; the note shows only recognised speech that sounds complete, such as a full sentence. Separate sounds are played when the speech recognition service starts and stops, as a cue to users. The app listens in the language provided by the user agent. The transcript can be downloaded as a text file, with a new line started at every period.
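As a rough illustration of the preview/note split, here is a minimal sketch. It assumes interim results feed the preview and final results feed the note, which may not match the app's actual code; `splitResults` and `startRecognition` are hypothetical names, not taken from this repo.

```javascript
// Split a SpeechRecognitionResultList-like sequence into the two views.
// Assumption: final results go to the note, interim results to the preview.
function splitResults(results) {
  let note = "";
  let preview = "";
  for (let i = 0; i < results.length; i++) {
    const transcript = results[i][0].transcript; // first (best) alternative
    if (results[i].isFinal) {
      note += transcript;
    } else {
      preview += transcript;
    }
  }
  return { note, preview };
}

// Browser wiring. Returns null where the API is missing (e.g. in Node).
function startRecognition(onUpdate) {
  const Recognition =
    globalThis.SpeechRecognition || globalThis.webkitSpeechRecognition;
  if (!Recognition) return null;

  const recognition = new Recognition();
  recognition.continuous = true; // keep listening across pauses
  recognition.interimResults = true; // deliver partial results too
  recognition.lang = globalThis.navigator.language; // user agent's language
  recognition.onresult = (event) => onUpdate(splitResults(event.results));
  recognition.start();
  return recognition;
}
```

The start and stop sound cues could hang off the `onstart` and `onend` handlers of the same object.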
Using modern Web APIs, the transcript can also be copied to the clipboard or shared to system applications, and text files can be imported.
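The export features can be sketched roughly as below. All function names and the default filename are hypothetical; the browser calls (`Blob`, `URL.createObjectURL`, `navigator.clipboard.writeText`, `navigator.share`, `File.text`) are standard Web APIs, but this is only one plausible way to wire them, not necessarily how the app does it.

```javascript
// Start a new line after every period. Note this also splits decimals
// such as "3.5", since it breaks at every period.
function splitAtPeriods(transcript) {
  return transcript.replace(/\.\s*/g, ".\n").trim();
}

// Download the transcript as a text file (browser only).
function downloadTranscript(transcript, filename = "transcript.txt") {
  const blob = new Blob([splitAtPeriods(transcript)], { type: "text/plain" });
  const url = URL.createObjectURL(blob);
  const link = document.createElement("a");
  link.href = url;
  link.download = filename;
  link.click();
  URL.revokeObjectURL(url); // release the temporary object URL
}

// Copy to the clipboard (needs a secure context and user permission).
async function copyTranscript(transcript) {
  await navigator.clipboard.writeText(transcript);
}

// Share to system applications where the Web Share API is available.
async function shareTranscript(transcript) {
  if (navigator.share) {
    await navigator.share({ title: "Speak-Notes transcript", text: transcript });
  }
}

// Import a text file picked through <input type="file">.
async function importTextFile(file) {
  return file.text();
}
```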

This project was bootstrapped with Vite.

Notes

  • MS Edge seems to give faster and more accurate results: its results sync well with the recognised speech, and the transcript even comes punctuated.
  • Edge on Ubuntu crashes when accessing the API.
  • Brave appears to support the API but always reports a network error, which is a Brave bug. Here's a link that describes the issue
  • Brave on mobile supports the API, at least on Brave 1.49.129 (Chromium 111.0.5563.116), which is surprising.
  • Firefox currently lacks support for the API.
  • Some browsers use a server-based recognition engine: the audio is sent to a web service for processing, so recognition works only online.
  • Results are apparently poor when the network signal is weak.
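Given how much support varies between browsers, feature detection and error reporting are worth handling explicitly. The sketch below is an assumption about how that could look, with hypothetical helper names and message wording; the error codes themselves (`network`, `not-allowed`, `service-not-allowed`, `no-speech`, `audio-capture`) come from the Web Speech API.

```javascript
// Return the recognition constructor, or null when the API is missing
// (as in Firefox at the time of writing).
function getRecognitionConstructor(scope = globalThis) {
  return scope.SpeechRecognition || scope.webkitSpeechRecognition || null;
}

// Human-readable messages for some SpeechRecognitionErrorEvent codes.
const ERROR_MESSAGES = {
  "network": "Network error: the recognition service could not be reached.",
  "not-allowed": "Microphone access was denied.",
  "service-not-allowed": "The browser does not allow the recognition service.",
  "no-speech": "No speech was detected.",
  "audio-capture": "No microphone could be found.",
};

function describeError(code) {
  return ERROR_MESSAGES[code] || `Speech recognition error: ${code}`;
}

// Possible wiring in the browser:
//   recognition.onerror = (event) => showMessage(describeError(event.error));
// where showMessage is whatever the UI uses to display a notice.
```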

Demo

Speak-notes.demo.mp4

Usage

Give credit if you use this project.

Contribution

It's a very interesting API, and I would love to see feature requests.

Features you could contribute:

  1. Let users choose the language they want to record in.
  2. Anything that's interesting enough.
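For the language feature, one possible starting point is to validate the user's choice as a BCP 47 tag and fall back to the user agent's language otherwise. This is only a sketch: `resolveLanguage` is a hypothetical helper, and the `select` element in the usage comment is assumed, not part of the app.

```javascript
// Normalise a user-chosen language tag, or return the fallback when the
// choice is empty or not a well-formed BCP 47 tag.
function resolveLanguage(userChoice, fallback) {
  try {
    const [tag] = Intl.getCanonicalLocales(userChoice);
    return tag || fallback;
  } catch {
    return fallback; // malformed tag (RangeError) or null (TypeError)
  }
}

// Possible wiring in the browser:
//   recognition.lang = resolveLanguage(select.value, navigator.language);
```

A well-formed tag is not necessarily one the recognition engine supports, so the `language-not-supported` error would still need handling.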

Start the App

In the project directory, you can run:

npm start

Runs the app in development mode.
The page will reload when you make changes.
You may also see any lint errors in the console.

Miscellaneous

Currently, the repo has 3 branches: main, feat, develop.

The main branch contains the MVP.

The feat branch contains experimental code: accessing the user's media device to show a live video of it, and reading the user's audio device details.
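For readers curious what that experiment involves, here is a minimal sketch using the standard `navigator.mediaDevices` API. The function names are hypothetical and the code is not taken from the feat branch.

```javascript
// Pick out the audio input devices from an enumerateDevices() result.
// Labels are empty until the user has granted media permission.
function audioInputsOf(devices) {
  return devices
    .filter((device) => device.kind === "audioinput")
    .map((device) => ({
      id: device.deviceId,
      label: device.label || "(label hidden until permission is granted)",
    }));
}

// Show a live camera feed in a <video> element, then list audio inputs.
async function showLiveVideo(videoElement) {
  const stream = await navigator.mediaDevices.getUserMedia({
    video: true,
    audio: true,
  });
  videoElement.srcObject = stream;
  await videoElement.play();
  return audioInputsOf(await navigator.mediaDevices.enumerateDevices());
}
```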

The develop branch contains the latest app features, which might seem a bit counter-intuitive, haha.

About

A speech transcription app, using the Web Speech API.
