Whisperer

Whisperer is a Python application that records audio when a specific key is held down, and sends the audio to OpenAI's Whisper ASR system for transcription when another key is tapped. It then types the transcription into the active window.

Prerequisites

Python 3
OpenAI API key

Dependencies

sounddevice
numpy
openai
pynput
scipy
pyperclip

You can install these dependencies using pip:

pip install sounddevice numpy openai pynput scipy pyperclip

On Linux, also run:

sudo apt-get install xclip

Setup

Clone the repository.
Create a file named openai_api_key.txt in the root directory of the project.
Paste your OpenAI API key into openai_api_key.txt.

Usage

Run whisperer.py to start the application.

python whisperer.py

To quit the application, press ctrl + c.

Hold down right ctrl button to start recording audio.
Release right ctrl button to stop recording audio.

If you want to translate the recorded audio to French, tap the right shift button while recording

Notes

The audio is recorded at a sample rate of 16000 Hz and saved as output.wav.
The application only records while the record key is held down.
The application only sends audio to Whisper when the translate key is tapped.
The application does not transcribe audio that is less than 1 second long.
The application does not handle errors from the Whisper API.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
whisperer-local.py		whisperer-local.py
whisperer.py		whisperer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisperer

Prerequisites

Dependencies

Setup

Usage

Notes

About

Releases

Packages

Languages

grassick/whisperer

Folders and files

Latest commit

History

Repository files navigation

Whisperer

Prerequisites

Dependencies

Setup

Usage

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages