logseq-plugin-whisper-subtitles

English | 日本語

Overview

This plugin integrates with the processing server of "Whisper" running locally on the PC to transcribe text from videos like YouTube, providing subtitles with timestamps.

The entire process, from transcription by Whisper to importing the content into Logseq, is completed locally.

Dependencies

OpenAI Whisper API is not used. To make it work, you must run the dedicated server for this plugin, logseq-whisper-subtitles-server, every time. It receives data through this dedicated server (requests the local "Whisper" processing server).
This plugin currently supports YouTube and local video files.

To use timestamp navigation for local video files, please install the logseq-plugin-media-ts plugin.

Usage

Install Whisper subtitles plugin from Logseq Marketplace.

The plugin settings include options for specifying Whisper model size, minimum segments, and endpoint.
Start the dedicated server for this plugin locally. Make sure it runs in the background.
Prepare a block with a video, such as a YouTube video.
- For YouTube: Paste the URL into a block, and it will be embedded.
- For local files: Copy and paste or drag them to embed as assets.
Right-click the bullet point (•) of that block and select "Transcribe (Whisper-Subtitles)" from the menu.

This will request the dedicated server to process it with Whisper. It may take a few minutes for Whisper to finish the transcription process. Once it's done, the block will have extracted timestamps and subtitles.

Demo

YouTube Embedded in a Block

youtube_demo.mp4

Video File Embedded in a Block (Local)

local_demo.mp4

Audio Embedded in a Block (Local)

local_audio.mp4

Chinese Demo

cn_demo.mp4

Related Repository

logseq-whisper-subtitles-server - The local web server running whisper, which is required to extract voice from videos and subsequently extract text from the voice.
logseq-plugin-media-ts: A plugin generate timestamps for video, audio and Bilibili video, it takes you to the corresponding video/audio position when clicked.
whisper: Robust Speech Recognition via Large-Scale Weak Supervision

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/workflows		.github/workflows
demos		demos
public		public
.gitignore		.gitignore
.prettierrc		.prettierrc
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
icon.png		icon.png
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
wmr.config.mjs		wmr.config.mjs
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

logseq-plugin-whisper-subtitles

Overview

Dependencies

Usage

Demo

YouTube Embedded in a Block

Video File Embedded in a Block (Local)

Audio Embedded in a Block (Local)

Chinese Demo

Related Repository

About

Releases 8

Packages

Contributors 2

Languages

License

usoonees/logseq-plugin-whisper-subtitles

Folders and files

Latest commit

History

Repository files navigation

logseq-plugin-whisper-subtitles

Overview

Dependencies

Usage

Demo

YouTube Embedded in a Block

Video File Embedded in a Block (Local)

Audio Embedded in a Block (Local)

Chinese Demo

Related Repository

About

Resources

License

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 2

Languages

Packages