Transcribe and summarize videos from multiple sources using state-of-the-art AI models in Google Colab or locally. This tool addresses the problem of too much content and too little time, helping you remember the content you watch or listen to.
colab_summarize_v2_example.mp4
- Versatile Video Sources: Summarize videos from YouTube, Dropbox, Google Drive, or local files.
- Efficient Transcription:
- Use existing YouTube captions when available to save time and resources.
- Transcribe audio using Cloud Whisper (via Groq API) or Local Whisper.
- Customizable Summarization:
- Choose from different prompt types: Summarization, Grammar Correction, or Distill Wisdom to extract key insights.
- Flexible API Integration:
- Use various AI models via Groq (free), OpenAI, or custom local models for summarization.
- Output Features:
- Generate summaries with timestamps and include original transcripts.
- Quick Summaries: Get concise summaries of lengthy videos with timestamps.
- Note-Taking: Capture key points efficiently.
- Transcription Correction: Obtain grammatically correct video transcripts.
- Wisdom Extraction: Extract key insights and wisdom from any video content.
graph LR
B{Choose Video Source}
B -->|YouTube| C{Use YouTube Captions?}
B -->|Google Drive| D[Convert to Audio]
B -->|Dropbox| D
B -->|Local File| D
C -->|Yes| E[Download YouTube Captions]
C -->|No| D
E --> J{Choose Prompt Type}
D --> G{Choose Transcription Method}
G -->|Cloud Whisper| H[Transcribe with Groq API endpoint Whisper]
G -->|Local Whisper| I[Transcribe with Local Whisper]
H --> J{Choose Prompt Type}
I --> J{Choose Prompt Type}
J --> K[Summarize Content]
J --> L[Correct Captions]
J --> M[Extract Key Insights]
J --> P[Questions and answers]
J --> Q[Essay Writing in Paul Graham Style]
K --> O[Generate Final Summary]
L --> O
M --> O
P --> O
Q --> O
%% Highlight important decision points
style C fill:#f9f,stroke:#333,stroke-width:2px
style G fill:#f9f,stroke:#333,stroke-width:2px
style J fill:#bbf,stroke:#333,stroke-width:2px
-
API stuff:
- Set
api_endpoint
to Groq, OpenAI, or Custom. - Ensure
api_key
is set accordingly. - Groq API Key (
api_key_groq
): Required for cloud Whisper transcription. - If you plan to use Whisper API endpoint (only Groq endpoint is supported for now) you have to specify your Groq API key in api_key_groq.
- Why use
api_key_groq
andapi_key
? So that you can use a different API for summarization (e.g., OpenAI).
- Set
-
Configure Runtime Environment:
- If using Local Whisper on Google Colab:
- Switch the runtime type to a GPU instance (e.g., T4).
- Go to Runtime > Change runtime type > Set Hardware accelerator to GPU.
- If using Local Whisper on Google Colab:
-
Input Video Source:
- Input the video URL or file path.
- Select the source type (YouTube Video, Google Drive Video Link, Dropbox Video Link, Local File):
- For Google Drive, use the path relative to "My Drive".
- For Dropbox, use the public sharing link.
- For Youtube video, is recommended to use the available YouTube captions to save on transcription time and API usage.
-
Set Transcription Settings:
-
The transcription settings are applied only if you want to use Whisper transcription and not Youtube Captions.
-
Choose between cloud (Groq endpoint) or local Whisper:
- Cloud Whisper:
- Only supported via the Groq endpoint.
- Requires
api_key_groq
.
- Local Whisper:
- Requires a GPU runtime.
- Cloud Whisper:
-
Language: Specify the language code (ISO-639-1 format, e.g., "en" for English,"it" for Italian).
-
Initial Prompt for Whisper: (Optional) Provide an initial prompt to guide the transcription.
-
Groq Free usage transcription limits using Whisper:
Model ID Requests per Day Audio Minutes per Hour Audio Minutes per Day distil-whisper-large-v3-en
2,000 120 480 whisper-large-v3
2,000 120 480
-
-
Set Summarization Settings:
- Prompt Type: Choose from Summarization, Grammar Correction, Distill Wisdom, Questions and answers or Essay Writing in Paul Graham Style.
- Configure other settings such as Parallel API Calls (mind rate limits), Chunk Size, and Max Output Tokens.