Video Transcript Summarization with AI

Open In Colab

Transcribe and summarize videos from multiple sources using state-of-the-art AI models in Google Colab or locally. This tool addresses the problem of too much content and too little time, helping you remember what you watch or listen to.

colab_summarize_v2_example.mp4

Features

  • Versatile Video Sources: Summarize videos from YouTube, Dropbox, Google Drive, or local files.
  • Efficient Transcription:
    • Use existing YouTube captions when available to save time and resources.
    • Transcribe audio using Cloud Whisper (via Groq API) or Local Whisper.
  • Customizable Summarization:
    • Choose from different prompt types: Summarization, Grammar Correction, or Distill Wisdom to extract key insights.
  • Flexible API Integration:
    • Use various AI models via Groq (free), OpenAI, or custom local models for summarization.
  • Output Features:
    • Generate summaries with timestamps and include original transcripts.

Use Cases

  • Quick Summaries: Get concise summaries of lengthy videos with timestamps.
  • Note-Taking: Capture key points efficiently.
  • Transcription Correction: Obtain grammatically correct video transcripts.
  • Wisdom Extraction: Extract key insights and wisdom from any video content.

Example Summary

Usage

graph LR
    B{Choose Video Source}
    B -->|YouTube| C{Use YouTube Captions?}
    B -->|Google Drive| D[Convert to Audio]
    B -->|Dropbox| D
    B -->|Local File| D
    C -->|Yes| E[Download YouTube Captions]
    C -->|No| D
    E --> J{Choose Prompt Type}
    D --> G{Choose Transcription Method}
    G -->|Cloud Whisper| H[Transcribe with Groq API endpoint Whisper]
    G -->|Local Whisper| I[Transcribe with Local Whisper]
    H --> J{Choose Prompt Type}
    I --> J{Choose Prompt Type}
    J --> K[Summarize Content]
    J --> L[Correct Captions]
    J --> M[Extract Key Insights]
    J --> P[Questions and answers]
    J --> Q[Essay Writing in Paul Graham Style]
    K --> O[Generate Final Summary]
    L --> O
    M --> O
    P --> O
    Q --> O

    %% Highlight important decision points
    style C fill:#f9f,stroke:#333,stroke-width:2px
    style G fill:#f9f,stroke:#333,stroke-width:2px
    style J fill:#bbf,stroke:#333,stroke-width:2px
  1. Configure API Settings:

    • Set api_endpoint to Groq, OpenAI, or Custom.
    • Ensure api_key is set accordingly.
    • Groq API Key (api_key_groq): Required for cloud Whisper transcription. If you plan to use the Whisper API endpoint (only the Groq endpoint is supported for now), specify your Groq API key in api_key_groq.
    • Why separate api_key_groq and api_key? So that you can transcribe with Groq while using a different API (e.g., OpenAI) for summarization.
  2. Configure Runtime Environment:

    • If using Local Whisper on Google Colab:
      • Switch the runtime type to a GPU instance (e.g., T4).
      • Go to Runtime > Change runtime type > Set Hardware accelerator to GPU.
  3. Input Video Source:

    • Input the video URL or file path.
    • Select the source type (YouTube Video, Google Drive Video Link, Dropbox Video Link, Local File):
      • For Google Drive, use the path relative to "My Drive".
      • For Dropbox, use the public sharing link.
      • For YouTube videos, it is recommended to use the available YouTube captions to save on transcription time and API usage.
  4. Set Transcription Settings:

    • The transcription settings apply only if you use Whisper transcription rather than YouTube captions.

    • Choose between cloud (Groq endpoint) or local Whisper:

      • Cloud Whisper:
        • Only supported via the Groq endpoint.
        • Requires api_key_groq.
      • Local Whisper:
        • Requires a GPU runtime.
    • Language: Specify the language code (ISO-639-1 format, e.g., "en" for English, "it" for Italian).

    • Initial Prompt for Whisper: (Optional) Provide an initial prompt to guide the transcription.

    • Groq free-tier transcription limits using Whisper:

      Model ID                     Requests per Day   Audio Minutes per Hour   Audio Minutes per Day
      distil-whisper-large-v3-en   2,000              120                      480
      whisper-large-v3             2,000              120                      480

      (A minimal sketch of the cloud transcription call is included after these steps.)
  5. Set Summarization Settings:

    • Prompt Type: Choose from Summarization, Grammar Correction, Distill Wisdom, Questions and answers, or Essay Writing in Paul Graham Style.
    • Configure other settings such as Parallel API Calls (mind rate limits), Chunk Size, and Max Output Tokens. (A minimal sketch of the chunked summarization flow is included below.)
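For reference, here is a minimal sketch of the cloud transcription call from step 4, assuming the OpenAI-compatible Groq endpoint and the openai Python package. The file name, model, language, and initial prompt are placeholders to adapt to your own configuration; the notebook's internal code may differ.

```python
# Minimal sketch: transcribe an audio file through Groq's OpenAI-compatible
# Whisper endpoint. Assumes the `openai` package is installed and that
# api_key_groq holds your Groq API key (as configured in step 1).
from openai import OpenAI

api_key_groq = "YOUR_GROQ_API_KEY"  # placeholder

client = OpenAI(
    api_key=api_key_groq,
    base_url="https://api.groq.com/openai/v1",  # Groq's OpenAI-compatible endpoint
)

with open("audio.mp3", "rb") as audio_file:  # hypothetical local audio file
    transcription = client.audio.transcriptions.create(
        model="whisper-large-v3",          # one of the models from the limits table above
        file=audio_file,
        language="en",                     # ISO-639-1 code, as in step 4
        prompt="Podcast about AI tools.",  # optional initial prompt to guide transcription
    )

print(transcription.text)
```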
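Similarly, a hedged sketch of the chunked summarization flow from steps 1 and 5: the transcript is split into chunks, each chunk is summarized in parallel through the selected endpoint, and the partial summaries are joined. Variable names such as chunk_size, max_output_tokens, and the model id are illustrative placeholders, not the notebook's exact parameters.

```python
# Minimal sketch: chunked summarization against an OpenAI-compatible endpoint
# (Groq, OpenAI, or a custom local server, matching step 1).
from concurrent.futures import ThreadPoolExecutor
from openai import OpenAI

api_endpoint = "https://api.groq.com/openai/v1"  # or OpenAI / custom base URL
api_key = "YOUR_API_KEY"
model = "llama-3.1-8b-instant"   # placeholder model id
chunk_size = 4000                # characters per chunk (mind the model's context window)
max_output_tokens = 1024         # cap on each partial summary
parallel_api_calls = 4           # mind your rate limits

client = OpenAI(api_key=api_key, base_url=api_endpoint)

def summarize_chunk(chunk: str) -> str:
    """Summarize one transcript chunk with the selected model."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": "Summarize the following transcript chunk."},
            {"role": "user", "content": chunk},
        ],
        max_tokens=max_output_tokens,
    )
    return response.choices[0].message.content

# Hypothetical transcript file produced by the transcription step.
transcript = open("transcript.txt", encoding="utf-8").read()
chunks = [transcript[i:i + chunk_size] for i in range(0, len(transcript), chunk_size)]

# map() preserves chunk order, so the joined summary follows the video timeline.
with ThreadPoolExecutor(max_workers=parallel_api_calls) as pool:
    partial_summaries = list(pool.map(summarize_chunk, chunks))

print("\n\n".join(partial_summaries))
```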
