Skip to content
@linto-ai

linto.ai

Your Open Source end-to-end platform for voice-operated solutions

LinTO AI

Open Source Ecosystem for Transcription, Collaborative Media Management, Annotation, Live Subtitling, and Summarization

LinTO AI Banner

Overview

LinTO AI provides a powerful suite of open-source tools for transcription, collaborative media editing, annotation, live subtitling, and summarization utilizing large language models (LLMs).

Hosted by
LINAGORA
Try LinTO Studio

Quick Start

  • LinTO Studio: 🎤 A media management platform offering advanced tools for transcription and collaborative media editing. Key features include:
    • Speaker identification/diarization: Automatically segment and identify speakers.
    • Automatic timestamp alignment: Synchronize transcripts with media.
    • Collaborative editing: Work collaboratively on media annotations and transcriptions in real-time.
    • Summarization: Generate concise summaries of media content using LLMs.
    • Building and syncing subtitles: Create and synchronize subtitles for video content with ease.
    • Live transcription from the browser: Record and transcribe audio directly from your browser.
    • AI Agent for videoconferences: A bot system that joins videoconferences to capture live audio streams for transcription and subtitling. This allows LinTO Studio to act as a powerful assistant during meetings, leveraging videoconference platforms as live audio sources.

LinTO Studio leverages our other technologies, including:

  • LinTO-STT for speech-to-text conversion.
  • LinTO-Diarization for speaker segmentation and identification.
  • LLM-Gateway for advanced summarization.

To deploy LinTO Studio and its associated services, use the LinTO Deployment Tool, which simplifies the setup process.

Key Projects

  • LinTO-STT: 🗣️ An automatic speech recognition API supporting both offline and real-time transcriptions. It accommodates models like Kaldi and Whisper and can operate as a standalone service or within a microservices infrastructure. Learn more

  • Whisper-Timestamped: ⏱️ A multilingual automatic speech recognition tool providing word-level timestamps and confidence scores. It enhances OpenAI's Whisper models to deliver more precise transcriptions with detailed timing information. Learn more

  • LLM-Gateway: 📝 A service dedicated to rolling summarization using large language models (LLMs), enabling efficient processing and summarization of extensive textual data. Learn more

  • LinTO-Diarization: 🔊 A speaker diarization service that segments audio streams into homogeneous segments based on speaker identity, with capabilities for speaker identification when audio samples of known speakers are provided. Learn more

  • WebVoiceSDK: 🌐 A JavaScript library offering lightweight and optimized building blocks for always-listening voice-enabled applications directly in the browser. It manages various aspects of voice input, including hardware microphone handling, voice activity detection, and wake word detection. Learn more

Get Involved

LinTO AI is committed to open-source development, ensuring our tools are accessible and adaptable, fostering innovation in business-aware media transcription and summarization. For more information or to contribute, contact us at hello@linto.ai.

Pinned Loading

  1. linto-stt linto-stt Public

    An automatic speech recognition API

    Python 47 14

  2. linto-studio linto-studio Public

    Transcription and annotation interface for recorded audio or video files

    JavaScript 26 1

  3. whisper-timestamped whisper-timestamped Public

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Python 2.1k 162

Repositories

Showing 10 of 50 repositories
  • linto-studio Public

    Transcription and annotation interface for recorded audio or video files

    linto-ai/linto-studio’s past year of commit activity
    JavaScript 26 AGPL-3.0 1 5 0 Updated Dec 12, 2024
  • .github Public
    linto-ai/.github’s past year of commit activity
    0 0 0 0 Updated Dec 12, 2024
  • linto-stt Public

    An automatic speech recognition API

    linto-ai/linto-stt’s past year of commit activity
    Python 47 AGPL-3.0 14 5 0 Updated Dec 11, 2024
  • linto-transcription Public

    Transcription service for LinTO stack.

    linto-ai/linto-transcription’s past year of commit activity
    Python 3 AGPL-3.0 0 3 0 Updated Dec 11, 2024
  • linto-studio-plugins Public

    Live websocket, rtmp, srt streaming plugins for Linto Studio

    linto-ai/linto-studio-plugins’s past year of commit activity
    JavaScript 1 EUPL-1.2 0 0 0 Updated Dec 10, 2024
  • whisper-timestamped Public

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    linto-ai/whisper-timestamped’s past year of commit activity
    Python 2,116 AGPL-3.0 162 36 (1 issue needs help) 1 Updated Dec 6, 2024
  • WebVoiceSDK Public

    Buildings block for voice-enabled applications in the browser

    linto-ai/WebVoiceSDK’s past year of commit activity
    JavaScript 33 AGPL-3.0 10 2 0 Updated Dec 3, 2024
  • linto Public

    Start here !

    linto-ai/linto’s past year of commit activity
    Jsonnet 0 0 0 0 Updated Nov 30, 2024
  • faster-whisper Public Forked from SYSTRAN/faster-whisper

    Faster Whisper transcription with CTranslate2

    linto-ai/faster-whisper’s past year of commit activity
    Python 2 MIT 1,089 0 0 Updated Nov 27, 2024
  • llm-gateway Public

    Rolling summarization using LLM

    linto-ai/llm-gateway’s past year of commit activity
    Python 1 0 2 0 Updated Nov 26, 2024

Top languages

Loading…

Most used topics

Loading…