#

speech-dataset

Here are 22 public repositories matching this topic...

aishoot / Speech_Feature_Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

signal-processing speech feature-extraction speech-dataset speech-feature-extraction speech-features speech-preprocess

Updated Sep 3, 2020
Python

hetpandya / youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

text-to-speech youtube python-library tts speech-dataset dataset-generator youtube-dataset youtube-dataset-generator tts-dataset text-to-speech-dataset

Updated Jun 7, 2024
Python

ruslan-corpus / ruslan-corpus.github.io

text-to-speech tts russian speech-dataset speech-corpus

Updated Aug 29, 2019
HTML

fjxmlzn / RNN-SM

[T-IFS] RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

algorithm steganalysis idc rnn-sm ss-qccn speech-dataset

Updated May 24, 2018
Python

manankshastri / Trigger-Word-Detection

Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).

python deep-learning rnn gated-recurrent-units speech-dataset trigger-word-detection

Updated Apr 14, 2019
Jupyter Notebook

Rumeysakeskin / Speech-Datasets-for-ASR

Download speech datasets (English and non-English) for Automatic Speech Recognition

speech-synthesis speech-recognition speech-to-text speech-processing asr speech-dataset audio-datasets voice-datasets common-voice-dataset voxforge-dataset

Updated Jan 22, 2023
Jupyter Notebook

petrichorwq / DECRO-dataset

Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.

speech-dataset deepfake-detection

Updated Sep 14, 2023

gauthelo / kallaama-speech-dataset

A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.

natural-language-processing agriculture speech-processing speech-dataset senegal-language

Updated Apr 29, 2024

revsic / speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor tts vocoder speech-dataset

Updated Jan 18, 2023
Python

Ralireza / PSDR

Persian spoken digit recognition

speech-recognition persian speech-recognizer speech-analysis speech-dataset persian-speech-recognition persian-spoken-digit persian-dataset

Updated Jul 28, 2019
Python

KanishkNavale / Speech-Emotion-Recognition

A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset

deep-learning tensorflow cnn lstm speech-emotion-recognition speech-dataset

Updated Jun 1, 2022
Jupyter Notebook

ina-foss / InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

mborsdorf / GlobalPhoneMS_Scripts

multilingual python deep-learning matlab speech-separation speech-dataset auditory-attention

Updated Sep 6, 2021
MATLAB

mborsdorf / TargetLanguageExtraction

audio multilingual python deep-learning matlab pytorch speech-processing audio-processing source-separation speech-separation speech-dataset auditory-attention speech-corpus speaker-extraction speech-database

Updated Feb 8, 2022

PanosAntoniadis / fast-recorder

Simple script that creates a speech dataset quickly

recorder speech-to-text sphinx-4 speech-dataset

Updated Jul 13, 2019
Python

AI2001_Category-Audio-SC-Speeches

seanpm2001 / AI2001_Category-Audio-SC-Speeches

🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎷️ The audio:speeches category for AI2001, containing speech datasets

gplv3 dataset r-language md txt gpl3 speech-dataset audio-dataset rmarkdown-language ai2001 ai-2001 ai2001-dataset ai-2001-dataset ai2001-development ai-2001-development speech-audio-dataset

Updated Mar 17, 2023
R

cyrta / 50languages

Corpus, dataset of speech recording in 50 languages

corpus speech speech-dataset

Updated Mar 23, 2018
PHP

nafiuny / voice_conversion_dataset

top dataset for voice conversion models

python text-to-speech tts dataset speech-to-text datasets pyth voice-conversion vc speech-dataset audio-datasets voice-dataset voice-datasets audio-dataset tts-dataset vc-dataset

Updated Oct 28, 2023

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

MahtaFetrat / VirgoolInformal-Speech-Dataset

A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.

tts persian speech-processing asr forced-alignment speech-dataset persian-speech-recognition asr-evaluation persian-speech-dataset persian-text-to-speech speech-data-collection persian-speech-corpus

Updated Sep 13, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-dataset topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-dataset topic, visit your repo's landing page and select "manage topics."