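Speech Recognition Dataset Generation

This project uses videos of Xinwen Lianbo (新闻联播, CCTV's daily evening news broadcast) as its example source material.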

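Download the videos

To download Xinwen Lianbo videos, use the CCTV video client (央视影音) for Windows or Mac.

To download the matching transcripts, see the Xinwen Lianbo transcripts (新闻联播文字稿).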

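Extract audio from the videos

Install FFmpeg on Mac: brew install ffmpeg. On other platforms, build it from the archive downloaded from the official FFmpeg website.

Run Video2Audio.py.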
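For readers who want to see what this step involves, the following is a minimal sketch of extracting audio by calling FFmpeg from Python. It is not the contents of Video2Audio.py. The function names, the `audio` output directory, and the 16 kHz mono 16-bit PCM target format are all assumptions chosen for illustration.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ffmpeg_command(video_path: str, wav_path: str, sample_rate: int = 16000) -> List[str]:
    """Build an FFmpeg command that writes the audio track as mono 16-bit PCM WAV."""
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", video_path,        # input video
        "-vn",                   # drop the video stream
        "-ac", "1",              # downmix to one channel
        "-ar", str(sample_rate), # resample (16 kHz is an assumed target)
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        wav_path,
    ]


def extract_audio(video_path: str, out_dir: str = "audio") -> str:
    """Run FFmpeg on one video and return the path of the WAV file it wrote."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    wav_path = str(Path(out_dir) / (Path(video_path).stem + ".wav"))
    # check=True raises CalledProcessError if FFmpeg exits with an error.
    subprocess.run(build_ffmpeg_command(video_path, wav_path), check=True)
    return wav_path
```

Calling `extract_audio("news.mp4")` would, under these assumptions, write `audio/news.wav`. FFmpeg must be on the PATH.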

Split the audio based on speech

Run segment.py. See the comment at the top of the file for usage.
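The idea of speech-based splitting can be illustrated with a simple energy threshold. The sketch below is not the algorithm in segment.py, which may work differently. The function name, frame length, threshold, and silence length are hypothetical values picked for the example, and samples are assumed to be floats in the range -1.0 to 1.0.

```python
from typing import List, Sequence, Tuple


def find_speech_segments(
    samples: Sequence[float],
    frame_len: int = 400,
    threshold: float = 0.01,
    min_silence_frames: int = 3,
) -> List[Tuple[int, int]]:
    """Return (start, end) sample ranges whose mean frame energy exceeds threshold.

    A segment is closed once min_silence_frames consecutive quiet frames follow it.
    """
    segments: List[Tuple[int, int]] = []
    start = None        # sample index where the current segment began
    last_loud_end = 0   # sample index just past the most recent loud frame
    quiet_run = 0       # consecutive quiet frames seen inside a segment
    for frame_start in range(0, len(samples), frame_len):
        frame = samples[frame_start:frame_start + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy > threshold:
            if start is None:
                start = frame_start
            last_loud_end = frame_start + len(frame)
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_silence_frames:
                segments.append((start, last_loud_end))
                start = None
                quiet_run = 0
    if start is not None:
        # The audio ended while a segment was still open.
        segments.append((start, last_loud_end))
    return segments
```

Each returned range can then be written out as its own audio clip. Real recordings usually need a threshold tuned to the background noise level.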

Speech recognition

Baidu

pip3 install baidu-aip

Run os_test.py.
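As a rough illustration of what a call to the Baidu speech service looks like with the baidu-aip package, here is a minimal sketch. It is not the contents of os_test.py. The helper names are hypothetical, the credentials must come from your own Baidu AI console application, and `dev_pid` 1537 is assumed here to select the Mandarin model.

```python
from typing import Any, Dict, Optional


def make_client(app_id: str, api_key: str, secret_key: str):
    """Create a Baidu speech client from your own application credentials."""
    from aip import AipSpeech  # provided by the baidu-aip package
    return AipSpeech(app_id, api_key, secret_key)


def extract_text(response: Dict[str, Any]) -> Optional[str]:
    """Return the top transcript, or None if the service reported an error."""
    if response.get("err_no", 0) != 0:
        return None
    results = response.get("result") or []
    return results[0] if results else None


def recognize_wav(client, wav_path: str, sample_rate: int = 16000) -> Optional[str]:
    """Send one WAV file to the service and return its transcript, if any."""
    with open(wav_path, "rb") as f:
        audio = f.read()
    # dev_pid 1537 is an assumption: it is expected to select Mandarin recognition.
    response = client.asr(audio, "wav", sample_rate, {"dev_pid": 1537})
    return extract_text(response)
```

Pairing each clip's path with the text returned by `recognize_wav` gives one possible way to build the audio and transcript pairs for the dataset.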

iFLYTEK

Upload the audio directly to the iFLYTEK online recognition service.
