Learning speaker embedding from Text-to-Speech

This is my first code sharing on Github. Any comments to improve this repo are welcome

Getting Started

Installation

Clone this repo and install ESPnet as below. If using a different version of ESPnet, the installation and codes also need to change accordingly.

cd [cloned repo]/tools
make KALDI=[kaldi path] PYTHON_VERSION=3.6 TH_VERSION=1.0.1 CUDA_VERSION=10.0 # Remove the "KALDI=[kaldi path]" part if kaldi is NOT installed yet

Experiments

Download voxceleb1 corpus to [voxceleb1 corpus dir]. Running "$ ls [voxceleb1 corpus dir]" shows "vox1_meta.csv voxceleb1_test.txt voxceleb1_wav"
Go to the experimental directory

cd egs/voxceleb1/spkidtts

Run experiment

$ bash run.sh --ngpu [# gpus to use. Using multiple gpus is likely NOT working with the current codes] --spkidloss_weight [spkidloss weight] --voxceleb1_root [voxceleb1 corpus dir]
$ # e.g., bash run.sh --ngpu 1 --spkidloss_weight 0.03 --voxceleb1_root /export/corpora5/VoxCeleb1_v1 # Setting "--spkidloss_weight 0.03" is the same as M-TTS + SpkID loss w/ ASR Phn. Align. SR3 in Table 5 of the paper

Citation

Learning Speaker Embedding from Text-to-Speech

Name		Name	Last commit message	Last commit date
Latest commit History 5,019 Commits
.circleci		.circleci
.github		.github
ci		ci
doc		doc
docker		docker
egs		egs
espnet		espnet
test		test
test_utils		test_utils
tools		tools
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning speaker embedding from Text-to-Speech

Getting Started

Installation

Experiments

Citation

About

Releases

Packages

Languages

License

JaejinCho/espnet_spkidtts

Folders and files

Latest commit

History

Repository files navigation

Learning speaker embedding from Text-to-Speech

Getting Started

Installation

Experiments

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages