[make_voxceleb1.pl] Fix train/test data split for the latest version of the voxceleb1 dataset #3249

sunshines14 · 2019-04-18T15:24:19Z

I found that previous scripts do not work with the latest version of the voxceleb1 dataset.
Therefore, I fixed the script for the latest version as follows:

'url' update to download the latest version (from official site)
Delete the script about 'vox1_meta.csv' no longer needed
Fix the code for the latest version's output (variable name equals make_voxceleb2.pl)

It is confirmed that successful output can be obtained from the script.
ex) voxceleb1_test -- spk2utt, trials, utt2spk, wav.scp

sunshines14 · 2019-04-18T17:12:43Z

(+) if previous script is still needed, I can use this modified script to another version (ex. v2) rather than replacing the existing script.

danpovey · 2019-04-18T20:16:48Z

Thanks a lot!! @david-ryan-snyder can you please check this?

david-ryan-snyder · 2019-04-18T20:22:58Z

Thanks @sunshines14.

Could you do what you offered and copy this to a new script called make_voxceleb1_v2.pl?

Then, in the run.sh script, make the v2 script the default one. Comment out the the old version of the script, and above it write a short comment describing the situation. E.g., mention that if you downloaded the dataset soon after it was released, you will want to use the make_voxceleb1.pl script instead.

egs/voxceleb/v1/local/make_voxceleb1.pl

sunshines14 · 2019-04-19T09:03:37Z

Thanks @david-ryan-snyder.

I fixed some code as you commented as follows:

I have made a new script called 'make_voxceleb1_v2.pl'.
In the run.sh script, the v2 script used as the default one.
In addition, I have made the code simpler in 'make_voxceleb1_v2.pl'.

It was reconfirmed that successful output can be obtained from all of fixed scripts.
Thanks.

egs/voxceleb/v1/local/make_voxceleb1_v2.pl

egs/voxceleb/v1/run.sh

david-ryan-snyder · 2019-04-19T12:44:44Z

Thanks @sunshines14! I just suggested you credit your work in the v2 perl script and fix a preexisting typo. Then we can merge it.

sunshines14 · 2019-04-19T15:38:11Z

I did it all.
Thanks @david-ryan-snyder.

egs/voxceleb/v1/local/make_voxceleb1_v2.pl

david-ryan-snyder · 2019-04-19T17:14:26Z

Thanks @sunshines14, looks good to me. @danpovey, I think it's fine to merge this.

…aldi-asr#3249)

revise make_voxceleb1.pl

4cfaff9

david-ryan-snyder reviewed Apr 18, 2019

View reviewed changes

egs/voxceleb/v1/local/make_voxceleb1.pl Outdated Show resolved Hide resolved

sunshines14 added 4 commits April 19, 2019 17:41

revised make_voxceleb1_v2.pl and run.sh

ff76bc2

edited typos

66c33dc

edited typo -2

e74fa91

edited typo -3

105da59

update make_voxceleb1_v2.pl and run.sh

92f49af

david-ryan-snyder reviewed Apr 19, 2019

View reviewed changes

egs/voxceleb/v1/local/make_voxceleb1_v2.pl Show resolved Hide resolved

david-ryan-snyder reviewed Apr 19, 2019

View reviewed changes

egs/voxceleb/v1/run.sh Outdated Show resolved Hide resolved

update make_voxceleb1_v2.pl and run.sh

4d4dff4

david-ryan-snyder reviewed Apr 19, 2019

View reviewed changes

egs/voxceleb/v1/local/make_voxceleb1_v2.pl Outdated Show resolved Hide resolved

update make_voxceleb1_v2.pl and run.sh

9e758b6

danpovey merged commit c3260f2 into kaldi-asr:master Apr 19, 2019

danpovey pushed a commit to danpovey/kaldi that referenced this pull request Jun 19, 2019

[egs] Make voxceleb recipe work with latest version of the dataset (k…

4831a66

…aldi-asr#3249)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[make_voxceleb1.pl] Fix train/test data split for the latest version of the voxceleb1 dataset #3249

[make_voxceleb1.pl] Fix train/test data split for the latest version of the voxceleb1 dataset #3249

sunshines14 commented Apr 18, 2019 •

edited

Loading

sunshines14 commented Apr 18, 2019 •

edited

Loading

danpovey commented Apr 18, 2019

david-ryan-snyder commented Apr 18, 2019 •

edited

Loading

sunshines14 commented Apr 19, 2019 •

edited

Loading

david-ryan-snyder commented Apr 19, 2019

sunshines14 commented Apr 19, 2019

david-ryan-snyder commented Apr 19, 2019

[make_voxceleb1.pl] Fix train/test data split for the latest version of the voxceleb1 dataset #3249

[make_voxceleb1.pl] Fix train/test data split for the latest version of the voxceleb1 dataset #3249

Conversation

sunshines14 commented Apr 18, 2019 • edited Loading

sunshines14 commented Apr 18, 2019 • edited Loading

danpovey commented Apr 18, 2019

david-ryan-snyder commented Apr 18, 2019 • edited Loading

sunshines14 commented Apr 19, 2019 • edited Loading

david-ryan-snyder commented Apr 19, 2019

sunshines14 commented Apr 19, 2019

david-ryan-snyder commented Apr 19, 2019

sunshines14 commented Apr 18, 2019 •

edited

Loading

sunshines14 commented Apr 18, 2019 •

edited

Loading

david-ryan-snyder commented Apr 18, 2019 •

edited

Loading

sunshines14 commented Apr 19, 2019 •

edited

Loading