Help with using this tool for creating TTS training data #69

weedwind · 2024-07-11T00:52:13Z

Hi,

Thank you very much for building this tool. I want to use it to segment/align libri-light for training TTS. I am new to this tool. Can anyone help me with the following questions:

If I want to segment the books to about 10 sec chunks (rather than 30), what hyperparameters I should change?
In the output, there are two sets of texts, lowercase with punctuations, and uppercase without punctuations, which one should I use as the ground truth for training TTS?

Thank you so much for any help.

pkufool · 2024-07-16T11:24:28Z

If I want to segment the books to about 10 sec chunks (rather than 30), what hyperparameters I should change?

text_search/examples/libriheavy/matching.py

Lines 101 to 103 in 7c452ed

    
           "min_duration": 2, 
        
           "max_duration": 30, 
        
           "expected_duration": (5, 20),

In the output, there are two sets of texts, lowercase with punctuations, and uppercase without punctuations, which one should I use as the ground truth for training TTS?

It's up to you, I will suggest to use texts with punctuations.

weedwind · 2024-07-23T19:06:23Z

@pkufool Thank you so much. From a quick look at the documentation, it looks to me that the texts with punctuations are the reference, and the uppercase ones are the output from ASR. I am wondering is the uppercase text equally accurate as the reference, if I want to use them to train TTS?

pkufool · 2024-07-24T01:34:39Z

No. If you don't want the punctuations, you can remove them and convert the punctuation texts to uppercase, it is not a good idea to use the ASR transcrptions to train TTS.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Help with using this tool for creating TTS training data #69

Help with using this tool for creating TTS training data #69

weedwind commented Jul 11, 2024

pkufool commented Jul 16, 2024

weedwind commented Jul 23, 2024

pkufool commented Jul 24, 2024

Help with using this tool for creating TTS training data #69

Help with using this tool for creating TTS training data #69

Comments

weedwind commented Jul 11, 2024

pkufool commented Jul 16, 2024

weedwind commented Jul 23, 2024

pkufool commented Jul 24, 2024