Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extending ds_tool for SODA conversational dataset #32

Merged
merged 29 commits into from
Jun 24, 2024
Merged

Conversation

farzadab
Copy link
Contributor

@farzadab farzadab commented Jun 15, 2024

This PR extends the ds_tool to have Jinja2 support which was used to create the SODA-audio dataset for training.

It also adds a cache for TTS generation. The cache is necessary since TTS generation might fail from time to time, but a blind retry didn't seem like the best choice at this point (e.g. the first time we actually ran out of budget, so retrying would not have helped).

@farzadab farzadab changed the title Extending ds_tool for SODA conversational dataset [WIP] Extending ds_tool for SODA conversational dataset Jun 15, 2024
ultravox/tools/ds_tool.py Outdated Show resolved Hide resolved
ultravox/tools/ds_templates/README.md Outdated Show resolved Hide resolved
ultravox/tools/ds_tool.py Show resolved Hide resolved
ultravox/tools/ds_tool.py Outdated Show resolved Hide resolved
@farzadab farzadab changed the title [WIP] Extending ds_tool for SODA conversational dataset Extending ds_tool for SODA conversational dataset Jun 19, 2024
@farzadab farzadab marked this pull request as ready for review June 19, 2024 18:04
Copy link
Contributor

@juberti juberti left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LG, just a few nits.

ultravox/tools/ds_templates/boolq_template.jinja Outdated Show resolved Hide resolved
ultravox/tools/ds_templates/README.md Outdated Show resolved Hide resolved
ultravox/tools/ds_templates/README.md Outdated Show resolved Hide resolved
ultravox/tools/ds_tool.py Outdated Show resolved Hide resolved
ultravox/tools/ds_tool.py Outdated Show resolved Hide resolved
ultravox/tools/tts.py Outdated Show resolved Hide resolved
ultravox/tools/tts.py Outdated Show resolved Hide resolved
ultravox/tools/tts.py Outdated Show resolved Hide resolved
ultravox/tools/tts.py Outdated Show resolved Hide resolved
ultravox/tools/wrappers.py Outdated Show resolved Hide resolved
ultravox/tools/wrappers.py Outdated Show resolved Hide resolved
ultravox/tools/ds_tool.py Outdated Show resolved Hide resolved
ultravox/tools/wrappers.py Outdated Show resolved Hide resolved
@farzadab farzadab merged commit cbe5f6e into main Jun 24, 2024
1 check passed
@farzadab farzadab deleted the farzad-soda branch June 24, 2024 20:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants