Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Microphone Input to MultimodalTextbox #10186

Open
wants to merge 10 commits into
base: main
Choose a base branch
from
Open

Add Microphone Input to MultimodalTextbox #10186

wants to merge 10 commits into from

Conversation

dawoodkhan82
Copy link
Collaborator

@dawoodkhan82 dawoodkhan82 commented Dec 11, 2024

Description

Add microphone support to multimodal textbox. To test run demo: chatbot_multimodal.py

Screen.Recording.2024-12-11.at.6.48.29.PM.mov

Closes: #9094

🎯 PRs Should Target Issues

Before your create a PR, please check to see if there is an existing issue for this change. If not, please create an issue before you create this PR, unless the fix is very small.

Not adhering to this guideline will result in the PR being closed.

Testing and Formatting Your Code

  1. PRs will only be merged if tests pass on CI. We recommend at least running the backend tests locally, please set up your Gradio environment locally and run the backed tests: bash scripts/run_backend_tests.sh

  2. Please run these bash scripts to automatically format your code: bash scripts/format_backend.sh, and (if you made any changes to non-Python files) bash scripts/format_frontend.sh

@gradio-pr-bot
Copy link
Collaborator

gradio-pr-bot commented Dec 11, 2024

🪼 branch checks and previews

Name Status URL
Spaces ready! Spaces preview
Website ready! Website preview
Storybook ready! Storybook preview
🦄 Changes detecting...

Install Gradio from this PR

pip install https://gradio-pypi-previews.s3.amazonaws.com/ba5a958eae74b68f8188a5871844da6829ad371a/gradio-5.9.0-py3-none-any.whl

Install Gradio Python Client from this PR

pip install "gradio-client @ git+https://github.com/gradio-app/gradio@ba5a958eae74b68f8188a5871844da6829ad371a#subdirectory=client/python"

Install Gradio JS Client from this PR

npm install https://gradio-npm-previews.s3.amazonaws.com/ba5a958eae74b68f8188a5871844da6829ad371a/gradio-client-1.8.0.tgz

Use Lite from this PR

<script type="module" src="https://gradio-lite-previews.s3.amazonaws.com/ba5a958eae74b68f8188a5871844da6829ad371a/dist/lite.js""></script>

@gradio-pr-bot
Copy link
Collaborator

gradio-pr-bot commented Dec 11, 2024

🦄 change detected

This Pull Request includes changes to the following packages.

Package Version
@gradio/audio minor
@gradio/multimodaltextbox minor
gradio minor
  • Maintainers can select this checkbox to manually select packages to update.

With the following changelog entry.

Add Microphone Input to MultimodalTextbox

Maintainers or the PR author can modify the PR title to modify this entry.

Something isn't right?

  • Maintainers can change the version label to modify the version bump.
  • If the bot has failed to detect any changes, or if this pull request needs to update multiple packages to different versions or requires a more comprehensive changelog entry, maintainers can update the changelog file directly.

@abidlabs
Copy link
Member

After building the frontend and running python demo/chatbot_multimodal/run.py, the styling looks off for me:

image

Are you seeing this?

@dawoodkhan82
Copy link
Collaborator Author

@abidlabs I see that for a split second as the page loads. But looks fine afterwards.
Screenshot 2024-12-12 at 11 44 06 AM

@abidlabs
Copy link
Member

You've built the frontend? I am seeing the UI persistently broken. Console looks like this:

image

@abidlabs
Copy link
Member

Thanks @dawoodkhan82, loading now and works well. (Passed along some UI feedback in our 1-1)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add microphone input to gr.MultimodalTextbox
3 participants