Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU support for fastT5 #42

Closed
rupeshpoojary97 opened this issue Mar 8, 2022 · 3 comments
Closed

GPU support for fastT5 #42

rupeshpoojary97 opened this issue Mar 8, 2022 · 3 comments

Comments

@rupeshpoojary97
Copy link

You have used onnxruntime which is CPU compatible but are we looking forward to have onnxruntime-gpu?

@Ki6an
Copy link
Owner

Ki6an commented Mar 8, 2022

I'm currently working on GPU support, there are a few issues, one of which is adding support for both onnxruntime (CPU) and onnxruntime-GPU in a single library and GPU also does not support quantization. hopefully, onnxruntime adds support for quantization for GPU soon.

@GenVr
Copy link

GenVr commented Mar 8, 2022

@Ki6an Meanwhile, is there any way to improve the inference times of a T5 on GPU?

@Ki6an
Copy link
Owner

Ki6an commented Mar 9, 2022

@gennarovaccaro follow this thread #34 it'll give you a basic idea

@Ki6an Ki6an closed this as completed May 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants