huggingface / optimum-nvidia Public

Notifications You must be signed in to change notification settings
Fork 86
Star 890

Code
Issues 48
Pull requests 5
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/optimum-nvidia

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

48 Open 17 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Speed up training?

#161 opened Oct 24, 2024 by fzyzcjy

How to use TensorRT model converter

#149 opened Sep 5, 2024 by FortunaZhang

OutOfMemory - Not able to run the text-generation.py example on V100, and A10G cores.

#146 opened Aug 11, 2024 by yahavb

Unable to install optimum-nvidia on Ubuntu

#144 opened Jul 5, 2024 by QuantumStaticFR

Error on Quickstart example

#143 opened Jul 3, 2024 by laikhtewari

Error for gated model access despite valid HF_TOKEN

#142 opened Jul 3, 2024 by laikhtewari

Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0!

#141 opened Jun 29, 2024 by d142796

Model of type falcon is not supported

#140 opened Jun 28, 2024 by puneeshkhanna

Model of type gpt_bigcode is not supported

#134 opened Jun 10, 2024 by Aryabhattacharjee

Providing input_embeddings for generation instead of IDs

#129 opened May 16, 2024 by verityw

No engine file found for LLama 3 and Cuda API error with LLama 2 with use_fp8

#128 opened May 14, 2024 by PhilSapiens

Can't Run README Code

#127 opened May 14, 2024 by hammoudhasan

Load from local path?

#126 opened May 9, 2024 by bdambrosio

Is there support for StoppingCriteria? enhancement

New feature or request

#123 opened Apr 26, 2024 by RomanKoshkin

Docker container fails on RTX A6000

#122 opened Apr 26, 2024 by RomanKoshkin

Failed to import optimum.nvidia

#121 opened Apr 25, 2024 by abpani

ValueError: mutable default <class 'tensorrt_llm.lora_manager.LoraBuildConfig'> for field lora_config is not allowed: use default_factory

#119 opened Apr 22, 2024 by manish-marwah

FileNotFoundError: [Errno 2] No such file or directory: 'trtllm-build'

#116 opened Apr 15, 2024 by Quang-elec44

When can I support llava

#106 opened Mar 25, 2024 by xusk

Instructions on how to set TP/PP

#102 opened Mar 22, 2024 by fxmarty

Avoid writting engines in .cache/huggingface/hub

#100 opened Mar 21, 2024 by fxmarty

Add optimum-cli export tensorrt-llm

#99 opened Mar 21, 2024 by fxmarty

can't build optimum-nvidia

#96 opened Mar 16, 2024 by dahwin

Original model configuration (config.json) was not found error during running inference using "Llama-2-7b-chat-hf"

#91 opened Mar 8, 2024 by raorajendra

Incorrect tensorrt_llm config class initialization bug

Something isn't working

#90 opened Mar 7, 2024 by Wojx

Previous 1 2 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly