-
Notifications
You must be signed in to change notification settings - Fork 120
Issues: michaelfeil/infinity
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Support for colbert style late interaction models in rerank endpoint
#503
opened Dec 28, 2024 by
wwymak
health endpoint does not really provide insights about healthiness
#483
opened Nov 28, 2024 by
bufferoverflow
5 tasks
Maintainer: Breaking CI / Python installs
help wanted
Extra attention is needed
#415
opened Oct 11, 2024 by
michaelfeil
Add a End-to-end unit test for image embeddings and audio embeddings
help wanted
Extra attention is needed
#378
opened Sep 24, 2024 by
michaelfeil
when use engine optimum device tensorrt,startup fail
#372
opened Sep 23, 2024 by
weibingo
2 of 4 tasks
Reranker dynamic quantization
help wanted
Extra attention is needed
#363
opened Sep 16, 2024 by
rawsh-rubrik
jinaai/jina-reranker-v1-*-en does not work with optimum
#362
opened Sep 13, 2024 by
rawsh
2 of 4 tasks
Issue running cross-encoder onnx model exported with optimum-cli
#361
opened Sep 13, 2024 by
rawsh
2 of 4 tasks
Write a custom flash-attention function for the deberta model.
#359
opened Sep 12, 2024 by
wolfassi123
3 tasks done
Support Integration with KServe
help wanted
Extra attention is needed
#352
opened Sep 6, 2024 by
indranilr
Add Installation Option to Depend Only on ONNX, Excluding New Torch and CUDA Packages
#332
opened Aug 9, 2024 by
bash99
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.