Releases: michaelfeil/infinity
Releases · michaelfeil/infinity
0.0.23
What's Changed
- support hf_transfer by @michaelfeil in #81
- update dstack support by @deep-diver in #79
- Update Dockerfile to python 3.11 + CI fix by @michaelfeil in #83
- adding revision by @michaelfeil in #84
- starting to deprecated fastembed and ctranslate2 by @michaelfeil in #86
New Contributors
- @deep-diver made their first contribution in #79 Thanks @deep-diver
Full Changelog: 0.0.22...0.0.23
0.0.22
0.0.21
What's Changed
- improvements optimum by @michaelfeil in #74
- bump sentence-transformers to 2.3.0 by @michaelfeil in #76
- update dockerfile and tensorrt by @michaelfeil in #75
Full Changelog: 0.0.20...0.0.21
0.0.20
What's Changed
- update arm docker by @michaelfeil in #73
- patch release: optimum tokenization issue
Full Changelog: 0.0.19...0.0.20
0.0.19 - yanked
0.0.18 - yanked
What's Changed
- support mps backend. by @ninehills in #59
- Add optimum[onnx] by @michaelfeil in #68
New Contributors
- @ninehills made their first contribution in #59 Thanks @ninehills for sharing this on twitter.
Full Changelog: 0.0.17...0.0.18
0.0.17
What's Changed
Breaking: Switched to Cuda 12.1 and torch 2.1.2
- Add rerank/predict endpoint in the API by @michaelfeil in #50
- update dockerfile (Cuda 12.1 and torch 2.1.2) and tests by @michaelfeil in #54
Full Changelog: 0.0.16...0.0.17
0.0.16
What's Changed
- fixing delayed warmup by @michaelfeil in #53
- expose
capabilities
by @michaelfeil in #53
Full Changelog: 0.0.15...0.0.16
0.0.15
What's Changed
- Linting and better model errors by @michaelfeil in #52
Full Changelog: 0.0.14...0.0.15
0.0.14
What's Changed
- adding tests and better typing by @michaelfeil in #51
Full Changelog: 0.0.13...0.0.14