Skip to content

Releases: michaelfeil/infinity

0.0.13

20 Dec 01:56
8213424
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.12...0.0.13

0.0.12

18 Dec 12:45
8b49a01
Compare
Choose a tag to compare

What's Changed

Breaking:

  • make fp16 default for CUDA by @michaelfeil in #47 -> This may lower your precision for a 2x speedup.

Full Changelog: 0.0.11...0.0.12

0.0.11

18 Dec 00:37
b7aea37
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.10...0.0.11

0.0.10

05 Dec 13:55
69f5f50
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.9...0.0.10

0.0.9

05 Dec 00:13
584b320
Compare
Choose a tag to compare

What's Changed

New Contributors

  • @BrianPulfer made their first contribution in #37 also first contributor to this repo - thanks!

Full Changelog: 0.0.8...0.0.9

0.0.8

25 Nov 13:29
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.7...0.0.8

0.0.7

12 Nov 10:41
6cad81c
Compare
Choose a tag to compare

What's Changed

  • Docker: Cuda11.8, make dependencies optional by @michaelfeil in #33
  • Breaking changes: New install groups pip install infinity-emb[server,logging,onnx-gpu-runtime]
  • add onnx-gpu by @michaelfeil in #26

Full Changelog: 0.0.6...0.0.7

0.0.6

11 Nov 13:29
2b930f1
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.5...0.0.6

0.0.5

06 Nov 07:58
a1389ce
Compare
Choose a tag to compare

What's Changed

  • Docker image multi by @michaelfeil in #24
  • patch missing event -> 200ms to 7ms inference at Batch size 1

Full Changelog: 0.0.4...0.0.5

0.0.4

04 Nov 12:33
be64ff2
Compare
Choose a tag to compare

What's Changed

PRs:

Issues:
Closes #5 ONNX Support via https://github.com/qdrant/fastembed/
Closes #22 making pytorch and optional dependency

tl,dr

  • fastembed as backend besides ct2 or torch
  • v1/models returns "backend"
  • makes torch an optional dependency
  • calculates "min" sleep time dynamically on startup _> slightly optimized.
  • default model is now "BAAI/bge-small-en-v1.5"

Full Changelog: 0.0.3...0.0.4