Releases: TabbyML/tabby
v0.6.0-rc.0
🚀 Features
👷 Built-in distribution support: run the completion / chat model worker on a different process or machine.
![image](https://private-user-images.githubusercontent.com/388154/285098582-fdf11aec-e960-46ef-86b0-1d79362b6554.png)
The community edition is limited to a maximum of one worker for code completion/chat. The enterprise edition is still in private alpha and is only available to our design partners. If you're interested, please DM Meng Zhang on the Slack channel to apply (limited slots available).
💬 Conversation history in the chat playground.
![image](https://private-user-images.githubusercontent.com/388154/285098211-b5295acb-70b4-4837-91ab-2360ed24088e.png)
🧰 Fixes and Improvements
- Fix slow repository indexing caused by a constrained memory arena in the tantivy index writer.
- Command line argument `--model` is now optional, so users can create a chat-only instance.
- New command line argument `--parallelism` to control throughput and VRAM usage: #727
- New API endpoint `/metrics` for Prometheus metrics collection.
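The `/metrics` endpoint serves Prometheus' standard text exposition format. As a minimal sketch of consuming it, the parser below maps sample lines to numeric values; the payload shown is illustrative, not actual Tabby output, and real metric names may differ:

```python
# Minimal parser for the Prometheus text exposition format, as served by a
# /metrics endpoint. The sample payload is illustrative only.

def parse_metrics(payload: str) -> dict[str, float]:
    """Map each sample line to its numeric value, skipping comments and blanks."""
    metrics = {}
    for line in payload.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip HELP/TYPE comments and blanks
            continue
        # The value is the last space-separated token; everything before it
        # (metric name plus optional labels) is the key.
        name, _, value = line.rpartition(" ")
        metrics[name] = float(value)
    return metrics

sample = """\
# HELP http_requests_total Total HTTP requests.
# TYPE http_requests_total counter
http_requests_total{path="/v1/completions"} 42
process_cpu_seconds_total 1.5
"""

print(parse_metrics(sample))
```

In practice you would point a Prometheus scrape job at the endpoint rather than parse it by hand; the sketch only shows the wire format.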
💫 New Contributors
- @liangfung made their first contribution in #702
- @erfanium made their first contribution in #742
- @costanzo made their first contribution in #748
- @darknight made their first contribution in #750
- @suside made their first contribution in #775
- @jpoisso made their first contribution in #838
- @Squadrick made their first contribution in #849
- @sonique6784 made their first contribution in #813
Full Changelog: v0.5.5...v0.6.0-rc.0
v0.5.5
⚠️ Notice
- The llama.cpp backend (CPU, Metal) now requires a re-download of GGUF models due to upstream format changes: #645 ggerganov/llama.cpp#3252
- Due to indexing format changes, `~/.tabby/index` needs to be manually removed before any further runs of `tabby scheduler`.
- `TABBY_REGISTRY` is replaced with `TABBY_DOWNLOAD_HOST` for the GitHub-based registry implementation.
🚀 Features
- Improved dashboard UI.
![image](https://private-user-images.githubusercontent.com/388154/280425016-02c0c488-7fe0-4e99-bcea-99fe7d3a2ace.png)
🧰 Fixes and Improvements
- The CPU backend is switched to llama.cpp: #638
- Add `server.completion_timeout` to control the code completion interface timeout: #637
- The CUDA backend is switched to llama.cpp: #656
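The release note gives the key path `server.completion_timeout`, which suggests it sits under a `[server]` section of Tabby's config file (conventionally `~/.tabby/config.toml`). A sketch of what setting it might look like; the value is an illustrative placeholder, not a recommended default, so check the documentation for units and defaults:

```toml
# Illustrative config fragment -- key path inferred from the release note
# (server.completion_timeout); the value below is a placeholder.
[server]
completion_timeout = 30
```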
- The tokenizer implementation is switched to llama.cpp, so Tabby no longer needs to download an additional tokenizer file: #683
💫 New Contributors
- @CrCs2O4 made their first contribution in #597
- @yusiwen made their first contribution in #620
- @gjedeer made their first contribution in #635
- @XpycT made their first contribution in #634
- @HKABIG made their first contribution in #662
Full Changelog: v0.4.0...v0.5.5
v0.5.4
v0.5.3
⚠️ Notice
- The llama.cpp backend (CPU, Metal) now requires a re-download of GGUF models due to upstream format changes: #645 ggerganov/llama.cpp#3252
- Due to indexing format changes, `~/.tabby/index` needs to be manually removed before any further runs of `tabby scheduler`.
- `TABBY_REGISTRY` is replaced with `TABBY_DOWNLOAD_HOST` for the GitHub-based registry implementation.
🚀 Features
- Improved dashboard UI.
![image](https://private-user-images.githubusercontent.com/388154/280425016-02c0c488-7fe0-4e99-bcea-99fe7d3a2ace.png)
🧰 Fixes and Improvements
- The CPU backend is switched to llama.cpp: #638
- Add `server.completion_timeout` to control the code completion interface timeout: #637
- The CUDA backend is switched to llama.cpp: #656
- The tokenizer implementation is switched to llama.cpp, so Tabby no longer needs to download an additional tokenizer file: #683
💫 New Contributors
- @CrCs2O4 made their first contribution in #597
- @yusiwen made their first contribution in #620
- @gjedeer made their first contribution in #635
- @XpycT made their first contribution in #634
- @HKABIG made their first contribution in #662
Full Changelog: v0.4.0...v0.5.3
v0.5.2-rc.0
v0.5.2
Release 0.5.2: http-api-bindings@0.5.2, llama-cpp-bindings@0.5.2, tabby@0.5.2, tabby-common@0.5.2, tabby-download@0.5.2, tabby-inference@0.5.2, tabby-scheduler@0.5.2. Generated by cargo-workspaces.
v0.5.1-rc.2
v0.5.1-rc.1
v0.5.0-rc.4
v0.5.0
Release 0.5.0: http-api-bindings@0.5.0, llama-cpp-bindings@0.5.0, tabby@0.5.0, tabby-common@0.5.0, tabby-download@0.5.0, tabby-inference@0.5.0, tabby-scheduler@0.5.0. Generated by cargo-workspaces.