Releases: jllllll/llama-cpp-python-cuBLAS-wheels
macOS Metal Wheels
Available for Intel and Apple Silicon CPUs.
Install with:
python -m pip install llama-cpp-python --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/basic/cpu
0.1.85 builds likely won't work until fixes to the workflow are made.
CPU-only
While this repo is focused on providing cuBLAS wheels, it has become evident that there is a need for CPU-only wheels that do not require AVX2.
Wheels can be more easily downloaded from: https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/AVX/cpu
Replace AVX with one of basic, AVX2 or AVX512, depending on what your CPU supports.
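For example, the AVX2 CPU build would be installed with:
python -m pip install llama-cpp-python --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/AVX2/cpu
On Linux, one rough way to check which of these instruction sets your CPU reports (a quick sketch, not part of this repo's instructions) is:
grep -o 'avx[0-9a-z_]*' /proc/cpuinfo | sort -u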
Basic non-AVX Wheels
Wheels without AVX, FMA and F16C support for compatibility with older CPUs.
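These can be installed with the basic/cpu index shown above:
python -m pip install llama-cpp-python --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/basic/cpu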
AMD ROCm
All wheels are built for AVX2 CPUs for now.
Linux
Wheels built for ROCm 5.4.2, 5.5 and 5.6.1.
Windows
These wheels should be considered experimental and may not work at all, as Windows ROCm is very new.
To test it, you will need ROCm for Windows: https://www.amd.com/en/developer/rocm-hub/hip-sdk.html
Consult the possibly inaccurate GPU compatibility chart here: https://rocm.docs.amd.com/en/docs-5.5.1/release/windows_support.html
If your GPU isn't on that list, or it just doesn't work, you may need to build llama-cpp-python manually and hope your GPU is compatible.
Another option is to do this: ggerganov/llama.cpp#1087 (comment)
Pre-0.1.80 wheels were built using ggerganov/llama.cpp#1087
Installation
To install, you can use this command:
python -m pip install llama-cpp-python --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/AVX2/rocm5.5
This will install the latest llama-cpp-python version available from here for ROCm 5.5. You can change rocm5.5 in the URL to select a different ROCm version; see the example after the list below.
Supported ROCm versions:
- Windows
  - 5.5.1
- Linux
  - 5.4.2
  - 5.5
  - 5.6.1

Some adjacent versions of ROCm may also be compatible. For example, 5.4.1 should be compatible with the 5.4.2 wheel.
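As an illustration, assuming each version's index follows the same URL pattern, the Linux ROCm 5.6.1 wheels would presumably be installed with:
python -m pip install llama-cpp-python --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/AVX2/rocm5.6.1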
GitHub Actions workflow here: https://github.com/jllllll/llama-cpp-python-cuBLAS-wheels/blob/main/.github/workflows/build-wheel-rocm.yml
Webui Wheels
These are basic/AVX/AVX2 wheels built under a different namespace to allow for simultaneous installation with the main llama-cpp-python package.
Installation can be done with this command:
python -m pip install llama-cpp-python-cuda --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/textgen/AVX2/cu117
The index URL can be changed similarly to what is described in the main installation instructions.
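For instance, assuming the textgen namespace follows the same layout as the main indexes, the AVX (rather than AVX2) CUDA 11.7 build would presumably come from:
python -m pip install llama-cpp-python-cuda --prefer-binary --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/textgen/AVX/cu117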