CLBlast /w new ggml & vicuna is not working. #1415
Comments
Can confirm; the latest CLBlast + Accelerate build (I'm on macOS) shows weird responses for me too, regardless of model (tried q4_0 and q5_0 of WizardLM and Vicuna; all showed gibberish).
+1. It only works if there is a single speaker in the initial prompt. Currently I need to manually type the reverse prompt to start the conversation before it works.
Same issue here with Alpaca and WizardLM. Higher-precision formats like q8_0 work well, but if you give it complex input it spews this trash (unless you use q8, f16, f32, or maybe q5*). OpenCL + NVIDIA on Linux.
It may have something to do with the new quantization format.
I forgot to update the OpenCL kernels. See the CUDA changes here for reference: b9fd7ee#diff-66b17223e8ba54054fb2600ecbd31107f8b917bac36c7f3789811b0f0e9802a1L83-L106. I'll try to do it, but I don't have CLBlast set up, so I'm hoping somebody else can propose a fix.
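To illustrate why an out-of-date kernel would produce gibberish rather than crash, here is a minimal sketch of a 4-bit packing mismatch. Both layouts and all function names below are hypothetical and chosen only for illustration; they are not claimed to be the exact ggml block format or the actual OpenCL kernel code.

```python
# Illustrative sketch only: two hypothetical ways to pack 32 four-bit
# values into 16 bytes. Not the actual ggml block layout.
QK = 32  # assumed number of values per block


def pack_interleaved(vals):
    # Byte j holds elements 2j (low nibble) and 2j+1 (high nibble).
    return bytes(vals[2 * j] | (vals[2 * j + 1] << 4) for j in range(QK // 2))


def pack_split(vals):
    # Byte j holds elements j (low nibble) and j + QK/2 (high nibble).
    return bytes(vals[j] | (vals[j + QK // 2] << 4) for j in range(QK // 2))


def unpack_interleaved(data):
    out = [0] * QK
    for j, b in enumerate(data):
        out[2 * j] = b & 0x0F
        out[2 * j + 1] = b >> 4
    return out


def unpack_split(data):
    out = [0] * QK
    for j, b in enumerate(data):
        out[j] = b & 0x0F
        out[j + QK // 2] = b >> 4
    return out


vals = [i % 16 for i in range(QK)]

# Matching writer and reader round-trip correctly.
roundtrip = unpack_split(pack_split(vals))

# A reader that still assumes the other layout gets every value,
# but in the wrong positions.
scrambled = unpack_interleaved(pack_split(vals))
```

In this sketch the mismatched reader recovers the right set of values in the wrong order, so the weights are silently permuted instead of triggering an error, which would be consistent with output that looks like garbage.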
@ggerganov I will attempt it.
Can you check again with the latest fixes?
Works fine for me 😄
It's still broken on some prompts, but this is the first time I'm trying to use CLBlast, so maybe it was always broken :) Here is an example: with CLBlast it produces garbage, while without it works fine.
@SlyEcho Sorry for the late reply. It works great! Thank you; closing.
Hello.
When using the new ggml build wrapped with CLBlast and a Vicuna prompt, main.exe shows weird responses. The characters after the last "### Humans:" are generated by main.exe.
Both the CPU and CUDA versions work well.
The model file I tried is ggml-vic7b-q4_0.bin at https://huggingface.co/eachadea/ggml-vicuna-7b-1.1/tree/main