Here is a screenshot demonstrating the issue (the input question was: "Where is China's capital city?"):
The same issue occurs with the following models on a Xiaomi 14 (available since Oct 2023; it carries a very powerful mobile SoC, the Qualcomm SM8650-AB Snapdragon 8 Gen 3 (4 nm), and was used for personal device-side AI PoC development).
zhouwg changed the title from "[llama.cpp] AI answer does not stop automatically when inference is launched" to "[llama.cpp] AI answer does not stop automatically when inference is launched on Android phone" on Apr 15, 2024.
CPU-only inference on Xiaomi 14.
Similar issues were originally reported upstream in llama.cpp:
Baichuan2-7B-Chat model converted to ggml-model-q4_0.gguf, AI answer does not stop automatically when inference is made ggerganov/llama.cpp#5034
Infinite loop of "context shift" ggerganov/llama.cpp#3969
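For readers unfamiliar with the symptom in the linked issues: generation in a llama.cpp-style decode loop normally ends when the model emits an end-of-sequence (EOS) token; if the model (or a broken chat template / quantization) never produces EOS, only the token budget stops the output, and with context shifting enabled it can loop indefinitely. The sketch below is a hypothetical, simplified illustration of that failure mode, not the actual llama.cpp API; `fake_model_without_eos`, `EOS_TOKEN`, and `generate` are made-up names for demonstration.

```python
# Hypothetical sketch (NOT the real llama.cpp API): why generation runs
# until the token budget when the model never emits EOS.
EOS_TOKEN = -1  # placeholder id standing in for the model's EOS token

def fake_model_without_eos(_generated_so_far):
    """Stands in for a model whose EOS handling is broken:
    it always returns an ordinary token, never EOS."""
    return 42

def generate(next_token, max_tokens):
    """Minimal decode loop: stop on EOS or when the budget runs out."""
    out = []
    for _ in range(max_tokens):
        tok = next_token(out)
        if tok == EOS_TOKEN:  # the normal stop condition
            break
        out.append(tok)
    return out

# With a broken EOS, only the max_tokens safety limit ends generation,
# matching the "answer does not stop automatically" symptom.
tokens = generate(fake_model_without_eos, max_tokens=64)
print(len(tokens))  # 64: budget exhausted, EOS never seen
```

Without a `max_tokens` cap (or with context shifting re-opening room for more tokens), the same loop would never terminate, which is the "infinite loop of context shift" behavior described in ggerganov/llama.cpp#3969.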