Skip to content

Commit

Permalink
CI: Bump to 4-bit TinyLlama
Browse files Browse the repository at this point in the history
  • Loading branch information
ariya committed Mar 5, 2024
1 parent 151ed2e commit 59c8228
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions .github/actions/prepare-llm/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ runs:
steps:
- name: Download TinyLlama model
shell: bash
run: curl -OL https://huggingface.co/TheBloke/TinyLlama-1.1B-1T-OpenOrca-GGUF/resolve/main/tinyllama-1.1b-1t-openorca.Q3_K_M.gguf
run: curl -OL https://huggingface.co/TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF/resolve/main/tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf

- name: Download and unpack Nitro
shell: bash
Expand All @@ -26,4 +26,4 @@ runs:
run: |
curl http://localhost:8080/inferences/llamacpp/loadmodel \
-H 'Content-Type: application/json' \
-d '{"llama_model_path": "./tinyllama-1.1b-1t-openorca.Q3_K_M.gguf"}'
-d '{"llama_model_path": "./tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf"}'

0 comments on commit 59c8228

Please sign in to comment.