To deploy an LLM to mobile, we need a quantized model in GGUF format for CPU inference.
llama.cpp ships a bare-bones iOS app built with SwiftUI. You can find the example in the repository under examples/llama.swiftui. There is also a lengthy discussion, Performance of llama.cpp on Apple Silicon A-series, where several models have been benchmarked and instructions are given on how you can run benchmarks yourself.
The general steps to run the app in the simulator:
git clone https://github.com/ggerganov/llama.cpp
Download Xcode on your Mac from the App Store. Don't forget to install the iOS Simulator as well; you will be prompted to install it during Xcode setup.
In your terminal, from the directory where you cloned the repo, type
cd llama.cpp/examples/llama.swiftui
xed .
This will open llama.swiftui in Xcode.
Select an iOS simulator from the top bar (iPhone 15 Pro Max in my case).
Click the Run icon to build the app and start the iOS simulator.
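The steps above can be condensed into a single shell session. This is a sketch: xed requires Xcode's command-line tools to be installed, and the simulator you select must be one available on your machine.

```shell
# Clone llama.cpp and open the SwiftUI example in Xcode
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp/examples/llama.swiftui
xed .   # opens the llama.swiftui project in Xcode; pick a simulator and press Run
```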
I quantized bigcode/starcoderbase-1b and bigcode/starcoderbase-3b to GGUF, then downloaded and loaded them in the app. The quantized models are available at cosmo3769/starcoderbase-1b-GGUF and cosmo3769/starcoderbase-3b-GGUF.
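For reference, the GGUF conversion and quantization can be done with llama.cpp's own tooling. The commands below are a sketch, run from the llama.cpp root: the conversion script and quantize binary names have changed across llama.cpp versions, and the local checkpoint path is a placeholder, so check them against your checkout.

```shell
# Convert the Hugging Face checkpoint to a full-precision GGUF file
python convert_hf_to_gguf.py path/to/starcoderbase-1b --outfile starcoderbase-1b-f16.gguf

# Quantize to 4-bit (Q4_K_M here) for CPU inference on device
./llama-quantize starcoderbase-1b-f16.gguf starcoderbase-1b-Q4_K_M.gguf Q4_K_M
```

The 4-bit variants keep memory use low enough for A-series devices; larger quant types trade memory for quality.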