The Chat Demo instantiates a picoLLM inference engine in a web worker and supports a multi-turn, back-and-forth conversation with the LLM, similar to ChatGPT.
The Completion Demo instantiates a picoLLM inference engine in a web worker, accepts a prompt along with a set of optional parameters, and generates a single completion.
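
The difference between the two demos can be illustrated with a minimal sketch. This is not the picoLLM API: the `Generate` function type, the `GenerateOptions` fields, the `ChatSession` class, the prompt layout, and the `echoEngine` stub are all hypothetical names introduced for illustration. In the real demos, the engine call would be forwarded to the worker-hosted picoLLM instance instead of a stub.

```typescript
// Hypothetical option set; the real demos expose their own optional parameters.
interface GenerateOptions {
  completionTokenLimit?: number;
  temperature?: number;
}

// Hypothetical stand-in for "ask the engine for a completion".
type Generate = (prompt: string, options?: GenerateOptions) => Promise<string>;

interface Turn {
  role: "human" | "llm";
  text: string;
}

// Completion style: one prompt in, one completion out, no state kept.
async function complete(
  generate: Generate,
  prompt: string,
  options?: GenerateOptions
): Promise<string> {
  return generate(prompt, options);
}

// Flatten the conversation so far into a single prompt (assumed layout).
function buildPrompt(turns: readonly Turn[]): string {
  return turns.map((t) => `${t.role}: ${t.text}`).join("\n") + "\nllm:";
}

// Chat style: every request carries the accumulated history.
class ChatSession {
  private readonly turns: Turn[] = [];
  private readonly generate: Generate;

  constructor(generate: Generate) {
    this.generate = generate;
  }

  async send(message: string, options?: GenerateOptions): Promise<string> {
    this.turns.push({ role: "human", text: message });
    const reply = await this.generate(buildPrompt(this.turns), options);
    this.turns.push({ role: "llm", text: reply });
    return reply;
  }

  history(): readonly Turn[] {
    return this.turns;
  }
}

// Stub engine for trying the sketch without a model: reports prompt size.
const echoEngine: Generate = async (prompt) =>
  `seen ${prompt.split("\n").length} line(s)`;
```

Calling `complete(echoEngine, "Hello")` issues a single stateless request, while repeated calls to `new ChatSession(echoEngine).send(...)` grow the prompt with each turn, which is the behavior that separates a chat from a one-shot completion.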