You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There have been 3 important updates to the llama.cpp recently (a few days ago). One of them is the addition of pipeline parallelism (multi-threading) (ggerganov/llama.cpp#6017). The other important update was the correction of the embeddings bug. And one more update to release all GPU memory (this was also a bug - memory was not released). Thank you slaren!
These are very important updates. Looking forward to the addition of the latest llama.cpp code to LLamaSharp.
The text was updated successfully, but these errors were encountered:
There have been 3 important updates to the llama.cpp recently (a few days ago). One of them is the addition of pipeline parallelism (multi-threading) (ggerganov/llama.cpp#6017). The other important update was the correction of the embeddings bug. And one more update to release all GPU memory (this was also a bug - memory was not released). Thank you slaren!
These are very important updates. Looking forward to the addition of the latest llama.cpp code to LLamaSharp.
The text was updated successfully, but these errors were encountered: