Skip to content

CUDA: use MMQ instead of cuBLAS by default#8075

Merged
JohannesGaessler merged 1 commit intoggerganov:masterfrom JohannesGaessler:cuda-mmq-defaultJun 24, 2024