Skip to content

metal : add BS=1 kernel for flash attention#6508

Merged
ggerganov merged 8 commits intogg/flash-attnfrom gg/flash-attn-vecApr 18, 2024