Commit

Fix issues with SSE3 version for vec_dot_q4_0_b16_q8_0_b16
Srihari-mcw committed May 27, 2024
1 parent d5ab66f commit 46c0cd7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion ggml-quants.c
@@ -4719,7 +4719,7 @@ void ggml_vec_dot_q4_0_b16_q8_0_b16(int n, float * restrict s, size_t bs, const
_mm_prefetch(&y[0] + sizeof(block_q8_0), _MM_HINT_T0);

// Compute combined scale for the block 0 and 1
- const __m128 d_0_1 = _mm_set1_ps( GGML_BF16_TO_FP32(ggml_make_bf16(x[i].d)) * GGML_BF16_TO_FP32(ggml_make_bf16(y[i].d)));
+ const __m128 d_0_1 = _mm_set1_ps( GGML_BF16_TO_FP32(ggml_make_bf16(x[0].d)) * GGML_BF16_TO_FP32(ggml_make_bf16(y[0].d)));

const __m128i tmp_0_1 = _mm_loadu_si128((const __m128i *)x[0].qs);
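
For reference, a scalar sketch of what the changed line computes, assuming `d` holds the raw 16 bits of a bfloat16 scale. The `demo_` names and the `demo_block` type are hypothetical stand-ins, not the repository's `block_q4_0`/`block_q8_0` definitions or its `GGML_BF16_TO_FP32` macro. Judging from the neighboring `x[0].qs` load and the "block 0 and 1" comment, the scale here should come from the same block the quants are loaded from, which is what the index change from `i` to `0` achieves.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical block type: only the bf16 scale field is modeled here. */
typedef struct { uint16_t d; } demo_block;

/* A bfloat16 value is the upper 16 bits of an IEEE-754 binary32 value,
 * so widening it is a 16-bit left shift of the raw bits. */
static float demo_bf16_to_fp32(uint16_t h) {
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);   /* bit copy, avoids aliasing issues */
    return f;
}

/* Combined scale for the block pair at index idx: scale(x) * scale(y).
 * The index must match the block whose quants are being loaded. */
static float demo_combined_scale(const demo_block *x, const demo_block *y, size_t idx) {
    assert(x != NULL && y != NULL);
    return demo_bf16_to_fp32(x[idx].d) * demo_bf16_to_fp32(y[idx].d);
}
```

In the SSE3 path this scalar product is then broadcast to all four lanes with `_mm_set1_ps`.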
