Skip to content

Commit a320ce7

Browse files
Update ggml/src/ggml-cuda/cpy.cu
Co-authored-by: Johannes Gäßler <[email protected]>
1 parent a4443d3 commit a320ce7

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

ggml/src/ggml-cuda/cpy.cu

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -98,6 +98,7 @@ template<dequantize_kernel_t dequant, int qk>
9898
static __device__ void cpy_blck_q_f32(const char * cxi, char * cdsti) {
9999
float * cdstf = (float *)(cdsti);
100100

101+
#pragma unroll
101102
for (int j = 0; j < qk/2; j++) {
102103
float2 dq;
103104
dequant(cxi, 0, j, dq);

0 commit comments

Comments
 (0)