Skip to content

Commit 7e787c2

Browse files
committed
Revert "Kcpp triage for rowsplit: revert ggml-org#16715 until ggml-org#16799 is resolved"
This reverts commit 3aec5ed.
1 parent af94884 commit 7e787c2

File tree

10 files changed

+959
-166
lines changed

10 files changed

+959
-166
lines changed

ggml/src/ggml-cuda/common.cuh

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1021,3 +1021,16 @@ struct ggml_backend_cuda_context {
10211021
return pool(device);
10221022
}
10231023
};
1024+
1025+
struct ggml_cuda_mm_fusion_args_host {
1026+
const ggml_tensor * x_bias = nullptr;
1027+
const ggml_tensor * gate = nullptr;
1028+
const ggml_tensor * gate_bias = nullptr;
1029+
ggml_glu_op glu_op;
1030+
};
1031+
struct ggml_cuda_mm_fusion_args_device {
1032+
const void * x_bias = nullptr;
1033+
const void * gate = nullptr;
1034+
const void * gate_bias = nullptr;
1035+
ggml_glu_op glu_op;
1036+
};

ggml/src/ggml-cuda/convert.cuh

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,4 @@
1+
#pragma once
12
#include "common.cuh"
23

34
#define CUDA_DEQUANTIZE_BLOCK_SIZE 256

ggml/src/ggml-cuda/ggml-cuda.cu

Lines changed: 352 additions & 1 deletion
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)