Commit 410171a

CUDA: conv2d update block size
1 parent ac5e0c0 commit 410171a

1 file changed, +2 -2 lines changed

ggml/src/ggml-cuda/conv2d.cuh

Lines changed: 2 additions & 2 deletions
@@ -1,9 +1,9 @@
 #pragma once
 #include "common.cuh"
 
-#define BS_OC 64
+#define BS_OC 16
 #define BS_ICKHKW 16
-#define BS_NOHOW 64
+#define BS_NOHOW 128
 
 #define WMMA_M 16
 #define WMMA_N 16
