Skip to content

Commit 2e757bb

Browse files
authored
Merge pull request #529 from julcst/fix-jit-half
Add TCNN_HALF_PRECISION definition to RTC-kernel
2 parents 38e50ed + 09ee3d6 commit 2e757bb

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

src/rtc_kernel.cu

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -132,6 +132,7 @@ CudaRtcKernel::CudaRtcKernel(const std::string& name, const std::string& kernel_
132132

133133
std::vector<std::string> opts = {
134134
fmt::format("--gpu-architecture=compute_{}", cc),
135+
fmt::format("-DTCNN_HALF_PRECISION={}", TCNN_HALF_PRECISION),
135136
fmt::format("-DTCNN_MIN_GPU_ARCH={}", cc),
136137
"--std=c++14",
137138
#ifdef TCNN_RTC_USE_FAST_MATH

0 commit comments

Comments
 (0)