Deduplicate `calc_chunk_indices_kernel` #1657

jinsolp · 2025-12-19T00:07:15Z

Reduces binary size by deduplicating calc_chunk_indices_kernel. This PR reduces instantiations from 62 -> 1 for each template (BlockDim=32, 64, ..., 1024)

Binary Size Changes

CUDA 12.9: 1096.15MB ->
CUDA 13: 432.98 MB->

divyegala · 2025-12-19T01:13:48Z

@robertmaynard can you review this PR as well? I was under the impression that launching a kernel across TUs would not work in CUDA whole compilation mode but here it seems to be working. Aren't kernels supposed to have their symbols hidden too?

robertmaynard

This approach is invalid and goes against the guidance in https://developer.nvidia.com/blog/cuda-c-compiler-updates-impacting-elf-visibility-and-linkage/

This currently only work in cuvs as we have failed to remove: https://github.com/rapidsai/cuvs/blob/main/cpp/cmake/modules/ConfigureCUDA.cmake#L38

jinsolp · 2025-12-19T20:07:20Z

Ohh I see okay 🥲

divyegala · 2026-01-05T17:41:21Z

cpp/src/neighbors/ivf_common.cu

The kernel needs to be launched in the same TU as which it is defined. We can (but should ideally avoid) pass the pointer around to other TUs but they shouldn't be attempting to launch the kernel.

jinsolp · 2026-01-12T17:41:17Z

Hi @robertmaynard , can you check this PR? I've added the changes to follow the guidelines

jinsolp · 2026-01-16T01:33:26Z

/merge

separate .cu for calc_chunk_indices_kernel

724afcd

jinsolp self-assigned this Dec 19, 2025

jinsolp requested review from a team as code owners December 19, 2025 00:07

github-project-automation bot added this to Vector Search, ML, & Data Mining Release Board Dec 19, 2025

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board Dec 19, 2025

jinsolp added non-breaking Introduces a non-breaking change improvement Improves an existing functionality labels Dec 19, 2025

jinsolp requested a review from divyegala December 19, 2025 00:07

robertmaynard requested changes Dec 19, 2025

View reviewed changes

jinsolp added 3 commits December 19, 2025 13:42

Merge branch 'main' into bs/dedup-calc_chunk_indices_kernel

7599d93

ptr in same TU

583d7d1

Merge branch 'main' into bs/dedup-calc_chunk_indices_kernel

444d00a

divyegala reviewed Jan 5, 2026

View reviewed changes

cjnolet moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board Jan 5, 2026

jinsolp added 3 commits January 7, 2026 16:36

Merge branch 'main' into bs/dedup-calc_chunk_indices_kernel

f93b2fe

launch in same TU

44f56d2

move arithmetic back to header

a50cce9

divyegala approved these changes Jan 8, 2026

View reviewed changes

Merge branch 'main' into bs/dedup-calc_chunk_indices_kernel

0e97eda

robertmaynard approved these changes Jan 15, 2026

View reviewed changes

Merge branch 'main' into bs/dedup-calc_chunk_indices_kernel

4d6c6bc

rapids-bot bot merged commit 3138284 into rapidsai:main Jan 16, 2026
191 of 193 checks passed

github-project-automation bot moved this from In Progress to Done in Vector Search, ML, & Data Mining Release Board Jan 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Deduplicate `calc_chunk_indices_kernel` #1657

Deduplicate `calc_chunk_indices_kernel` #1657

Uh oh!

jinsolp commented Dec 19, 2025

Uh oh!

divyegala commented Dec 19, 2025

Uh oh!

robertmaynard left a comment

Uh oh!

jinsolp commented Dec 19, 2025

Uh oh!

divyegala Jan 5, 2026 •

edited

Loading

Uh oh!

jinsolp commented Jan 12, 2026

Uh oh!

jinsolp commented Jan 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Deduplicate calc_chunk_indices_kernel #1657

Deduplicate calc_chunk_indices_kernel #1657

Uh oh!

Conversation

jinsolp commented Dec 19, 2025

Binary Size Changes

Uh oh!

divyegala commented Dec 19, 2025

Uh oh!

robertmaynard left a comment

Choose a reason for hiding this comment

Uh oh!

jinsolp commented Dec 19, 2025

Uh oh!

divyegala Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jinsolp commented Jan 12, 2026

Uh oh!

jinsolp commented Jan 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Deduplicate `calc_chunk_indices_kernel` #1657

Deduplicate `calc_chunk_indices_kernel` #1657

divyegala Jan 5, 2026 •

edited

Loading