Use ArcSwap for aggregate fn registry by robert3005 · Pull Request #8072 · vortex-data/vortex

robert3005 · 2026-05-22T22:49:57Z

ArcSwap is faster than a lock for reads. These sessions are mutable, but mutations
are rare and retrievals are common.

Signed-off-by: Robert Kruszewski <github@robertk.io>
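
The read-mostly trade-off described above can be sketched with the standard library alone. This is not the `arc_swap` crate and not the Vortex code: `Registry`, `snapshot`, and `register` are hypothetical names, and an `RwLock<Arc<HashMap>>` stands in for the swappable pointer. It only illustrates the shape of the pattern, where readers grab a cheap snapshot and the rare writer publishes a whole new map.

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

/// Hypothetical read-mostly registry (std-only stand-in, not arc_swap).
struct Registry {
    current: RwLock<Arc<HashMap<String, u32>>>,
}

impl Registry {
    fn new() -> Self {
        Self { current: RwLock::new(Arc::new(HashMap::new())) }
    }

    /// Common path: clone the Arc; the read lock is released on return.
    fn snapshot(&self) -> Arc<HashMap<String, u32>> {
        let guard = self.current.read().unwrap();
        Arc::clone(&*guard)
    }

    /// Rare path: copy the map, modify the copy, then swap it in.
    fn register(&self, name: &str, id: u32) {
        let mut slot = self.current.write().unwrap();
        let mut next: HashMap<String, u32> = (**slot).clone();
        next.insert(name.to_string(), id);
        *slot = Arc::new(next);
    }
}

fn main() {
    let registry = Registry::new();
    registry.register("sum", 1);
    let before = registry.snapshot();
    registry.register("min", 2);
    let after = registry.snapshot();
    // A snapshot taken before a write keeps seeing the old map.
    assert_eq!(before.len(), 1);
    assert_eq!(after.len(), 2);
}
```

A snapshot is immutable once taken, so a reader never observes a half-applied registration; the cost is a full map copy on every write, which is only reasonable because writes are rare.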

codspeed-hq · 2026-05-22T23:02:43Z

Merging this PR will not alter performance

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚠️ Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 5 improved benchmarks
❌ 2 regressed benchmarks
✅ 1244 untouched benchmarks

Warning

Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

| | Mode | Benchmark | `BASE` | `HEAD` | Efficiency |
|---|---|---|---|---|---|
| ⚡ | WallTime | `cuda/bitpacked_u8/unpack/3bw[100M]` | 353 µs | 300.1 µs | +17.63% |
| ⚡ | Simulation | `encode_varbin[(1000, 2)]` | 162.7 µs | 142.9 µs | +13.87% |
| ⚡ | Simulation | `encode_varbin[(1000, 32)]` | 170.1 µs | 150 µs | +13.45% |
| ⚡ | Simulation | `encode_varbin[(1000, 4)]` | 163.8 µs | 143.7 µs | +14.03% |
| ⚡ | Simulation | `encode_varbin[(1000, 8)]` | 165.1 µs | 145.2 µs | +13.69% |
| ❌ | Simulation | `new_alp_prim_test_between[f32, 16384]` | 103.7 µs | 118.1 µs | -12.21% |
| ❌ | Simulation | `null_count_run_end[(10000, 4, 0.01)]` | 112.2 µs | 126.6 µs | -11.41% |

Tip

Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.

Comparing `rk/aggregatearcswap` (9630c96) with `develop` (495f30e)

onursatici · 2026-05-27T06:04:10Z

 /// Session state for aggregate function vtables.
 #[derive(Debug)]
 pub struct AggregateFnSession {
-    registry: AggregateFnRegistry,


Shall we delete this type alias now that it is unused?

onursatici · 2026-05-27T06:04:31Z

@@ -107,15 +107,20 @@ impl Default for AggregateFnSession {

 impl AggregateFnSession {
    /// Returns the aggregate function registry.


This doc comment is now stale.

onursatici · 2026-05-27T06:15:47Z


        let session = ctx.session().clone();
-        let kernels = &session.aggregate_fns().kernels;
+        let kernels = &session.aggregate_fns().kernels.load();


I think holding this for the entire body is mostly fine, but if there is recursion here it might fall back to being as slow as the RwLock. That is, we load once, hold that guard, and call the kernel; if the array is chunked, the kernel calls aggregate, which calls load once more, and so on. I believe ArcSwap has a limited number of fast permits per thread, and once they are exhausted it falls back to refcount increments.

If we narrow the load scope to a single kernel execution in the loop below, that problem goes away. But we are unlikely to hit that level of recursion, and the perf degradation is not that bad (it falls back to what the RwLock does), so up to you.
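
The scoping concern in this comment can be illustrated with a toy model. Nothing below is the `arc_swap` or Vortex API: `Guard`, `load`, `aggregate_wide`, and `aggregate_narrow` are hypothetical, and a thread-local counter merely stands in for whatever per-thread fast path a real guard consumes. The sketch only shows how many guards are alive at once when a guard is held across a recursive call, compared with releasing it before recursing.

```rust
use std::cell::Cell;
use std::collections::HashMap;

type Kernels = HashMap<&'static str, fn(u32) -> u32>;

thread_local! {
    // Toy stand-ins: guards currently alive on this thread, and the peak seen.
    static LIVE: Cell<usize> = Cell::new(0);
    static PEAK: Cell<usize> = Cell::new(0);
}

/// Hypothetical read guard that counts itself while alive.
struct Guard<'a>(&'a Kernels);

fn load(kernels: &Kernels) -> Guard<'_> {
    LIVE.with(|l| {
        l.set(l.get() + 1);
        PEAK.with(|p| p.set(p.get().max(l.get())));
    });
    Guard(kernels)
}

impl Drop for Guard<'_> {
    fn drop(&mut self) {
        LIVE.with(|l| l.set(l.get() - 1));
    }
}

fn double(x: u32) -> u32 {
    x * 2
}

/// Guard held for the whole body, including the recursive call.
fn aggregate_wide(kernels: &Kernels, depth: u32) -> u32 {
    let guard = load(kernels);
    let child = if depth > 0 { aggregate_wide(kernels, depth - 1) } else { 1 };
    (guard.0["double"])(child)
}

/// Guard held only long enough to copy the kernel out.
fn aggregate_narrow(kernels: &Kernels, depth: u32) -> u32 {
    let guard = load(kernels);
    let kernel = guard.0["double"];
    drop(guard); // released before recursing
    let child = if depth > 0 { aggregate_narrow(kernels, depth - 1) } else { 1 };
    kernel(child)
}

/// Runs `f` and reports the peak number of simultaneously live guards.
fn peak_of(f: fn(&Kernels, u32) -> u32, kernels: &Kernels, depth: u32) -> usize {
    PEAK.with(|p| p.set(0));
    f(kernels, depth);
    PEAK.with(|p| p.get())
}

fn main() {
    let mut kernels: Kernels = HashMap::new();
    kernels.insert("double", double);
    println!("wide peak:   {}", peak_of(aggregate_wide, &kernels, 5));
    println!("narrow peak: {}", peak_of(aggregate_narrow, &kernels, 5));
}
```

In this model the wide variant keeps one guard alive per recursion level, while the narrow variant never has more than one. Whether that matters in practice depends on how deep the real recursion goes, which is the judgment call the comment leaves to the author.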

Use ArcSwap for aggregate fn registry

9630c96

Signed-off-by: Robert Kruszewski <github@robertk.io>

robert3005 requested a review from gatesn May 22, 2026 22:50

robert3005 added the changelog/chore A trivial change label May 22, 2026

robert3005 requested review from joseph-isaacs and onursatici May 27, 2026 00:46

onursatici reviewed May 27, 2026

robert3005 wants to merge 1 commit into develop from rk/aggregatearcswap
