[Performance]: Use a cache for PodTemplateHash and optimize calls to CreateOrUpdate by unmarshall · Pull Request #439 · ai-dynamo/grove

unmarshall · 2026-02-17T14:22:41Z

What type of PR is this?

/kind enhancement
/area performance

What this PR does / why we need it:

Introduces a LRU cache with bounded items to store computed PodSpec hash.
Adapted all reconcilers to use the cache instead always computing the cache even if there was no change in PodSpec.
Removed the use of dump.ForHash and replaced it with simple JSON serialization. Added benchmark tests to prove that dump.ForHash is quite slow/expensive w.r.t CPU usage.
Optimizes calls to controller-runtime CreateOrUpdate since that uses too many DeepCopy calls making it a bit expensive.

Which issue(s) this PR fixes:

Fixes #406 and partially addresses #407

Special notes for your reviewer:

Does this PR introduce a API change?

Reduced CPU usage in reconcilers by optimizing PodSpec hash computation.

Additional documentation e.g., enhancement proposals, usage docs, etc.:

* Introduced LRU cache for PodTemplateSpec hashes. * Added utility functions to compute Hash. * Replaced dump.ForHash with JSON marshaling. * Added benchmark tests to different hash computing options to compare. * Adapted Pod component in PCLQ reconciler to use the hash cache. * Adapted PCLQ reconcile status to use the hash cache. * Adapted PCLQ component in PCS reconciler to pre-compute hashes via the hash cache and optimize the calls for CreateOrPatch. * Adapted PCSG status reconciliation to optimize hash computation via hash cache. Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

copy-pr-bot · 2026-02-17T14:22:45Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

unmarshall added 4 commits February 17, 2026 19:46

Adapted usage of hash cache across reconcilers.

a13e1fa

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

added unit tests, license headers and formatting fixes

dda5c2f

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

increased the max no of items in cache to 50K

f5455ff

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

unmarshall requested review from Ronkahn21, gflarity, sanjaychatterjee and shayasoolin as code owners February 17, 2026 14:22

unmarshall marked this pull request as draft February 17, 2026 14:22

fixed linting errors

9d25194

Signed-off-by: Madhav Bhargava <madhav.bhargava@sap.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Performance]: Use a cache for PodTemplateHash and optimize calls to CreateOrUpdate#439

[Performance]: Use a cache for PodTemplateHash and optimize calls to CreateOrUpdate#439
unmarshall wants to merge 5 commits intoai-dynamo:mainfrom
unmarshall:hashcache

unmarshall commented Feb 17, 2026

Uh oh!

copy-pr-bot bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

unmarshall commented Feb 17, 2026

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a API change?

Additional documentation e.g., enhancement proposals, usage docs, etc.:

Uh oh!

copy-pr-bot bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant