Use channels for metrics updates, added metrics tests #171

irar2 · 2025-08-26T05:36:34Z

fixes #145

This PR changes how the metrics for the number of running and waiting requests are updated. The current approach, an atomic increment followed by a separate Prometheus set, causes the metrics to become inconsistent. The counters are therefore updated through channels instead.
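To illustrate the idea, here is a minimal sketch, not the code from this PR. It assumes a single goroutine owns the counter and applies every delta received on a channel, so the counter update and the gauge set always happen together and in order. The `gauge` type is a hypothetical stand-in for a Prometheus gauge, and `runUpdater` and `simulate` are illustrative names.

```go
package main

import (
	"fmt"
	"sync"
)

// gauge is a hypothetical stand-in for a Prometheus gauge; only Set/Get are modeled.
type gauge struct {
	mu  sync.Mutex
	val float64
}

func (g *gauge) Set(v float64) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.val = v
}

func (g *gauge) Get() float64 {
	g.mu.Lock()
	defer g.mu.Unlock()
	return g.val
}

// runUpdater is the only goroutine that touches the counter. Each delta is
// applied and published to the gauge before the next one is read, so the
// gauge can never be set from a stale counter value.
func runUpdater(updates <-chan int64, g *gauge, done chan<- int64) {
	var n int64
	for inc := range updates {
		n += inc
		g.Set(float64(n))
	}
	done <- n
}

// simulate starts `started` concurrent requests, of which `finished` also
// complete. It returns the final counter and the final gauge value.
func simulate(started, finished int) (int64, float64) {
	updates := make(chan int64, 16)
	done := make(chan int64)
	g := &gauge{}
	go runUpdater(updates, g, done)

	var wg sync.WaitGroup
	for i := 0; i < started; i++ {
		wg.Add(1)
		go func(finish bool) {
			defer wg.Done()
			updates <- 1 // request starts
			if finish {
				updates <- -1 // request ends
			}
		}(i < finished)
	}
	wg.Wait()
	close(updates)
	n := <-done
	return n, g.Get()
}

func main() {
	n, v := simulate(100, 60)
	fmt.Println(n, v) // counter and gauge agree
}
```

With atomic increments and a separate set, two goroutines can increment in one order and publish in the other, leaving the gauge on an older value. Funnelling the deltas through one owner goroutine removes that window.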

This PR also adds metrics tests and slightly reorganizes the simulator tests.

Signed-off-by: Ira <[email protected]>

mayabar · 2025-08-26T06:21:24Z

pkg/llm-d-inference-sim/metrics.go


 // reportWaitingRequests sets information about waiting completion requests
-func (s *VllmSimulator) reportWaitingRequests() {
+func (s *VllmSimulator) reportWaitingRequests(nWaitingReqs int64) {


No need to pass nWaitingReqs; s.nWaitingReqs already holds the updated value.

mayabar · 2025-08-26T06:21:41Z

pkg/llm-d-inference-sim/metrics.go


 // reportRunningRequests sets information about running completion requests
-func (s *VllmSimulator) reportRunningRequests() {
+func (s *VllmSimulator) reportRunningRequests(nRunningReqs int64) {


No need to pass nRunningReqs; s.nRunningReqs already holds the updated value.

mayabar · 2025-08-26T06:25:04Z

pkg/llm-d-inference-sim/simulator.go

+	// waitingReqChan is a channel to update nWaitingReqs
+	waitingReqChan chan int64
 	// loraInfo is prometheus gauge
 	loraInfo *prometheus.GaugeVec


Maybe we need a channel for loraInfo as well? Please create an issue for this.

mayabar · 2025-08-26T06:25:54Z

pkg/llm-d-inference-sim/simulator.go

 		kvcacheHelper:  nil, // kvcache helper will be created only if required after reading configuration
 		namespace:      os.Getenv(podNsEnv),
 		pod:            os.Getenv(podNameEnv),
+		runReqChan:     make(chan int64, 1000),


Maybe we can define a constant for the metrics channel size, or even expose it in the configuration?
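One possible shape for this suggestion, as a sketch only: a named default replaces the literal `1000`, with an optional override from configuration. The names `defaultMetricsChanSize`, `config`, `MetricsChanSize`, and `newMetricsChan` are hypothetical and do not come from the simulator's code.

```go
package main

import "fmt"

// defaultMetricsChanSize is a hypothetical constant replacing the literal 1000.
const defaultMetricsChanSize = 1000

// config is a hypothetical slice of the simulator configuration.
type config struct {
	// MetricsChanSize overrides the default buffer size when positive.
	MetricsChanSize int
}

// newMetricsChan creates a buffered channel for metric deltas, falling back
// to the default size when the configuration does not set one.
func newMetricsChan(cfg config) chan int64 {
	size := cfg.MetricsChanSize
	if size <= 0 {
		size = defaultMetricsChanSize
	}
	return make(chan int64, size)
}

func main() {
	fmt.Println(cap(newMetricsChan(config{})))                    // default size
	fmt.Println(cap(newMetricsChan(config{MetricsChanSize: 64}))) // configured size
}
```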

mayabar · 2025-08-26T06:44:01Z

pkg/llm-d-inference-sim/simulator.go

+		case inc := <-s.waitingReqChan:
+			s.nWaitingReqs += inc
+			s.reportWaitingRequests(s.nWaitingReqs)
+		case inc := <-s.runReqChan:


Please create a separate loop for each metric.
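A sketch of what separate loops could look like, assuming one goroutine per metric, each owning its own counter and stopping on context cancellation. The `sim` type, the updater method names, and `run` are hypothetical; only the channel and counter field names are taken from the diff above. The Prometheus reporting call is left as a comment.

```go
package main

import (
	"context"
	"fmt"
	"sync"
)

// sim is a hypothetical, reduced stand-in for the simulator struct.
type sim struct {
	waitingReqChan chan int64
	runReqChan     chan int64
	nWaitingReqs   int64
	nRunningReqs   int64
}

// waitingUpdater is the only goroutine that modifies nWaitingReqs.
func (s *sim) waitingUpdater(ctx context.Context) {
	for {
		select {
		case <-ctx.Done():
			return
		case inc := <-s.waitingReqChan:
			s.nWaitingReqs += inc
			// the waiting-requests gauge would be reported here
		}
	}
}

// runningUpdater is the only goroutine that modifies nRunningReqs.
func (s *sim) runningUpdater(ctx context.Context) {
	for {
		select {
		case <-ctx.Done():
			return
		case inc := <-s.runReqChan:
			s.nRunningReqs += inc
			// the running-requests gauge would be reported here
		}
	}
}

// run feeds the given deltas to both loops and returns the final counters.
// Unbuffered channels are used here so that every send has been received
// before cancel is called; this is a choice made for the demo only.
func run(waiting, running []int64) (int64, int64) {
	s := &sim{
		waitingReqChan: make(chan int64),
		runReqChan:     make(chan int64),
	}
	ctx, cancel := context.WithCancel(context.Background())
	var wg sync.WaitGroup
	wg.Add(2)
	go func() { defer wg.Done(); s.waitingUpdater(ctx) }()
	go func() { defer wg.Done(); s.runningUpdater(ctx) }()

	for _, inc := range waiting {
		s.waitingReqChan <- inc
	}
	for _, inc := range running {
		s.runReqChan <- inc
	}
	cancel()
	wg.Wait() // both loops have exited, so the counters are safe to read
	return s.nWaitingReqs, s.nRunningReqs
}

func main() {
	w, r := run([]int64{1, 1, -1}, []int64{1, 1, 1})
	fmt.Println(w, r)
}
```

Splitting the loops means a burst of updates on one channel cannot delay the other metric, and each counter has exactly one writer.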

mayabar · 2025-08-26T08:46:12Z

/lgtm

/approve

mayabar · 2025-08-26T08:51:49Z

/lgtm
/approve

Use channels for metrics updates. Metrics tests

18be299

Signed-off-by: Ira <[email protected]>

irar2 requested a review from mayabar August 26, 2025 05:36

mayabar requested changes Aug 26, 2025

Review comments

afdad58

Signed-off-by: Ira <[email protected]>

github-actions bot added the lgtm label Aug 26, 2025

github-actions bot approved these changes Aug 26, 2025

mayabar approved these changes Aug 26, 2025

github-actions bot approved these changes Aug 26, 2025

irar2 merged commit 57657bf into llm-d:main Aug 26, 2025
4 checks passed

irar2 deleted the metrics branch August 26, 2025 08:54
