-
Notifications
You must be signed in to change notification settings - Fork 38
Open
Open
Copy link
Labels
c/autoscaling/neonvmComponent: autoscaling: NeonVMComponent: autoscaling: NeonVMmigrated_to_jirat/bugIssue Type: BugIssue Type: Bug
Description
Environment
Production
Steps to reproduce
I was unable to reproduce locally, but it seems like the following sequence of events took place:
- VM is created with min CU = 0.25, current CU = 56, max CU = 56, with CPU scaling via sysfs
- neonvm-controller tells neonvm-runner to set CPU to 56
- neonvm-runner times out making the initial
PUTrequest to neonvm-daemon... but neonvm-daemon does actually make the change? (AFAICT)
Expected result
After we get to this inconsistent state, neonvm-runner should be able to recover on subsequent requests from neonvm-controller to get the current CPU.
Actual result
On every subsequent request where neonvm-controller fetches the current CPU, neonvm-runner emits the following warning:
{"level":"warn","ts":1746021781.7511058,"logger":"neonvm-runner.http-handlers.cpu_current","caller":"cmd/main.go:501","msg":"CPU from NeonVM Daemon does not match stored value, returning daemon value to let controller reconcile correct state","stored":"250m","current":56}
Other logs, links
- Logs link: https://neonprod.grafana.net/goto/OM7LJNbNR?orgId=1 (from 2025-04-30)
- Report in slack: https://neondb.slack.com/archives/C08PJ07BZ44/p1746031280385729
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
c/autoscaling/neonvmComponent: autoscaling: NeonVMComponent: autoscaling: NeonVMmigrated_to_jirat/bugIssue Type: BugIssue Type: Bug