-
Notifications
You must be signed in to change notification settings - Fork 1.2k
engine/orchestration: fix missing vm powerstate update vm state #10407
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
engine/orchestration: fix missing vm powerstate update vm state #10407
Conversation
Fixes apache#10406 VMs were not moving to Stopped state when PowerReportMissing is processed. Signed-off-by: Abhishek Kumar <[email protected]>
|
@blueorangutan package |
|
@shwstppr a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress. |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## 4.20 #10407 +/- ##
=========================================
Coverage 15.98% 15.99%
- Complexity 13077 13083 +6
=========================================
Files 5649 5649
Lines 495624 495625 +1
Branches 60003 60003
=========================================
+ Hits 79250 79268 +18
+ Misses 407526 407506 -20
- Partials 8848 8851 +3
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
DaanHoogland
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
clgtm, but no idea if this has side effects.
|
Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 12487 |
weizhouapache
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
code lgtm
2025-02-17T14:14:55,772 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-1:[]) (logid:) VM state report is updated. Host {"id":2,"name":"ref-trl-6851-k-Mu24-wei-zhou-kvm2","type":"Routing","uuid":"eb8661f9-6a36-4449-8923-cfe3fb849500"}, VM instance {"id":1159,"instanceName":"i-2-1159-VM","type":"User","uuid":"9e1de80e-ddcf-4c8b-87bc-1c1412292a46"}, power state: PowerReportMissing
2025-02-17T14:14:55,783 INFO [c.c.v.ClusteredVirtualMachineManagerImpl] (AgentManager-Handler-1:[ctx-d817875a]) (logid:) VM VM instance {"id":1159,"instanceName":"i-2-1159-VM","type":"User","uuid":"9e1de80e-ddcf-4c8b-87bc-1c1412292a46"} is at Running and we received a PowerReportMissing report while there is no pending jobs on it
...
2025-02-17T14:14:55,843 INFO [c.c.v.ClusteredVirtualMachineManagerImpl] (AgentManager-Handler-1:[ctx-d817875a]) (logid:) VM VM instance {"id":1159,"instanceName":"i-2-1159-VM","type":"User","uuid":"9e1de80e-ddcf-4c8b-87bc-1c1412292a46"} is sync-ed to at Stopped state according to power-off report from hypervisor.
sureshanaparti
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
clgtm
harikrishna-patnala
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
code LGTM
Signed-off-by: Abhishek Kumar <[email protected]>
Signed-off-by: Abhishek Kumar <[email protected]>
|
@blueorangutan package |
|
@shwstppr a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress. |
|
Packaging result [SF]: ✖️ el8 ✖️ el9 ✖️ debian ✖️ suse15. SL-JID 12498 |
Signed-off-by: Abhishek Kumar <[email protected]>
|
@blueorangutan package |
|
@shwstppr a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress. |
|
Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 12499 |
|
@blueorangutan test |
|
@DaanHoogland a [SL] Trillian-Jenkins test job (ol8 mgmt + kvm-ol8) has been kicked to run smoke tests |
|
[SF] Trillian test result (tid-12461)
|
kiranchavala
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Steps to reproduce the issue
- Deploy a vm from cloudstack
- Login to the kvm host and destory the vm using virsh (virsh destory domain id)
- The vm should get stopped in cloudstack after some time
Before the fix (the vm continued to be in running state)
2025-02-25` 06:59:13,941 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-16:null) (logid:) Run missing VM report. current time: 1740466753941
2025-02-25 06:59:13,942 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-16:null) (logid:) Detected missing VM. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"23ad85e8-e338-4b1a-958c-9a65b2532e75"}, VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"6ae77659-2f5f-492c-b616-6d78ef134dcb"}, power state: PowerReportMissing, last state update: 2025-02-25T06:58:13+0000
2025-02-25 06:59:13,942 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-16:null) (logid:) VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"6ae77659-2f5f-492c-b616-6d78ef134dcb"} - time since last state update(60941 ms) has not passed graceful period yet
2025-02-25 06:59:13,942 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-16:null) (logid:) Done with process of VM state report. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"23ad85e8-e338-4b1a-958c-9a65b2532e75"}
2025-02-25 07:00:13,989 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-15:null) (logid:) Run missing VM report. current time: 1740466813989
2025-02-25 07:00:13,990 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-15:null) (logid:) Detected missing VM. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"23ad85e8-e338-4b1a-958c-9a65b2532e75"}, VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"6ae77659-2f5f-492c-b616-6d78ef134dcb"}, power state: PowerReportMissing, last state update: 2025-02-25T06:58:13+0000
2025-02-25 07:00:13,990 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-15:null) (logid:) VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"6ae77659-2f5f-492c-b616-6d78ef134dcb"} - time since last state update(120989 ms) has passed graceful period
2025-02-25 07:00:13,994 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-15:null) (logid:) Done with process of VM state report. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"23ad85e8-e338-4b1a-958c-9a65b2532e75"}
After the fix (the vm goes to stop state)
2025-02-25 06:39:20,361 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-7:null) (logid:) Run missing VM report. current time: 1740465560361
2025-02-25 06:39:20,362 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-7:null) (logid:) Detected missing VM. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"22e66fdc-690b-4ae1-bfd6-a5677315cea0"}, VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"9368adc3-6b06-48db-8fa4-4ad8dbb20a81"}, power state: PowerReportMissing, last state update: 2025-02-25T06:38:20+0000
2025-02-25 06:39:20,362 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-7:null) (logid:) VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"9368adc3-6b06-48db-8fa4-4ad8dbb20a81"} - time since last state update(60361 ms) has not passed graceful period yet
2025-02-25 06:39:20,362 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-7:null) (logid:) Done with process of VM state report. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"22e66fdc-690b-4ae1-bfd6-a5677315cea0"
2025-02-25 06:40:20,331 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-1:null) (logid:) VM state report is updated. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"22e66fdc-690b-4ae1-bfd6-a5677315cea0"}, VM instance {"id":3,"instanceName":"i-2-3-VM","type":"User","uuid":"9368adc3-6b06-48db-8fa4-4ad8dbb20a81"}, power state: PowerReportMissing
2025-02-25 06:40:20,341 INFO [c.c.v.VirtualMachineManagerImpl] (AgentManager-Handler-1:null) (logid:) VM i-2-3-VM is at Running and we received a PowerReportMissing report while there is no pending jobs on it
2025-02-25 06:40:20,430 INFO [c.c.v.VirtualMachineManagerImpl] (AgentManager-Handler-1:null) (logid:) VM i-2-3-VM is sync-ed to at Stopped state according to power-off report from hypervisor
2025-02-25 06:40:20,431 DEBUG [c.c.v.VirtualMachinePowerStateSyncImpl] (AgentManager-Handler-1:null) (logid:) Done with process of VM state report. Host {"id":1,"name":"ol8.localdomain","type":"Routing","uuid":"22e66fdc-690b-4ae1-bfd6-a5677315cea0"}
…he#10407) * engine/orchestration: fix missing vm powerstate update vm state Fixes apache#10406 VMs were not moving to Stopped state when PowerReportMissing is processed. Signed-off-by: Abhishek Kumar <[email protected]> * add unit tests Signed-off-by: Abhishek Kumar <[email protected]> * add license Signed-off-by: Abhishek Kumar <[email protected]> * add lenient Signed-off-by: Abhishek Kumar <[email protected]> --------- Signed-off-by: Abhishek Kumar <[email protected]>
Description
Fixes #10406
VMs were not moving to Stopped state when PowerReportMissing is processed.
Types of changes
Feature/Enhancement Scale or Bug Severity
Feature/Enhancement Scale
Bug Severity
Screenshots (if appropriate):
How Has This Been Tested?
How did you try to break this feature and the system with this change?