Skip to content

Commit d6c4716

Browse files
authored
Merge pull request #241436 from Sathyadevi-S/main
TSG for BMM reboot
2 parents 7ef6bb2 + 6f55a38 commit d6c4716

File tree

4 files changed

+60
-0
lines changed

4 files changed

+60
-0
lines changed

articles/operator-nexus/TOC.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -71,6 +71,8 @@
7171
href: troubleshoot-isolation-domain.md
7272
- name: Troubleshoot Internet host virtual machine
7373
href: troubleshoot-internet-host-virtual-machine.md
74+
- name: Troubleshoot BMM reboot issues
75+
href: troubleshoot-bmm-node-reboot.md
7476
- name: Sample Deployment
7577
items:
7678
- name: VNF Deployment
102 KB
Loading
68 KB
Loading
Lines changed: 58 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,58 @@
1+
---
2+
title: Troubleshoot Post Bare Metal machine(BMM) Cordon and Restart, VMs show up as error on portal. Power and Restart doesn’t work anymore for Azure Operator Nexus
3+
description: Troubleshoot Post Bare Metal machine(BMM) Cordon and restart, VMs show up as error on portal. Power & Restart doesn’t work anymore.
4+
ms.service: azure-operator-nexus
5+
ms.custom: troubleshooting
6+
ms.topic: troubleshooting
7+
ms.date: 06/13/2023
8+
ms.author: v-sathysubra
9+
author: Sathyadevi-S
10+
---
11+
# Troubleshoot post Bare Metal machine(BMM) cordon and restart, VMs show up as error on portal. Power and Restart doesn’t work anymore
12+
Follow this troubleshooting guide for Bare Metal machine(BMM) Cordon and Restart:
13+
- If you encounter VMs showing up as "ERROR" on the portal after an upgrade.
14+
- Traditional methods such as powering off and restarting the VMs doesn't work
15+
16+
## Prerequisites to complete this TSG
17+
- Install the latest version of the
18+
[appropriate CLI extensions](./howto-install-cli-extensions.md)
19+
- Familiarize yourself with the capabilities referenced in this article by reviewing the [Bare Metal Machine Actions](howto-baremetal-functions.md)
20+
- Subscription ID
21+
- Cluster name and resource group
22+
- Virtual machine name
23+
- The Virtual Machine should have a Provisioning State of "Succeeded" and a Power State of "On."
24+
25+
## Symptoms
26+
27+
- During Bare Metal machine(BMM) restart or upgrade testing, the VM is currently in an error state.
28+
- After the Bare Metal machine(BMM) reboot CORDONED -> POWERED OFF -> POWERED ON -> UNCORDONED
29+
- Although the VNF successfully came up, established its BGP sessions, and started routing traffic, the VM status in the portal consistently shows an error. Despite this discrepancy, the application remains healthy and continues to function properly.
30+
- The Portal actions and AZ CLI APIs for the NC VM resource itself were no longer achieving the intent. Example:
31+
- Clicking the Power Off (or AZ CLI) actually Power Off the VM anymore
32+
- Clicking the Restart (or AZ CLI) doesn't actually Restart the VM anymore
33+
- The platform has lost capability to manage this VM Resource
34+
35+
:::image type="content" source="media\troubleshoot-bmm-server\bmm-error-status.png" alt-text="Screenshot of Sample VM in Error status." lightbox="media\troubleshoot-bmm-server\bmm-error-status.png":::
36+
37+
## Troubleshooting
38+
39+
- Gather the VM details and Validate the VM status in the portal.
40+
41+
- VM isn't connected and powered off
42+
- Validate the status of the virtual machine before and after restart or upgrade
43+
- Check the BGP session and traffic flow before and after restart or upgrade of the VNF
44+
- For more troubleshooting, see [troubleshoot-reboot-reimage-replace](troubleshoot-reboot-reimage-replace.md)
45+
46+
## Procedure
47+
48+
There's an issue with the status update on the VM after the upgrade.
49+
Although the upgrade and the VM itself are fine, the status is being reported incorrectly, leading to actions being ignored.
50+
Perform the following CLI update on any affected VMs with some dummy tag values (the use of "tag1" and "value1")
51+
52+
~~~bash
53+
az networkcloud virtualmachine update --ids <VMresourceId> --tags tag1=value1
54+
~~~
55+
56+
This process facilitates the restoration of the VM to an online state.
57+
58+
:::image type="content" source="media\troubleshoot-bmm-server\BMM-running-status.png" alt-text="Screenshot of Sample VM in running status." lightbox="media\troubleshoot-bmm-server\BMM-running-status.png":::

0 commit comments

Comments
 (0)