|
| 1 | +--- |
| 2 | +title: Troubleshoot Post Bare Metal machine(BMM) Cordon and Restart, VMs show up as error on portal. Power and Restart doesn’t work anymore for Azure Operator Nexus |
| 3 | +description: Troubleshoot Post Bare Metal machine(BMM) Cordon and restart, VMs show up as error on portal. Power & Restart doesn’t work anymore. |
| 4 | +ms.service: azure-operator-nexus |
| 5 | +ms.custom: troubleshooting |
| 6 | +ms.topic: troubleshooting |
| 7 | +ms.date: 06/13/2023 |
| 8 | +ms.author: v-sathysubra |
| 9 | +author: Sathyadevi-S |
| 10 | +--- |
| 11 | +# Troubleshoot post Bare Metal machine(BMM) cordon and restart, VMs show up as error on portal. Power and Restart doesn’t work anymore |
| 12 | +Follow this troubleshooting guide for Bare Metal machine(BMM) Cordon and Restart: |
| 13 | +- If you encounter VMs showing up as "ERROR" on the portal after an upgrade. |
| 14 | +- Traditional methods such as powering off and restarting the VMs doesn't work |
| 15 | + |
| 16 | +## Prerequisites to complete this TSG |
| 17 | +- Install the latest version of the |
| 18 | + [appropriate CLI extensions](./howto-install-cli-extensions.md) |
| 19 | +- Familiarize yourself with the capabilities referenced in this article by reviewing the [Bare Metal Machine Actions](howto-baremetal-functions.md) |
| 20 | +- Subscription ID |
| 21 | +- Cluster name and resource group |
| 22 | +- Virtual machine name |
| 23 | +- The Virtual Machine should have a Provisioning State of "Succeeded" and a Power State of "On." |
| 24 | + |
| 25 | +## Symptoms |
| 26 | + |
| 27 | +- During Bare Metal machine(BMM) restart or upgrade testing, the VM is currently in an error state. |
| 28 | +- After the Bare Metal machine(BMM) reboot CORDONED -> POWERED OFF -> POWERED ON -> UNCORDONED |
| 29 | + - Although the VNF successfully came up, established its BGP sessions, and started routing traffic, the VM status in the portal consistently shows an error. Despite this discrepancy, the application remains healthy and continues to function properly. |
| 30 | + - The Portal actions and AZ CLI APIs for the NC VM resource itself were no longer achieving the intent. Example: |
| 31 | + - Clicking the Power Off (or AZ CLI) actually Power Off the VM anymore |
| 32 | + - Clicking the Restart (or AZ CLI) doesn't actually Restart the VM anymore |
| 33 | + - The platform has lost capability to manage this VM Resource |
| 34 | + |
| 35 | +:::image type="content" source="media\troubleshoot-bmm-server\bmm-error-status.png" alt-text="Screenshot of Sample VM in Error status." lightbox="media\troubleshoot-bmm-server\bmm-error-status.png"::: |
| 36 | + |
| 37 | +## Troubleshooting |
| 38 | + |
| 39 | +- Gather the VM details and Validate the VM status in the portal. |
| 40 | + |
| 41 | + - VM isn't connected and powered off |
| 42 | + - Validate the status of the virtual machine before and after restart or upgrade |
| 43 | + - Check the BGP session and traffic flow before and after restart or upgrade of the VNF |
| 44 | + - For more troubleshooting, see [troubleshoot-reboot-reimage-replace](troubleshoot-reboot-reimage-replace.md) |
| 45 | + |
| 46 | +## Procedure |
| 47 | + |
| 48 | +There's an issue with the status update on the VM after the upgrade. |
| 49 | +Although the upgrade and the VM itself are fine, the status is being reported incorrectly, leading to actions being ignored. |
| 50 | +Perform the following CLI update on any affected VMs with some dummy tag values (the use of "tag1" and "value1") |
| 51 | + |
| 52 | +~~~bash |
| 53 | + az networkcloud virtualmachine update --ids <VMresourceId> --tags tag1=value1 |
| 54 | +~~~ |
| 55 | + |
| 56 | +This process facilitates the restoration of the VM to an online state. |
| 57 | + |
| 58 | +:::image type="content" source="media\troubleshoot-bmm-server\BMM-running-status.png" alt-text="Screenshot of Sample VM in running status." lightbox="media\troubleshoot-bmm-server\BMM-running-status.png"::: |
0 commit comments