
Commit e0f2ed3

Merge pull request #253673 from dennispadia/depadia-hadoc
Changes in SLES HA doc
2 parents a008364 + e4ca94a commit e0f2ed3

2 files changed: +61 -4 lines changed

articles/sap/workloads/high-availability-guide-suse-pacemaker.md

Lines changed: 10 additions & 3 deletions
@@ -10,7 +10,7 @@ ms.subservice: sap-vm-workloads
 ms.topic: article
 ms.workload: infrastructure-services
 ms.custom: devx-track-azurepowershell
-ms.date: 09/15/2023
+ms.date: 10/03/2023
 ms.author: radeltch
 ---

@@ -1010,11 +1010,18 @@ Azure offers [scheduled events](../../virtual-machines/linux/scheduled-events.md
 Important: The resources must start with 'health-azure'.

 ```bash
-sudo crm configure primitive health-azure-events \
-ocf:heartbeat:azure-events-az op monitor interval=10s
+sudo crm configure primitive health-azure-events ocf:heartbeat:azure-events-az \
+meta allow-unhealthy-nodes=true \
+op monitor interval=10s
+
 sudo crm configure clone health-azure-events-cln health-azure-events
 ```

+> [!NOTE]
+> When you configure the 'health-azure-events' resource, you can ignore the following warning message.
+>
+> WARNING: health-azure-events: unknown attribute 'allow-unhealthy-nodes'.
+
 6. Take the Pacemaker cluster out of maintenance mode

 ```bash

articles/sap/workloads/sap-hana-high-availability.md

Lines changed: 51 additions & 1 deletion
@@ -9,7 +9,7 @@ ms.service: sap-on-azure
 ms.subservice: sap-vm-workloads
 ms.topic: article
 ms.workload: infrastructure
-ms.date: 09/15/2023
+ms.date: 10/03/2023
 ms.author: radeltch

 ---
@@ -1375,6 +1375,56 @@ In the following test descriptions, we assume `PREFER_SITE_TAKEOVER="true"` and
 rsc_nc_HN1_HDB03 (ocf::heartbeat:azure-lb): Started hn1-db-0
 ```

+1. Test 10: Crash primary database indexserver
+
+This test is relevant only when you have set up the susChkSrv hook as outlined in [Implement HANA hooks SAPHanaSR and susChkSrv](./sap-hana-high-availability.md#implement-hana-hooks-saphanasr-and-suschksrv).
+
+The resource state before starting the test:
+
+```output
+Clone Set: cln_SAPHanaTopology_HN1_HDB03 [rsc_SAPHanaTopology_HN1_HDB03]
+Started: [ hn1-db-0 hn1-db-1 ]
+Master/Slave Set: msl_SAPHana_HN1_HDB03 [rsc_SAPHana_HN1_HDB03]
+Masters: [ hn1-db-0 ]
+Slaves: [ hn1-db-1 ]
+Resource Group: g_ip_HN1_HDB03
+rsc_ip_HN1_HDB03 (ocf::heartbeat:IPaddr2): Started hn1-db-0
+rsc_nc_HN1_HDB03 (ocf::heartbeat:azure-lb): Started hn1-db-0
+```
+
+Run the following commands as root on the `hn1-db-0` node:
+
+```bash
+hn1-db-0:~ # killall -9 hdbindexserver
+```
+
+When the indexserver is terminated, the susChkSrv hook detects the event and triggers an action to fence the 'hn1-db-0' node and initiate a takeover process.
+
+Run the following commands to register the `hn1-db-0` node as secondary and clean up the failed resource:
+
+```bash
+# run as <hana sid>adm
+hn1adm@hn1-db-0:/usr/sap/HN1/HDB03> hdbnsutil -sr_register --remoteHost=hn1-db-1 --remoteInstance=<instance number> --replicationMode=sync --name=<site 1>
+
+# run as root
+hn1-db-0:~ # crm resource cleanup msl_SAPHana_<HANA SID>_HDB<instance number> hn1-db-0
+```
+
+The resource state after the test:
+
+```output
+Clone Set: cln_SAPHanaTopology_HN1_HDB03 [rsc_SAPHanaTopology_HN1_HDB03]
+Started: [ hn1-db-0 hn1-db-1 ]
+Master/Slave Set: msl_SAPHana_HN1_HDB03 [rsc_SAPHana_HN1_HDB03]
+Masters: [ hn1-db-1 ]
+Slaves: [ hn1-db-0 ]
+Resource Group: g_ip_HN1_HDB03
+rsc_ip_HN1_HDB03 (ocf::heartbeat:IPaddr2): Started hn1-db-1
+rsc_nc_HN1_HDB03 (ocf::heartbeat:azure-lb): Started hn1-db-1
+```
+
+You can execute a comparable test case by causing the indexserver on the secondary node to crash. In the event of an indexserver crash, the susChkSrv hook will recognize the occurrence and initiate an action to fence the secondary node.
+
 ## Next steps

 - [Azure Virtual Machines planning and implementation for SAP][planning-guide]
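
Test 10 in the diff above expects the promoted role to move from `hn1-db-0` to `hn1-db-1`, and its cleanup command uses the placeholder `msl_SAPHana_<HANA SID>_HDB<instance number>`. The following is a minimal sketch, not part of the commit, of how the placeholder could be filled in and how the `Masters:` line could be read from status text. The variable names and the `masters_from_status` function are hypothetical, and the parsing assumes the `Masters: [ node ]` layout shown in the sample output; other Pacemaker versions may label this line differently.

```bash
#!/usr/bin/env bash
# Sketch only: names below are hypothetical and not part of the commit.

sid="HN1"        # HANA SID used in the sample output
instance="03"    # instance number used in the sample output

# Fill the placeholders in msl_SAPHana_<HANA SID>_HDB<instance number>.
resource="msl_SAPHana_${sid}_HDB${instance}"

# Print the node list from the "Masters:" line of status text read on stdin.
# Assumes the "Masters: [ node ]" layout shown in the sample output.
masters_from_status() {
  awk '/Masters:/ { gsub(/.*\[ *| *\].*/, ""); print }'
}

# Status text copied from "The resource state after the test".
sample_status='Master/Slave Set: msl_SAPHana_HN1_HDB03 [rsc_SAPHana_HN1_HDB03]
Masters: [ hn1-db-1 ]
Slaves: [ hn1-db-0 ]'

promoted="$(printf '%s\n' "$sample_status" | masters_from_status)"
echo "${resource} is promoted on: ${promoted}"
```

With the sample text, the script reports `hn1-db-1` as the promoted node. On a cluster node the function could presumably read the output of a status command such as `sudo crm status` instead of the sample string; that wiring is an assumption and is not exercised here.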
