Skip to content

Commit 3851c07

Browse files
authored
Merge pull request #75768 from xenolinux/hcp-ha-dr-content-add
OSDOCS#10475: Restructure backup, restore, disaster recovery for HCP
2 parents 8043812 + 48db093 commit 3851c07

21 files changed

+219
-175
lines changed

_topic_maps/_topic_map.yml

Lines changed: 11 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2375,8 +2375,17 @@ Topics:
23752375
File: hcp-updating
23762376
- Name: Hosted control planes Observability
23772377
File: hcp-observability
2378-
- Name: Backup, restore, and disaster recovery for hosted control planes
2379-
File: hcp-backup-restore-dr
2378+
- Name: High availability for hosted control planes
2379+
Dir: hcp_high_availability
2380+
Topics:
2381+
- Name: Recovering a failing etcd cluster
2382+
File: hcp-recovering-etcd-cluster
2383+
- Name: Backing up and restoring etcd in an on-premise environment
2384+
File: hcp-backup-restore-on-premise
2385+
- Name: Backing up and restoring etcd on AWS
2386+
File: hcp-backup-restore-aws
2387+
- Name: Disaster recovery for a hosted cluster in AWS
2388+
File: hcp-disaster-recovery-aws
23802389
- Name: Troubleshooting hosted control planes
23812390
File: hcp-troubleshooting
23822391
---

backup_and_restore/control_plane_backup_and_restore/backing-up-etcd.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ include::modules/backup-etcd.adoc[leveloffset=+1]
2525
[role="_additional-resources"]
2626
[id="additional-resources_backup-etcd"]
2727
== Additional resources
28-
* xref:../../hosted_control_planes/hcp-backup-restore-dr.adoc#hcp-backup-restore[Backing up and restoring etcd on a hosted cluster]
28+
* xref:../../hosted_control_planes/hcp_high_availability/hcp-recovering-etcd-cluster.adoc#hcp-recovering-etcd-cluster[Recovering an unhealthy etcd cluster]
2929

3030
// Creating automated etcd backups
3131
include::modules/etcd-creating-automated-backups.adoc[leveloffset=+1]
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
../../_attributes/
Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
:_mod-docs-content-type: ASSEMBLY
2+
[id="hcp-backup-restore-aws"]
3+
= Backing up and restoring etcd on AWS
4+
include::_attributes/common-attributes.adoc[]
5+
:context: hcp-backup-restore-aws
6+
7+
toc::[]
8+
9+
You can back up and restore etcd on a hosted cluster on {aws-first} to fix failures.
10+
11+
:FeatureName: Hosted control planes on the {aws-short} platform
12+
include::snippets/technology-preview.adoc[]
13+
14+
include::modules/backup-etcd-hosted-cluster.adoc[leveloffset=+1]
15+
16+
include::modules/restoring-etcd-snapshot-hosted-cluster.adoc[leveloffset=+1]
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
:_mod-docs-content-type: ASSEMBLY
2+
[id="hcp-backup-restore-on-premise"]
3+
= Backing up and restoring etcd in an on-premise environment
4+
include::_attributes/common-attributes.adoc[]
5+
:context: hcp-backup-restore-on-premise
6+
7+
toc::[]
8+
9+
You can back up and restore etcd on a hosted cluster in an on-premise environment to fix failures.
10+
11+
include::modules/hosted-cluster-etcd-backup-restore-on-premise.adoc[leveloffset=+1]
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
:_mod-docs-content-type: ASSEMBLY
2+
[id="hcp-disaster-recovery-aws"]
3+
= Disaster recovery for a hosted cluster in AWS
4+
include::_attributes/common-attributes.adoc[]
5+
:context: hcp-disaster-recovery-aws
6+
7+
toc::[]
8+
9+
You can recover a hosted cluster to the same region within {aws-first}. For example, you need disaster recovery when the upgrade of a management cluster fails and the hosted cluster is in a read-only state.
10+
11+
:FeatureName: Hosted control planes
12+
include::snippets/technology-preview.adoc[]
13+
14+
The disaster recovery process involves the following steps:
15+
16+
. Backing up the hosted cluster on the source management cluster
17+
. Restoring the hosted cluster on a destination management cluster
18+
. Deleting the hosted cluster from the source management cluster
19+
20+
Your workloads remain running during the process. The Cluster API might be unavailable for a period, but that does not affect the services that are running on the worker nodes.
21+
22+
[IMPORTANT]
23+
====
24+
Both the source management cluster and the destination management cluster must have the `--external-dns` flags to maintain the API server URL. That way, the server URL ends with `https://api-sample-hosted.sample-hosted.aws.openshift.com`. See the following example:
25+
26+
.Example: External DNS flags
27+
[source,terminal]
28+
----
29+
--external-dns-provider=aws \
30+
--external-dns-credentials=<path_to_aws_credentials_file> \
31+
--external-dns-domain-filter=<basedomain>
32+
----
33+
34+
If you do not include the `--external-dns` flags to maintain the API server URL, you cannot migrate the hosted cluster.
35+
====
36+
37+
include::modules/dr-backup-restore-aws-overview.adoc[leveloffset=+1]
38+
39+
include::modules/dr-hosted-cluster-within-aws-region-backup.adoc[leveloffset=+1]
40+
41+
include::modules/dr-hosted-cluster-within-aws-region-restore.adoc[leveloffset=+1]
42+
43+
include::modules/dr-hosted-cluster-within-aws-region-delete.adoc[leveloffset=+1]
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
:_mod-docs-content-type: ASSEMBLY
2+
[id="hcp-recovering-etcd-cluster"]
3+
= Recovering an unhealthy etcd cluster
4+
include::_attributes/common-attributes.adoc[]
5+
:context: hcp-recovering-etcd-cluster
6+
7+
toc::[]
8+
9+
In a highly available control plane, three etcd pods run as a part of a stateful set in an etcd cluster. To recover an etcd cluster, identify unhealthy etcd pods by checking the etcd cluster health.
10+
11+
include::modules/hosted-cluster-etcd-status.adoc[leveloffset=+1]
12+
13+
include::modules/hcp-recover-failing-etcd-pods.adoc[leveloffset=+1]
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
../../images
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
../../modules
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
../../snippets

0 commit comments

Comments
 (0)