You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: backup_and_restore/backing-up-etcd.adoc
+1-2Lines changed: 1 addition & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,8 +15,7 @@ installation, otherwise the backup will contain expired certificates. It is also
15
15
recommended to take etcd backups during non-peak usage hours, as it is a
16
16
blocking action.
17
17
18
-
Once you have an etcd backup, you can xref:../backup_and_restore/disaster_recovery/scenario-1-infra-recovery.adoc#dr-infrastructure-recovery[recover from lost master hosts]
19
-
and xref:../backup_and_restore/disaster_recovery/scenario-2-restoring-cluster-state.adoc#dr-restoring-cluster-state[restore to a previous cluster state].
18
+
Once you have an etcd backup, you can xref:../backup_and_restore/disaster_recovery/scenario-2-restoring-cluster-state.adoc#dr-restoring-cluster-state[restore to a previous cluster state].
20
19
21
20
You can perform the xref:../backup_and_restore/backing-up-etcd.adoc#backing-up-etcd-data_backup-etcd[etcd data backup process]
22
21
on any master host that has connectivity to the etcd cluster, where the proper
Copy file name to clipboardExpand all lines: backup_and_restore/disaster_recovery/about-disaster-recovery.adoc
+4-13Lines changed: 4 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,11 +11,10 @@ how to recover from several disaster situations that might occur with their
11
11
more of the following procedures in order to return your cluster to a working
12
12
state.
13
13
14
-
xref:../../backup_and_restore/disaster_recovery/scenario-1-infra-recovery.adoc#dr-infrastructure-recovery[Recovering from lost master hosts]::
15
-
This solution handles situations where you have lost the majority of your master
16
-
hosts, leading to etcd quorum loss and the cluster going offline. As long as you
17
-
have taken an etcd backup and have at least one remaining healthy master host,
18
-
you can follow this procedure to recover your cluster.
14
+
xref:../../backup_and_restore/disaster_recovery/scenario-2-restoring-cluster-state.adoc#dr-restoring-cluster-state[Restoring to a previous cluster state]::
15
+
This solution handles situations where you want to restore your cluster to
16
+
a previous state, for example, if an administrator deletes something critical.
17
+
This also includes situations where you have lost the majority of your master hosts, leading to etcd quorum loss and the cluster going offline. As long as you have taken an etcd backup, you can follow this procedure to restore your cluster to a previous state.
19
18
+
20
19
If applicable, you might also need to xref:../../backup_and_restore/disaster_recovery/scenario-3-expired-certs.adoc#dr-recovering-expired-certs[recover from expired control plane certificates].
21
20
+
@@ -24,14 +23,6 @@ If applicable, you might also need to xref:../../backup_and_restore/disaster_rec
24
23
If you have a majority of your masters still available and have an etcd quorum, then follow the procedure to xref:../../backup_and_restore/replacing-failed-master.adoc#replacing-failed-master-host[replace a single failed master host].
25
24
====
26
25
27
-
xref:../../backup_and_restore/disaster_recovery/scenario-2-restoring-cluster-state.adoc#dr-restoring-cluster-state[Restoring to a previous cluster state]::
28
-
This solution handles situations where you want to restore your cluster to
29
-
a previous state, for example, if an administrator deletes something critical.
30
-
As long as you have taken an etcd backup, you can follow this procedure to
31
-
restore your cluster to a previous state.
32
-
+
33
-
If applicable, you might also need to xref:../../backup_and_restore/disaster_recovery/scenario-3-expired-certs.adoc#dr-recovering-expired-certs[recover from expired control plane certificates].
34
-
35
26
xref:../../backup_and_restore/disaster_recovery/scenario-3-expired-certs.adoc#dr-recovering-expired-certs[Recovering from expired control plane certificates]::
36
27
This solution handles situations where your control plane certificates have
37
28
expired. For example, if you shut down your cluster before the first certificate
This document describes the process to recover from a complete loss of a master host. This includes
9
-
situations where a majority of master hosts have been lost, leading to etcd quorum loss and the cluster going offline. This procedure assumes that you have at least one healthy master host.
10
-
11
-
At a high level, the procedure is to:
12
-
13
-
. Restore etcd quorum on a remaining master host.
14
-
. Create new master hosts.
15
-
. Correct DNS and load balancer entries.
16
-
. Grow etcd to full membership.
17
-
18
-
If the majority of master hosts have been lost, you will need an xref:../../backup_and_restore/backing-up-etcd.adoc#backing-up-etcd-data_backup-etcd[etcd backup] to restore etcd quorum on the remaining master host.
8
+
As of {product-title} 4.4, follow the procedure to xref:../../backup_and_restore/disaster_recovery/scenario-2-restoring-cluster-state.adoc#dr-restoring-cluster-state[restore to a previous cluster state] in order to recover from lost master hosts.
19
9
20
10
[NOTE]
21
11
====
22
12
If you have a majority of your masters still available and have an etcd quorum, then follow the procedure to xref:../../backup_and_restore/replacing-failed-master.adoc#replacing-failed-master-host[replace a single failed master host].
Copy file name to clipboardExpand all lines: modules/backup-etcd.adoc
+14-5Lines changed: 14 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@
5
5
[id="backing-up-etcd-data_{context}"]
6
6
= Backing up etcd data
7
7
8
-
Follow these steps to back up etcd data by creating an etcd snapshot and backing up static Kubernetes API server resources. This backup can be saved and used at a later time if you need to restore etcd.
8
+
Follow these steps to back up etcd data by creating an etcd snapshot and backing up the resources for the static Pods. This backup can be saved and used at a later time if you need to restore etcd.
9
9
10
10
You should only save a backup from a single master host. You do not need a backup from each master host in the cluster.
11
11
@@ -15,18 +15,27 @@ You should only save a backup from a single master host. You do not need a backu
15
15
16
16
.Procedure
17
17
18
-
. Access a master host as the root user.
18
+
. Access a master host.
19
19
20
-
. Run the `etcd-snapshot-backup.sh` script and pass in the location to save the backup to.
20
+
. Run the `cluster-backup.sh` script and pass in the location to save the backup to.
found latest kube-apiserver-pod: /etc/kubernetes/static-pod-resources/kube-apiserver-pod-7
28
+
found latest kube-controller-manager-pod: /etc/kubernetes/static-pod-resources/kube-controller-manager-pod-8
29
+
found latest kube-scheduler-pod: /etc/kubernetes/static-pod-resources/kube-scheduler-pod-6
30
+
found latest etcd-pod: /etc/kubernetes/static-pod-resources/etcd-pod-2
31
+
Snapshot saved at /var/home/core/assets/backup/snapshot_2020-03-18_220218.db
32
+
snapshot db and kube resources are successfully saved to /var/home/core/assets/backup
24
33
----
25
34
+
26
35
In this example, two files are created in the `./assets/backup/` directory on the master host:
27
36
28
37
* `snapshot_<datetimestamp>.db`: This file is the etcd snapshot.
29
-
* `static_kuberesources_<datetimestamp>.tar.gz`: This file contains the static Kubernetes API server resources. If etcd encryption is enabled, it also contains the encryption keys for the etcd snapshot.
38
+
* `static_kuberesources_<datetimestamp>.tar.gz`: This file contains the resources for the static Pods. If etcd encryption is enabled, it also contains the encryption keys for the etcd snapshot.
0 commit comments