Commit bf2445e (1 parent: 2d6b3a4)

committed: add a doc about maintaining a node

File tree: 6 files changed, +296 −9 lines changed

en/TOC.md

Lines changed: 1 addition & 0 deletions
@@ -45,6 +45,7 @@
 - [Suspend and Resume a TiDB Cluster](suspend-tidb-cluster.md)
 - [Restart a TiDB Cluster](restart-a-tidb-cluster.md)
 - [Destroy a TiDB Cluster](destroy-a-tidb-cluster.md)
+- [Maintain Kubernetes Nodes](maintain-a-kubernetes-node.md)
 - Reference
 - Architecture
 - [TiDB Operator](architecture.md)

en/maintain-a-kubernetes-node.md

Lines changed: 132 additions & 0 deletions
---
title: Maintain Kubernetes Nodes that Hold the TiDB Cluster
summary: Learn how to maintain Kubernetes nodes that hold the TiDB cluster.
---

# Maintain Kubernetes Nodes that Hold the TiDB Cluster

TiDB is a highly available database that keeps running even when some of its database nodes go offline. For this reason, you can safely shut down and maintain the underlying Kubernetes nodes without disrupting TiDB's service.

This document describes in detail how to perform maintenance operations on Kubernetes nodes. Different operation strategies are provided based on the maintenance duration and the storage type.

## Prerequisites

- [`kubectl`](https://kubernetes.io/docs/tasks/tools/install-kubectl/)

> **Note:**
>
> Before you maintain a node, make sure that the remaining resources in the Kubernetes cluster are sufficient to run the TiDB cluster.
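One way to gauge the remaining headroom before taking a node offline is to compare per-node resource requests against allocatable capacity. The following is only a sketch; the `kubectl top` command additionally assumes the metrics-server addon is installed in your cluster:

```shell
# Show allocatable resources and current requests/limits per node.
kubectl describe nodes | grep -A 8 "Allocated resources" || true

# Live CPU/memory usage per node (requires the metrics-server addon).
kubectl top nodes || true
```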
## Maintain a node

### Step 1: Preparation

1. Use the `kubectl cordon` command to mark the node to be maintained as unschedulable, which prevents new Pods from being scheduled to the node:

    ```shell
    kubectl cordon ${node_name}
    ```

2. Check whether there are any TiDB cluster component Pods on the node to be maintained:

    ```shell
    kubectl get pod --all-namespaces -o wide -l pingcap.com/managed-by=tidb-operator | grep ${node_name}
    ```

### Step 2: Migrate TiDB cluster component Pods

Choose the appropriate Pod migration strategy based on your storage type:

#### Option A: Reschedule Pods (for automatically migratable storage)

If the node storage can be automatically migrated (such as [Amazon EBS](https://aws.amazon.com/ebs/)), you can refer to [Gracefully restart a single Pod of a component](restart-a-tidb-cluster.md) to reschedule the component Pods. Using the PD component as an example:

1. Check the PD Pods on the node to be maintained:

    ```shell
    kubectl get pod --all-namespaces -o wide -l pingcap.com/component=pd | grep ${node_name}
    ```

2. Check the instance name corresponding to the PD Pod:

    ```shell
    kubectl get pod -n ${namespace} ${pod_name} -o jsonpath='{.metadata.labels.pingcap\.com/instance}'
    ```

3. Add a new label to the PD instance to trigger rescheduling:

    ```shell
    kubectl label pd -n ${namespace} ${pd_instance_name} pingcap.com/restartedAt=2025-06-30T12_00
    ```

4. Confirm that the PD Pod has been successfully scheduled to another node:

    ```shell
    watch kubectl -n ${namespace} get pod -o wide
    ```

5. Repeat the preceding steps for the other components (TiKV, TiDB, and so on) until all TiDB cluster component Pods on the node to be maintained have been migrated.
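The repetition in step 5 can be sketched as a small loop. This is only a sketch: the `namespace` value is hypothetical, and it assumes the `tikv` and `tidb` CR kinds accept the same `pingcap.com/restartedAt` label as `pd`. Note that Kubernetes label values may not contain `:`, so the timestamp below avoids it:

```shell
# Trigger rescheduling for the remaining component instances.
# Assumptions: the "tikv" and "tidb" CR kinds take the same restart label
# as "pd", and "tidb-cluster" is a hypothetical namespace.
namespace="tidb-cluster"
restarted_at="$(date +%Y-%m-%dT%H-%M)"   # label values may not contain ":"

for kind in tikv tidb; do
  # List instances of this kind; the loop body is skipped if none are found.
  for name in $(kubectl get "$kind" -n "$namespace" -o name 2>/dev/null); do
    kubectl label -n "$namespace" "$name" "pingcap.com/restartedAt=$restarted_at"
  done
done

echo "requested restart with pingcap.com/restartedAt=$restarted_at"
```

After the loop completes, re-run the Pod listing from Step 1 to confirm that no TiDB cluster component Pods remain on the node.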
#### Option B: Recreate instances (for local storage)

If the node storage cannot be automatically migrated (such as local storage), you need to recreate the instances:

> **Warning:**
>
> Recreating an instance causes its data on the node to be lost. For stateful components such as TiKV, make sure that the cluster has enough replicas to guarantee data safety.

Using recreating a TiKV instance as an example:

1. Delete the TiKV instance CR. TiDB Operator deletes the associated PVC, ConfigMap, and other resources, and automatically creates a new instance:

    ```shell
    kubectl delete -n ${namespace} tikv ${tikv_instance_name}
    ```

2. Wait until the newly created TiKV instance becomes Ready:

    ```shell
    kubectl get -n ${namespace} tikv ${tikv_instance_name}
    ```

3. After confirming that the TiDB cluster status is normal and data replication is complete, you can continue to maintain the other components.
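Instead of polling `kubectl get` manually in step 2, you can block on the instance's status with `kubectl wait`. This is a sketch: the namespace and instance names are hypothetical, and it assumes the `tikv` CR publishes a `Ready` status condition:

```shell
# Hypothetical names; replace with your own namespace and instance.
namespace="tidb-cluster"
tikv_instance_name="basic-tikv-0"

# Block for up to 10 minutes until the recreated instance reports Ready.
kubectl wait -n "$namespace" "tikv/$tikv_instance_name" \
  --for=condition=Ready --timeout=10m \
  || echo "wait did not succeed (no cluster access, or the instance is not Ready)"
```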
### Step 3: Confirm that the migration is complete

At this point, only Pods managed by DaemonSets (such as network plugins and monitoring agents) should remain on the node:

```shell
kubectl get pod --all-namespaces -o wide | grep ${node_name}
```

### Step 4: Perform node maintenance

You can now safely perform node maintenance operations, such as restarting the node, updating the system, or servicing hardware.
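As a concrete illustration of temporary maintenance, the following sketch applies OS updates and reboots the node; the node name, SSH access, and Debian-style package manager are all assumptions about your environment:

```shell
# Hypothetical maintenance run on a Linux node; adapt to your environment.
node="worker-1"

ssh -o BatchMode=yes -o ConnectTimeout=5 "$node" \
  "sudo apt-get update && sudo apt-get -y upgrade && sudo reboot" \
  || echo "could not reach $node"
```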
### Step 5: Post-maintenance recovery (only for temporary maintenance)

If the maintenance is temporary, restore the node after the maintenance is complete:

1. Confirm the node's health status:

    ```shell
    watch kubectl get node ${node_name}
    ```

    After the node enters the `Ready` state, proceed to the next step.

2. Use the `kubectl uncordon` command to remove the scheduling restriction from the node:

    ```shell
    kubectl uncordon ${node_name}
    ```

3. Check whether all Pods have returned to normal operation:

    ```shell
    kubectl get pod --all-namespaces -o wide | grep ${node_name}
    ```

    After the Pods return to normal operation, the maintenance operation is complete.

For long-term maintenance or permanent node removal, this step is not required.

en/restart-a-tidb-cluster.md

Lines changed: 14 additions & 4 deletions
@@ -40,8 +40,18 @@ For a TiKV Pod, specify the `--grace-period` option when deleting the Pod to pro
     kubectl -n ${namespace} delete pod ${pod_name} --grace-period=60
     ```

-For other component Pods, you can delete them directly, because TiDB Operator will automatically handle a graceful restart:
+For Pods of other components, you can perform a graceful restart by adding a label or annotation to the corresponding Instance CR. Taking PD as an example:

-```shell
-kubectl -n ${namespace} delete pod ${pod_name}
-```
+1. First, query the corresponding PD Instance CR through the Pod:
+
+    ```shell
+    kubectl get pod -n ${namespace} ${pod_name} -o jsonpath='{.metadata.labels.pingcap\.com/instance}'
+    ```
+
+2. Add a new label to the PD instance, for example:
+
+    ```shell
+    kubectl label pd -n ${namespace} ${pd_instance_name} pingcap.com/restartedAt=2025-06-30T12_00
+    ```
+
+3. If the PD is the leader, TiDB Operator will migrate the leader to another PD before restarting the PD Pod.

zh/TOC.md

Lines changed: 1 addition & 0 deletions
@@ -45,6 +45,7 @@
 - [Suspend and Resume a TiDB Cluster](suspend-tidb-cluster.md)
 - [Restart a TiDB Cluster](restart-a-tidb-cluster.md)
 - [Destroy a TiDB Cluster](destroy-a-tidb-cluster.md)
+- [Maintain the Kubernetes Nodes that Hold the TiDB Cluster](maintain-a-kubernetes-node.md)
 - Reference
 - Architecture
 - [TiDB Operator Architecture](architecture.md)

zh/maintain-a-kubernetes-node.md

Lines changed: 133 additions & 0 deletions
---
title: Maintain Kubernetes Nodes that Hold the TiDB Cluster
summary: Learn how to maintain the Kubernetes nodes that hold the TiDB cluster.
---

# Maintain Kubernetes Nodes that Hold the TiDB Cluster

TiDB is a highly available database that keeps running even when some of its database nodes go offline, so you can safely shut down and maintain the underlying Kubernetes nodes.

This document describes in detail how to perform maintenance operations on Kubernetes nodes. Different operation strategies are provided based on the maintenance duration and the storage type.

## Prerequisites

- [`kubectl`](https://kubernetes.io/docs/tasks/tools/install-kubectl/)

> **Note:**
>
> Before you maintain a node, make sure that the remaining resources in the Kubernetes cluster are sufficient to run the TiDB cluster.

## Maintain a node

### Step 1: Preparation

1. Use the `kubectl cordon` command to mark the node to be maintained as unschedulable, which prevents new Pods from being scheduled to it:

    ```shell
    kubectl cordon ${node_name}
    ```

2. Check whether there are any TiDB cluster component Pods on the node to be maintained:

    ```shell
    kubectl get pod --all-namespaces -o wide -l pingcap.com/managed-by=tidb-operator | grep ${node_name}
    ```

### Step 2: Migrate TiDB cluster component Pods

Choose the appropriate Pod migration strategy based on your storage type:

#### Option A: Reschedule Pods (for automatically migratable storage)

If you use storage that can be automatically migrated (such as [Amazon EBS](https://aws.amazon.com/cn/ebs/)), you can refer to [Gracefully restart a single Pod of a component](restart-a-tidb-cluster.md) to reschedule the component Pods. Using the PD component as an example:

1. Check the PD Pods on the node to be maintained:

    ```shell
    kubectl get pod --all-namespaces -o wide -l pingcap.com/component=pd | grep ${node_name}
    ```

2. Check the instance name corresponding to the PD Pod:

    ```shell
    kubectl get pod -n ${namespace} ${pod_name} -o jsonpath='{.metadata.labels.pingcap\.com/instance}'
    ```

3. Add a new label to the PD instance to trigger rescheduling:

    ```shell
    kubectl label pd -n ${namespace} ${pd_instance_name} pingcap.com/restartedAt=2025-06-30T12_00
    ```

4. Confirm that the PD Pod has been successfully scheduled to another node:

    ```shell
    watch kubectl -n ${namespace} get pod -o wide
    ```

5. Repeat the preceding steps for the other components (TiKV, TiDB, and so on) until all TiDB cluster component Pods on the node to be maintained have been migrated.

#### Option B: Recreate instances (for local storage)

If the node storage cannot be automatically migrated (for example, local storage), you need to recreate the instances:

> **Warning:**
>
> Recreating an instance causes its data on the node to be lost. For stateful components such as TiKV, make sure that the cluster has enough replicas to guarantee data safety.

Using recreating a TiKV instance as an example:

1. Delete the TiKV instance CR. TiDB Operator deletes the associated PVC, ConfigMap, and other resources, and automatically creates a new instance:

    ```shell
    kubectl delete -n ${namespace} tikv ${tikv_instance_name}
    ```

2. Wait until the newly created TiKV instance becomes Ready:

    ```shell
    kubectl get -n ${namespace} tikv ${tikv_instance_name}
    ```

3. After confirming that the TiDB cluster status is normal and data replication is complete, you can continue to maintain the other components.

### Step 3: Confirm that the migration is complete

At this point, only Pods managed by DaemonSets (such as network plugins and monitoring agents) should remain on the node:

```shell
kubectl get pod --all-namespaces -o wide | grep ${node_name}
```

### Step 4: Perform node maintenance

You can now safely perform node maintenance operations, such as restarting the node, updating the system, or servicing hardware.

### Step 5: Post-maintenance recovery (only for temporary maintenance)

If the maintenance is temporary, restore the node after the maintenance is complete:

1. Confirm the node's health status:

    ```shell
    watch kubectl get node ${node_name}
    ```

    After the node enters the `Ready` state, proceed to the next step.

2. Use the `kubectl uncordon` command to remove the scheduling restriction from the node:

    ```shell
    kubectl uncordon ${node_name}
    ```

3. Check whether all Pods have returned to normal operation:

    ```shell
    kubectl get pod --all-namespaces -o wide | grep ${node_name}
    ```

    After the Pods return to normal operation, the maintenance operation is complete.

For long-term maintenance or permanent node removal, this step is not required.

zh/restart-a-tidb-cluster.md

Lines changed: 15 additions & 5 deletions
@@ -34,14 +34,24 @@ spec:

 You can restart a specific Pod in the TiDB cluster individually. The operation differs slightly between components.

-For a TiKV Pod, to ensure there is enough time to evict the Region leaders, specify the `--grace-period` option when deleting the Pod; otherwise the operation may fail. The following example sets a 60-second grace period for a TiKV Pod:
+For a TiKV Pod, to ensure there is enough time to evict the Region Leaders, specify the `--grace-period` option when deleting the Pod; otherwise the operation may fail. The following example sets a 60-second grace period for a TiKV Pod:

     ```shell
     kubectl -n ${namespace} delete pod ${pod_name} --grace-period=60
     ```

-Pods of other components can be deleted directly; TiDB Operator automatically restarts them gracefully:
+For Pods of other components, you can perform a graceful restart by adding a label or annotation to the corresponding instance (Instance CR). Taking PD as an example:

-```shell
-kubectl -n ${namespace} delete pod ${pod_name}
-```
+1. First, find the PD Instance CR that corresponds to the Pod:
+
+    ```shell
+    kubectl get pod -n ${namespace} ${pod_name} -o jsonpath='{.metadata.labels.pingcap\.com/instance}'
+    ```
+
+2. Add a new label to the PD instance, for example:
+
+    ```shell
+    kubectl label pd -n ${namespace} ${pd_instance_name} pingcap.com/restartedAt=2025-06-30T12_00
+    ```
+
+3. If the PD is the Leader, TiDB Operator migrates the Leader to another PD before restarting the PD Pod.

0 commit comments
