update MD by dispatch event pingcap/docs master

github-actions · github-actions · commit c4505075387a · 2026-03-27T07:54:39.000Z
diff --git a/markdown-pages/en/tidb/master/TOC.md b/markdown-pages/en/tidb/master/TOC.md
@@ -636,6 +636,7 @@
       - Attributes
         - [AUTO_INCREMENT](/auto-increment.md)
         - [AUTO_RANDOM](/auto-random.md)
+        - [_tidb_rowid](/tidb-rowid.md)
         - [SHARD_ROW_ID_BITS](/shard-row-id-bits.md)
       - [Literal Values](/literal-values.md)
       - [Schema Object Names](/schema-object-names.md)
diff --git a/markdown-pages/en/tidb/master/clustered-indexes.md b/markdown-pages/en/tidb/master/clustered-indexes.md
@@ -11,7 +11,7 @@ The term _clustered_ in this context refers to the _organization of how data is
 
 Currently, tables containing primary keys in TiDB are divided into the following two categories:
 
-- `NONCLUSTERED`: The primary key of the table is non-clustered index. In tables with non-clustered indexes, the keys for row data consist of internal `_tidb_rowid` implicitly assigned by TiDB. Because primary keys are essentially unique indexes, tables with non-clustered indexes need at least two key-value pairs to store a row, which are:
+- `NONCLUSTERED`: The primary key of the table is a non-clustered index. In tables with non-clustered indexes, the keys for row data consist of internal [`_tidb_rowid`](/tidb-rowid.md) values implicitly assigned by TiDB. Because primary keys are essentially unique indexes, tables with non-clustered indexes need at least two key-value pairs to store a row, which are:
     - `_tidb_rowid` (key) - row data (value)
     - Primary key data (key) - `_tidb_rowid` (value)
 - `CLUSTERED`: The primary key of the table is clustered index. In tables with clustered indexes, the keys for row data consist of primary key data given by the user. Therefore, tables with clustered indexes need only one key-value pair to store a row, which is:
diff --git a/markdown-pages/en/tidb/master/shard-row-id-bits.md b/markdown-pages/en/tidb/master/shard-row-id-bits.md
@@ -5,11 +5,11 @@ summary: Learn the SHARD_ROW_ID_BITS attribute.
 
 # SHARD_ROW_ID_BITS
 
-This document introduces the `SHARD_ROW_ID_BITS` table attribute, which is used to set the number of bits of the shards after the implicit `_tidb_rowid` is sharded.
+This document introduces the `SHARD_ROW_ID_BITS` table attribute, which sets the number of shard bits used to scatter the implicit [`_tidb_rowid`](/tidb-rowid.md).
 
 ## Concept
 
-For the tables with a non-clustered primary key or no primary key, TiDB uses an implicit auto-increment row ID. When a large number of `INSERT` operations are performed, the data is written into a single Region, causing a write hot spot.
+For tables with a non-clustered primary key or no primary key, TiDB uses the automatically generated [`_tidb_rowid`](/tidb-rowid.md) as an implicit auto-increment row ID. When a large number of `INSERT` operations are performed, the data is written into a single Region, causing a write hot spot.
 
 To mitigate the hot spot issue, you can configure `SHARD_ROW_ID_BITS`. The row IDs are scattered and the data are written into multiple different Regions.
 
@@ -23,9 +23,13 @@ When you set `SHARD_ROW_ID_BITS = S`, the structure of `_tidb_rowid` is as follo
 |--------|--------|--------------|
 | 1 bit | `S` bits | `63-S` bits |
 
-- The values of the auto-increment bits are stored in TiKV and allocated sequentially. Each time a value is allocated, the next value is incremented by 1. The auto-increment bits ensure that the column values of `_tidb_rowid` are unique globally. When the value of the auto-increment bits is exhausted (that is, when the maximum value is reached), subsequent automatic allocations fail with the error `Failed to read auto-increment value from storage engine`.
+- The values of the auto-increment bits are stored in TiKV and allocated sequentially. Each time a value is allocated, the next value is incremented by 1. When the value of the auto-increment bits is exhausted (that is, when the maximum value is reached), subsequent automatic allocations fail with the error `Failed to read auto-increment value from storage engine`.
 - The value range of `_tidb_rowid`: the maximum number of bits for the final generated value = shard bits + auto-increment bits, so the maximum value is `(2^63)-1`.
 
+> **Warning:**
+>
+> `_tidb_rowid` is an internal row ID implicitly assigned by TiDB. Do not assume it is globally unique in all cases. For partitioned tables that do not use clustered indexes, `ALTER TABLE ... EXCHANGE PARTITION` can leave different partitions with the same `_tidb_rowid` value. For details, see [`_tidb_rowid`](/tidb-rowid.md).
+
 > **Note:**
 >
 > Selection of shard bits (`S`):
diff --git a/markdown-pages/en/tidb/master/sql-statements/sql-statement-show-table-next-rowid.md b/markdown-pages/en/tidb/master/sql-statements/sql-statement-show-table-next-rowid.md
@@ -8,7 +8,7 @@ aliases: ['/docs/dev/sql-statements/sql-statement-show-table-next-rowid/']
 
 `SHOW TABLE NEXT_ROW_ID` is used to show the details of some special columns of a table, including:
 
-* [`AUTO_INCREMENT`](/auto-increment.md) column automatically created by TiDB, namely, `_tidb_rowid` column.
+* [`_tidb_rowid`](/tidb-rowid.md), the hidden row ID column automatically managed by TiDB for tables that do not use a clustered index.
 * `AUTO_INCREMENT` column created by users.
 * [`AUTO_RANDOM`](/auto-random.md) column created by users.
 * [`SEQUENCE`](/sql-statements/sql-statement-create-sequence.md) created by users.
@@ -66,3 +66,4 @@ This statement is a TiDB extension to MySQL syntax.
 * [CREATE TABLE](/sql-statements/sql-statement-create-table.md)
 * [AUTO_RANDOM](/auto-random.md)
 * [CREATE_SEQUENCE](/sql-statements/sql-statement-create-sequence.md)
+* [_tidb_rowid](/tidb-rowid.md)
diff --git a/markdown-pages/en/tidb/master/tidb-rowid.md b/markdown-pages/en/tidb/master/tidb-rowid.md
@@ -0,0 +1,157 @@
+---
+title: _tidb_rowid
+summary: Learn what `_tidb_rowid` is, when it is available, and how to use it safely.
+---
+
+# `_tidb_rowid`
+
+`_tidb_rowid` is a hidden system column automatically generated by TiDB. For tables that do not use a clustered index, it serves as the internal row ID of the table. You cannot declare or modify this column in the table schema, but you can reference it in SQL when the table uses `_tidb_rowid` as its internal row ID.
+
+In the current implementation, `_tidb_rowid` is an extra `BIGINT NOT NULL` column automatically managed by TiDB.
+
+> **Warning:**
+>
+> - Do not assume that `_tidb_rowid` is globally unique in all cases. For partitioned tables that do not use clustered indexes, executing `ALTER TABLE ... EXCHANGE PARTITION` might result in duplicate `_tidb_rowid` values across different partitions.
+> - If you need a stable unique identifier, define and use an explicit primary key instead of relying on `_tidb_rowid`.
+
+## When `_tidb_rowid` is available
+
+TiDB uses `_tidb_rowid` to identify each row when a table does not use a clustered primary key as the unique row identifier. In practice, this means that the following types of tables use `_tidb_rowid`:
+
+- Tables without primary keys
+- Tables with primary keys that are explicitly defined as `NONCLUSTERED`
+
+`_tidb_rowid` is not available for tables that use a clustered index (that is, tables whose primary key is defined as `CLUSTERED`, regardless of whether it is a single-column or composite primary key).
+
+The following example shows the difference:
+
+```sql
+CREATE TABLE t1 (a INT, b VARCHAR(20));
+CREATE TABLE t2 (id BIGINT PRIMARY KEY NONCLUSTERED, a INT);
+CREATE TABLE t3 (id BIGINT PRIMARY KEY CLUSTERED, a INT);
+```
+
+For `t1` and `t2`, you can query `_tidb_rowid` because these tables do not use a clustered index as the row identifier:
+
+```sql
+SELECT _tidb_rowid, a, b FROM t1;
+SELECT _tidb_rowid, id, a FROM t2;
+```
+
+For `t3`, `_tidb_rowid` is unavailable because the clustered primary key is already the row identifier:
+
+```sql
+SELECT _tidb_rowid, id, a FROM t3;
+```
+
+```sql
+ERROR 1054 (42S22): Unknown column '_tidb_rowid' in 'field list'
+```
+
+## Read `_tidb_rowid`
+
+For tables that use `_tidb_rowid`, you can query `_tidb_rowid` in `SELECT` statements. This is useful for tasks such as pagination, troubleshooting, and batch processing.
+
+Example:
+
+```sql
+CREATE TABLE t (a INT, b VARCHAR(20));
+INSERT INTO t VALUES (1, 'x'), (2, 'y');
+
+SELECT _tidb_rowid, a, b FROM t ORDER BY _tidb_rowid;
+```
+
+```sql
++-------------+---+---+
+| _tidb_rowid | a | b |
++-------------+---+---+
+|           1 | 1 | x |
+|           2 | 2 | y |
++-------------+---+---+
+```
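+
+As a rough sketch of the batch-processing use case mentioned above, the following query reads `t` in chunks ordered by `_tidb_rowid`. The batch size of `1000` and the starting value `0` are illustrative assumptions, not recommendations, and the sketch assumes that all row IDs in the table are positive.
+
+```sql
+-- Read one batch of rows whose row ID is greater than the last one processed.
+-- Replace 0 with the largest _tidb_rowid returned by the previous batch.
+SELECT _tidb_rowid, a, b
+FROM t
+WHERE _tidb_rowid > 0
+ORDER BY _tidb_rowid
+LIMIT 1000;
+```
+
+Repeat the query with the updated lower bound until it returns no rows. On partitioned tables, keep in mind that `_tidb_rowid` values might not be unique across partitions, so this pattern is safest on non-partitioned tables.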
+
+To view the next value that TiDB will allocate for the row ID, use `SHOW TABLE ... NEXT_ROW_ID`:
+
+```sql
+SHOW TABLE t NEXT_ROW_ID;
+```
+
+```sql
++-----------------------+------------+-------------+--------------------+-------------+
+| DB_NAME               | TABLE_NAME | COLUMN_NAME | NEXT_GLOBAL_ROW_ID | ID_TYPE     |
++-----------------------+------------+-------------+--------------------+-------------+
+| update_doc_rowid_test | t          | _tidb_rowid |              30001 | _TIDB_ROWID |
++-----------------------+------------+-------------+--------------------+-------------+
+```
+
+## Write `_tidb_rowid`
+
+By default, TiDB does not allow `INSERT`, `REPLACE`, or `UPDATE` statements to write `_tidb_rowid` directly.
+
+```sql
+INSERT INTO t(_tidb_rowid, a, b) VALUES (101, 4, 'w');
+```
+
+```sql
+ERROR 1105 (HY000): insert, update and replace statements for _tidb_rowid are not supported
+```
+
+If you need to preserve the original row IDs during data import or migration, enable the [`tidb_opt_write_row_id`](/system-variables.md#tidb_opt_write_row_id) system variable first:
+
+```sql
+SET @@tidb_opt_write_row_id = ON;
+INSERT INTO t(_tidb_rowid, a, b) VALUES (100, 3, 'z');
+SET @@tidb_opt_write_row_id = OFF;
+
+SELECT _tidb_rowid, a, b FROM t WHERE _tidb_rowid = 100;
+```
+
+```sql
++-------------+---+---+
+| _tidb_rowid | a | b |
++-------------+---+---+
+|         100 | 3 | z |
++-------------+---+---+
+```
+
+> **Warning:**
+>
+> `tidb_opt_write_row_id` is intended for import and migration scenarios. It is not recommended for regular application writes.
+
+## Restrictions
+
+- You cannot create a user column named `_tidb_rowid`.
+- You cannot rename an existing user column to `_tidb_rowid`.
+- `_tidb_rowid` is an internal column in TiDB. Do not treat it as a business primary key or a long-term identifier.
+- On partitioned non-clustered tables, `_tidb_rowid` values are not guaranteed to be unique across partitions. After you execute `EXCHANGE PARTITION`, different partitions can contain rows with the same `_tidb_rowid` value.
+- Whether `_tidb_rowid` exists depends on the table schema. For tables with clustered indexes, use the primary key as the row identifier.
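+
+The `EXCHANGE PARTITION` restriction above can be explored with a sketch like the following. The table names `pt` and `nt` are hypothetical, and the example assumes a TiDB version in which `EXCHANGE PARTITION` is enabled. The exact `_tidb_rowid` values depend on how TiDB allocates row IDs, so no output is shown here.
+
+```sql
+-- Hypothetical partitioned table without a clustered index.
+CREATE TABLE pt (id INT, v INT)
+PARTITION BY RANGE (id) (
+    PARTITION p0 VALUES LESS THAN (10),
+    PARTITION p1 VALUES LESS THAN (20)
+);
+-- Hypothetical non-partitioned table with the same column definitions.
+CREATE TABLE nt (id INT, v INT);
+
+INSERT INTO pt VALUES (1, 1);
+INSERT INTO nt VALUES (12, 12);
+
+-- Swap the data of partition p1 with the data of nt.
+-- The exchanged rows keep the row IDs that were allocated in nt.
+ALTER TABLE pt EXCHANGE PARTITION p1 WITH TABLE nt;
+
+-- Check whether rows in different partitions now share a _tidb_rowid value.
+SELECT _tidb_rowid, id, v FROM pt ORDER BY _tidb_rowid, id;
+```
+
+If the query returns the same `_tidb_rowid` for rows in `p0` and `p1`, the table illustrates why `_tidb_rowid` should not be used as a unique identifier on partitioned tables.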
+
+## Address hotspot issues
+
+For tables that use `_tidb_rowid`, TiDB allocates row IDs in increasing order by default. In write-intensive workloads, this can create write hotspots.
+
+To mitigate this issue, consider using [`SHARD_ROW_ID_BITS`](/shard-row-id-bits.md) to distribute row IDs more evenly, and use [`PRE_SPLIT_REGIONS`](/sql-statements/sql-statement-split-region.md#pre_split_regions) to pre-split Regions when necessary.
+
+Example:
+
+```sql
+CREATE TABLE t_sharded (
+    id BIGINT PRIMARY KEY NONCLUSTERED,
+    c INT
+) SHARD_ROW_ID_BITS = 4;
+```
+
+`SHARD_ROW_ID_BITS` applies only to tables that use `_tidb_rowid` and does not apply to tables with clustered indexes.
+
+## Related statements and variables
+
+- [`SHOW TABLE NEXT_ROW_ID`](/sql-statements/sql-statement-show-table-next-rowid.md): shows the next row ID that TiDB will allocate
+- [`SHARD_ROW_ID_BITS`](/shard-row-id-bits.md): shards implicit row IDs to reduce hotspots
+- [Clustered indexes](/clustered-indexes.md): explains when a table uses the primary key instead of `_tidb_rowid`
+- [`tidb_opt_write_row_id`](/system-variables.md#tidb_opt_write_row_id): controls whether writes to `_tidb_rowid` are allowed
+
+## See also
+
+- [`CREATE TABLE`](/sql-statements/sql-statement-create-table.md)
+- [`AUTO_INCREMENT`](/auto-increment.md)
+- [Non-transactional DML](/non-transactional-dml.md)