Commit d3bec28

docs: remove dead link to sundy-li blog post
1 parent 4988dd2 commit d3bec28

2 files changed: 0 additions, 4 deletions


docs/cn/developer/20-community/02-rfcs/20220729-recluster.md

Lines changed: 0 additions & 2 deletions
@@ -13,8 +13,6 @@ description: RFC for recluster a clustered table
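
 ## Design

-For more detailed principles and pictures, please refer to [snowflake auto clustering](https://sundy-li.github.io/posts/探索snowflake-auto-clustering/)
-
 Performing a full table sort is very expensive, especially for tables that continuously receive new data. To strike a balance between efficient pruning and low cost, a table only needs to be roughly sorted rather than fully sorted. Therefore, two metrics are introduced in [Metrics](#metrics) to determine whether a table is well clustered. The goal of reclustering is to reduce `overlap` and `depth`.

 To avoid repeatedly processing the same piece of data, we divide the blocks into different levels, similar to an LSM tree. Reclustering is similar to the LSM compaction operation. The `level` indicates the number of times the data in that block has been clustered. The recluster operation is performed on blocks of the same level.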
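The `overlap` and `depth` metrics are defined in the RFC's Metrics section, which this diff does not show. As a rough illustration only, the Python sketch below assumes each block carries a closed `[min_key, max_key]` range on the cluster key, takes a block's overlap to be the number of other blocks whose ranges intersect it, and takes depth to be the largest number of blocks covering any single key. `Block`, `overlap_counts`, and `max_depth` are hypothetical names, not part of any real API.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Block:
    """Hypothetical block summary: closed key range [min_key, max_key]."""
    min_key: int
    max_key: int


def ranges_intersect(a: Block, b: Block) -> bool:
    # Two closed ranges intersect when each starts before the other ends.
    return a.min_key <= b.max_key and b.min_key <= a.max_key


def overlap_counts(blocks: List[Block]) -> List[int]:
    # Assumed definition: for each block, count the other blocks it intersects.
    return [
        sum(1 for j, other in enumerate(blocks) if j != i and ranges_intersect(b, other))
        for i, b in enumerate(blocks)
    ]


def max_depth(blocks: List[Block]) -> int:
    # Sweep line over range endpoints. Kind 0 (open) sorts before kind 1
    # (close), so ranges that merely touch at one key count as overlapping.
    events = []
    for b in blocks:
        events.append((b.min_key, 0))
        events.append((b.max_key, 1))
    events.sort()

    depth = best = 0
    for _, kind in events:
        if kind == 0:
            depth += 1
            best = max(best, depth)
        else:
            depth -= 1
    return best
```

Under these assumptions, a table whose blocks have low overlap counts and a small maximum depth can be pruned well, because a filter on the cluster key touches few blocks.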

docs/en/developer/20-community/02-rfcs/20220729-recluster.md

Lines changed: 0 additions & 2 deletions
@@ -13,8 +13,6 @@ By default, data is stored in tables according to natural dimensions. We need to

 ## Design

-For more detailed principles and pictures, please refer to [snowflake auto clustering](https://sundy-li.github.io/posts/探索snowflake-auto-clustering/).
-
 Performing a full table sort is very expensive, especially for tables that constantly receive new data. To strike a balance between efficient pruning and low cost, the tables only need to be roughly sorted instead of fully sorted. Therefore, two metrics are introduced in [Metrics](#metrics) to determine whether the table is well clustered. The goal of recluster is to reduce `overlap` and `depth`.

 To avoid churning on the same piece of data many times, we divide the blocks into different levels, like LSM trees. The recluster is similar to the LSM compact operation. The `level` represents the number of times the data in that block has been clustered. The recluster operation is performed on blocks of the same level.
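The level rule in the hunk above can be sketched as a toy model in Python. This is an illustration under stated assumptions, not the actual implementation: blocks are plain lists of integer keys, `LeveledBlock` and `recluster_level` are hypothetical names, and the real selection of which blocks to recluster is not described in this diff.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LeveledBlock:
    """Hypothetical block: its rows plus how many times they were clustered."""
    rows: List[int]
    level: int = 0


def recluster_level(blocks: List[LeveledBlock], level: int, block_size: int) -> List[LeveledBlock]:
    # Only blocks at the requested level take part; others are left untouched.
    selected = [b for b in blocks if b.level == level]
    untouched = [b for b in blocks if b.level != level]
    if len(selected) < 2:
        return list(blocks)  # nothing to merge at this level

    # Sort the selected rows together, then cut them into new blocks.
    rows = sorted(r for b in selected for r in b.rows)
    rewritten = [
        LeveledBlock(rows[i:i + block_size], level + 1)  # clustered once more
        for i in range(0, len(rows), block_size)
    ]
    return untouched + rewritten
```

In this model, a block that has just been rewritten moves up one level, so a later pass over level 0 does not touch it again, which is the churn-avoidance property the text attributes to the LSM-like design.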

0 commit comments
