💬Generate LLM translations (#2177)

github-actions[bot] · web-flow · commit 48035c220c25 · 2025-05-20T17:13:06.000+08:00
Co-authored-by: github-actions[bot] &lt;41898282+github-actions[bot]@users.noreply.github.com&gt;
diff --git a/docs/cn/guides/51-ai-functions/index.md b/docs/cn/guides/51-ai-functions/index.md
@@ -13,7 +13,7 @@ Databend 依赖 [Azure OpenAI Service](https://azure.microsoft.com/en-us/product
 
 ## 什么是嵌入 (Embeddings)？
 
-嵌入是文本数据的向量表示，它捕获了原始文本的语义和上下文含义。它们可用于比较和分析各种自然语言处理任务中的文本，例如文档相似性、聚类和推荐系统。
+嵌入是文本数据的向量表示，它捕获原始文本的语义和上下文含义。它们可用于比较和分析各种自然语言处理任务中的文本，例如文档相似性、聚类和推荐系统。
 
 为了说明嵌入的工作原理，让我们考虑一个简单的例子。假设我们有以下句子：
 1. `"The cat sat on the mat."`
@@ -30,7 +30,7 @@ Databend 依赖 [Azure OpenAI Service](https://azure.microsoft.com/en-us/product
 2. `[0.25, 0.29, 0.11, 0.71, 0.38]`
 3. `[-0.1, 0.5, 0.6, -0.3, 0.8]`
 
-在这个简化的例子中，您可以看到句子 1 和句子 2 的嵌入在向量空间中彼此更接近，而句子 3 的嵌入则更远。这说明了嵌入如何捕获语义关系并用于比较和分析文本数据。
+在这个简化的例子中，您可以看到句子 1 和句子 2 的嵌入在向量空间中彼此更接近，而句子 3 的嵌入则更远。这说明了嵌入如何捕获语义关系，并可用于比较和分析文本数据。
 
 ## 什么是向量数据库？
 
@@ -42,7 +42,7 @@ Databend 提供了内置的 AI 函数，用于各种自然语言处理任务。
 
 - [ai_embedding_vector](/sql/sql-functions/ai-functions/ai-embedding-vector): 为文本文档生成嵌入。
 - [ai_text_completion](/sql/sql-functions/ai-functions/ai-text-completion): 根据给定的提示生成文本补全。
-- [cosine_distance](/sql/sql-functions/ai-functions/ai-cosine-distance): 计算两个嵌入之间的余弦距离。
+- [cosine_distance](/sql/sql-functions/vector-distance-functions/vector-cosine-distance): 计算两个嵌入之间的余弦距离。
 
 ## 生成嵌入
 
@@ -67,7 +67,7 @@ VALUES
 
 ## 计算余弦距离
 
-现在，让我们使用 [cosine_distance](/sql/sql-functions/ai-functions/ai-cosine-distance) 函数找到与给定查询最相似的文档：
+现在，让我们使用 [cosine_distance](/sql/sql-functions/vector-distance-functions/vector-cosine-distance) 函数找到与给定查询最相似的文档：
 ```sql
 SELECT
     id,
@@ -112,11 +112,11 @@ completion: and machine learning. It is known for its simplicity, readability, a
 
 您可以在我们的 [Databend Cloud](https://databend.com) 上体验这些功能，您可以在这里注册免费试用版并立即开始使用这些 AI 功能。
 
-Databend 的 AI 函数设计为易于使用，即使对于不熟悉机器学习或自然语言处理的用户也是如此。借助 Databend，您可以快速轻松地将强大的 AI 功能添加到您的 SQL 查询中，并将您的数据分析提升到一个新的水平。
+Databend 的 AI 功能旨在易于使用，即使对于不熟悉机器学习或自然语言处理的用户也是如此。借助 Databend，您可以快速轻松地将强大的 AI 功能添加到您的 SQL 查询中，并将您的数据分析提升到一个新的水平。
 
 ## 使用 Databend 构建 AI 问答系统
 
-我们已经利用 [Databend Cloud](https://databend.com) 和 AI 函数为我们的文档构建了一个 AI 问答系统。
+我们利用 [Databend Cloud](https://databend.com) 和 AI 函数为我们的文档构建了一个 AI 问答系统。
 
 以下是构建它的分步指南：
 
@@ -151,28 +151,28 @@ WHERE LENGTH(embedding) = 0;
 ### 步骤 4：提问并检索相关答案
 
 ```sql
--- Define the question as a CTE (Common Table Expression)
+-- 将问题定义为 CTE（公共表表达式）
 WITH question AS (
     SELECT 'Tell me the ai functions' AS q
 ),
--- Calculate the question's embedding vector
+-- 计算问题的嵌入向量
 question_embedding AS (
     SELECT ai_embedding_vector((SELECT q FROM question)) AS q_vector
 ),
--- Retrieve the top 3 most relevant documents
+-- 检索前 3 个最相关的文档
 top_3_docs AS (
     SELECT content,
            cosine_distance((SELECT q_vector FROM question_embedding), embedding) AS dist
     FROM doc
     ORDER BY dist ASC
     LIMIT 3
 ),
--- Combine the content of the top 3 documents
+-- 合并前 3 个文档的内容
 combined_content AS (
     SELECT string_agg(content, ' ') AS aggregated_content
     FROM top_3_docs
 ),
--- Concatenate a custom prompt, the combined content, and the original question
+-- 连接自定义提示、合并的内容和原始问题
 prompt AS (
     SELECT CONCAT(
                'Utilizing the sections provided from the Databend documentation, answer the questions to the best of your ability. ',
@@ -182,7 +182,7 @@ prompt AS (
                (SELECT q FROM question)
            ) as p
 )
--- Pass the concatenated text to the ai_text_completion function to generate a coherent and relevant response
+-- 将连接的文本传递给 ai_text_completion 函数以生成连贯且相关的响应
 SELECT ai_text_completion((SELECT p FROM prompt)) AS answer;
 ```
 
diff --git a/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/00-vector-cosine-distance.md b/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/00-vector-cosine-distance.md
@@ -0,0 +1,82 @@
+---
+title: 'COSINE_DISTANCE'
+description: '在 Databend 中使用 cosine_distance 函数测量相似度'
+---
+
+计算两个向量之间的余弦距离，测量它们的相异程度。
+
+## 语法
+
+```sql
+COSINE_DISTANCE(vector1, vector2)
+```
+
+## 参数
+
+- `vector1`: 第一个向量 (ARRAY(FLOAT32 NOT NULL))
+- `vector2`: 第二个向量 (ARRAY(FLOAT32 NOT NULL))
+
+## 返回值
+
+返回一个介于 0 和 1 之间的 FLOAT 值：
+- 0：相同的向量（完全相似）
+- 1：正交向量（完全不相似）
+
+## 描述
+
+余弦距离测量两个向量之间基于它们之间角度的相异度，而不管它们的大小。该函数：
+
+1. 验证两个输入向量是否具有相同的长度
+2. 计算两个向量的元素乘积之和（点积）
+3. 计算每个向量的平方和的平方根（向量大小）
+4. 返回 `1 - (dot_product / (magnitude1 * magnitude2))`
+
+实现的数学公式为：
+
+```
+cosine_distance(v1, v2) = 1 - (Σ(v1ᵢ * v2ᵢ) / (√Σ(v1ᵢ²) * √Σ(v2ᵢ²)))
+```
+
+其中 v1ᵢ 和 v2ᵢ 是输入向量的元素。
+
+:::info
+此函数在 Databend 中执行向量计算，不依赖于外部 API。
+:::
+
+
+## 示例
+
+创建一个包含向量数据的表：
+
+```sql
+CREATE OR REPLACE TABLE vectors (
+    id INT,
+    vec ARRAY(FLOAT32 NOT NULL)
+);
+
+INSERT INTO vectors VALUES
+    (1, [1.0000, 2.0000, 3.0000]),
+    (2, [1.0000, 2.2000, 3.0000]),
+    (3, [4.0000, 5.0000, 6.0000]);
+```
+
+找到与 [1, 2, 3] 最相似的向量：
+
+```sql
+SELECT 
+    vec, 
+    COSINE_DISTANCE(vec, [1.0000, 2.0000, 3.0000]) AS distance
+FROM 
+    vectors
+ORDER BY 
+    distance ASC
+LIMIT 1;
+```
+
+```
++-------------------------+----------+
+| vec                     | distance |
++-------------------------+----------+
+| [1.0000,2.2000,3.0000]  | 0.0      |
++-------------------------+----------+
+```
diff --git a/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/01-vector-l2-distance.md b/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/01-vector-l2-distance.md
@@ -0,0 +1,84 @@
+---
+title: 'L2_DISTANCE'
+description: '在 Databend 中测量向量之间的欧几里得距离'
+---
+
+计算两个向量之间的欧几里得（L2）距离，测量它们在向量空间中的直线距离。
+
+## 语法
+
+```sql
+L2_DISTANCE(vector1, vector2)
+```
+
+## 参数
+
+- `vector1`: 第一个向量 (ARRAY(FLOAT32 NOT NULL))
+- `vector2`: 第二个向量 (ARRAY(FLOAT32 NOT NULL))
+
+## 返回值
+
+返回一个 FLOAT 值，表示两个向量之间的欧几里得（L2）距离。该值始终为非负数：
+- 0：相同的向量
+- 较大的值：距离较远的向量
+
+## 描述
+
+L2 距离，也称为欧几里得距离，测量欧几里得空间中两点之间的直线距离。它是向量相似性搜索和机器学习应用中最常用的指标之一。
+
+该函数：
+
+1. 验证两个输入向量是否具有相同的长度
+2. 计算对应元素之间平方差的总和
+3. 返回该总和的平方根
+
+实现的数学公式为：
+
+```
+L2_distance(v1, v2) = √(Σ(v1ᵢ - v2ᵢ)²)
+```
+
+其中 v1ᵢ 和 v2ᵢ 是输入向量的元素。
+
+:::info
+- 此函数在 Databend 中执行向量计算，不依赖于外部 API。
+:::
+
+## 示例
+
+创建一个包含向量数据的表：
+
+```sql
+CREATE OR REPLACE TABLE vectors (
+    id INT,
+    vec ARRAY(FLOAT32 NOT NULL)
+);
+
+INSERT INTO vectors VALUES
+    (1, [1.0000, 2.0000, 3.0000]),
+    (2, [1.0000, 2.2000, 3.0000]),
+    (3, [4.0000, 5.0000, 6.0000]);
+```
+
+使用 L2 距离查找最接近 [1, 2, 3] 的向量：
+
+```sql
+SELECT 
+    id,
+    vec, 
+    L2_DISTANCE(vec, [1.0000, 2.0000, 3.0000]) AS distance
+FROM 
+    vectors
+ORDER BY 
+    distance ASC;
+```
+
+```
++----+-------------------------+----------+
+| id | vec                     | distance |
++----+-------------------------+----------+
+| 1  | [1.0000,2.0000,3.0000]  | 0.0      |
+| 2  | [1.0000,2.2000,3.0000]  | 0.2      |
+| 3  | [4.0000,5.0000,6.0000]  | 5.196152 |
++----+-------------------------+----------+
+```
diff --git a/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/index.md b/docs/cn/sql-reference/20-sql-functions/11-vector-distance-functions/index.md
@@ -0,0 +1,14 @@
+---
+title: '向量距离函数'
+description: 'Databend 中用于相似性度量的向量距离函数'
+---
+
+# 向量距离函数
+
+Databend 提供了用于测量向量之间距离或相似度的函数，这对于向量搜索和机器学习应用至关重要。
+
+## 函数比较
+| 函数 | 描述 | 范围 | 最适合 | 使用场景 |
+|----------|-------------|-------|----------|-----------|
+| [L2_DISTANCE](01-vector-l2-distance.md) | 欧几里得（直线）距离 | [0, ∞) | 当大小很重要时 | • 图像相似性<br/>• 地理数据<br/>• 异常检测<br/>• 基于特征的聚类 |
+| [COSINE_DISTANCE](00-vector-cosine-distance.md) | 向量之间的角度距离 | [0, 1] | 当方向比大小更重要时 | • 文档相似性<br/>• 语义搜索<br/>• 推荐系统<br/>• 文本分析 |