databendlabs
diff --git a/docs/doc/14-sql-commands/00-ddl/20-table/10-ddl-create-table.md b/docs/doc/14-sql-commands/00-ddl/20-table/10-ddl-create-table.md
diff --git a/docs/doc/14-sql-commands/00-ddl/20-table/60-optimize-table.md b/docs/doc/14-sql-commands/00-ddl/20-table/60-optimize-table.md
diff --git a/docs/doc/14-sql-commands/00-ddl/20-table/70-ddl-restore-table.md b/docs/doc/14-sql-commands/00-ddl/20-table/70-ddl-restore-table.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/01-dml-select.md b/docs/doc/14-sql-commands/20-query-syntax/01-query-select.md
rename from docs/doc/14-sql-commands/20-query-syntax/01-dml-select.md
rename to docs/doc/14-sql-commands/20-query-syntax/01-query-select.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/02-dml-with.md b/docs/doc/14-sql-commands/20-query-syntax/02-query-with.md
rename from docs/doc/14-sql-commands/20-query-syntax/02-dml-with.md
rename to docs/doc/14-sql-commands/20-query-syntax/02-query-with.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/03-dml-at.md b/docs/doc/14-sql-commands/20-query-syntax/03-query-at.md
rename from docs/doc/14-sql-commands/20-query-syntax/03-dml-at.md
rename to docs/doc/14-sql-commands/20-query-syntax/03-query-at.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/04-dml-join.md b/docs/doc/14-sql-commands/20-query-syntax/04-query-join.md
rename from docs/doc/14-sql-commands/20-query-syntax/04-dml-join.md
rename to docs/doc/14-sql-commands/20-query-syntax/04-query-join.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/06-query-group-by.md b/docs/doc/14-sql-commands/20-query-syntax/06-query-group-by.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/07-query-group-by-grouping-sets.md b/docs/doc/14-sql-commands/20-query-syntax/07-query-group-by-grouping-sets.md
diff --git a/docs/doc/14-sql-commands/20-query-syntax/08-query-group-by-cube.md b/docs/doc/14-sql-commands/20-query-syntax/08-query-group-by-cube.md
@@ -80,7 +80,7 @@ AS SELECT query
 
 Creates a transient table. 
 
-Transient tables are used to hold transitory data that does not require a data protection or recovery mechanism. Dataebend does not hold historical data for a transient table so you will not be able to query from a previous version of the transient table with the Time Travel feature, for example, the [AT](./../../20-query-syntax/03-dml-at.md) clause in the SELECT statement will not work for transient tables. Please note that you can still [drop](./20-ddl-drop-table.md) and [undrop](./21-ddl-undrop-table.md) a transient table.
+Transient tables are used to hold transitory data that does not require a data protection or recovery mechanism. Databend does not hold historical data for a transient table, so you cannot query a previous version of the table with the Time Travel feature. For example, the [AT](./../../20-query-syntax/03-query-at.md) clause in the SELECT statement will not work for transient tables. Please note that you can still [drop](./20-ddl-drop-table.md) and [undrop](./21-ddl-undrop-table.md) a transient table.
 
 Transient tables help save your storage expenses because they do not need extra space for historical data compared to non-transient tables. See [example](#create-transient-table-1) for detailed explanations.
 
 
@@ -14,7 +14,7 @@ Snapshot, segment, and block are the concepts Databend uses for data storage. Da
 
 Databend automatically creates table snapshots upon data updates. A snapshot represents a version of the table's segment metadata.
 
-When working with Databend, you're most likely to access a snapshot with the snapshot ID when you retrieve and query a previous version of the table's data with the [AT](../../20-query-syntax/03-dml-at.md) clause.
+When working with Databend, you're most likely to access a snapshot with the snapshot ID when you retrieve and query a previous version of the table's data with the [AT](../../20-query-syntax/03-query-at.md) clause.
 
 A snapshot is a JSON file that does not save the table's data but indicates the segments the snapshot links to. If you run [FUSE_SNAPSHOT](../../../15-sql-functions/111-system-functions/fuse_snapshot.md) against a table, you can find the saved snapshots for the table.
 
 
@@ -14,7 +14,7 @@ The capability to restore a table is subject to these conditions:
 
 - You cannot roll back after restoring a table to a prior state, but you can restore the table again to an earlier state.
 
-- Databend recommends this command for emergency recovery only. To query the history data of a table, use the [AT](../../20-query-syntax/03-dml-at.md) clause.
+- Databend recommends this command for emergency recovery only. To query the history data of a table, use the [AT](../../20-query-syntax/03-query-at.md) clause.
 
 ## Syntax
 
 
@@ -98,7 +98,7 @@ SELECT number FROM numbers(3) AS a;
 
 ## AT Clause
 
-The AT clause enables you to query previous versions of your data. For more information, see [AT](./03-dml-at.md).
+The AT clause enables you to query previous versions of your data. For more information, see [AT](./03-query-at.md).
 
 ## WHERE Clause
 
@@ -143,7 +143,7 @@ SELECT number%2 as c1, number%3 as c2, MAX(number) FROM numbers(10000) GROUP BY
 ```
 
 
-`GROUP BY` can be extended with [GROUPING SETS](./21-grouping-sets.md) to do more complex grouping operations.
+`GROUP BY` can be extended with [GROUPING SETS](./07-query-group-by-grouping-sets.md) to do more complex grouping operations.
 
 ## HAVING Clause
 
 
@@ -0,0 +1,132 @@
+---
+title: GROUP BY
+---
+
+The GROUP BY clause in Databend SQL allows you to group rows sharing the same group-by-item expressions and apply aggregate functions to the resulting groups. A group-by-item expression can be a column name, a number referencing a position in the [SELECT](./01-query-select.md) list, or a general expression.
+
+Extensions include [GROUP BY CUBE](./08-query-group-by-cube.md), [GROUP BY GROUPING SETS](./07-query-group-by-grouping-sets.md), and [GROUP BY ROLLUP](./09-query-group-by-rollup.md).
+
+## Syntax
+
+```sql
+SELECT ...
+    FROM ...
+    [ ... ]
+GROUP BY groupItem [ , groupItem [ , ... ] ]
+    [ ... ]
+```
+
+Where:
+```sql
+groupItem ::= { <column_alias> | <position> | <expr> }
+```
+
+- `<column_alias>`: Column alias appearing in the query block’s SELECT list
+
+- `<position>`: Position of an expression in the SELECT list
+
+- `<expr>`: Any expression on tables in the current scope
+
+
+## Examples
+
+Sample Data Setup:
+```sql
+-- Create a sample employees table
+CREATE TABLE employees (
+    id INT,
+    first_name VARCHAR(50),
+    last_name VARCHAR(50),
+    department_id INT,
+    job_id INT,
+    hire_date DATE
+);
+
+-- Insert sample data into the employees table
+INSERT INTO employees (id, first_name, last_name, department_id, job_id, hire_date)
+VALUES (1, 'John', 'Doe', 1, 101, '2021-01-15'),
+       (2, 'Jane', 'Smith', 1, 101, '2021-02-20'),
+       (3, 'Alice', 'Johnson', 1, 102, '2021-03-10'),
+       (4, 'Bob', 'Brown', 2, 201, '2021-03-15'),
+       (5, 'Charlie', 'Miller', 2, 202, '2021-04-10'),
+       (6, 'Eve', 'Davis', 2, 202, '2021-04-15');
+```
+
+### Group By One Column
+
+This query groups employees by their `department_id` and counts the number of employees in each department:
+```sql
+SELECT department_id, COUNT(*) AS num_employees
+FROM employees
+GROUP BY department_id;
+```
+
+Output:
+```sql
++---------------+---------------+
+| department_id | num_employees |
++---------------+---------------+
+|             1 |             3 |
+|             2 |             3 |
++---------------+---------------+
+```
+
+### Group By Multiple Columns
+
+This query groups employees by `department_id` and `job_id`, then counts the number of employees in each group:
+```sql
+SELECT department_id, job_id, COUNT(*) AS num_employees
+FROM employees
+GROUP BY department_id, job_id;
+```
+
+Output:
+```sql
++---------------+--------+---------------+
+| department_id | job_id | num_employees |
++---------------+--------+---------------+
+|             1 |    101 |             2 |
+|             1 |    102 |             1 |
+|             2 |    201 |             1 |
+|             2 |    202 |             2 |
++---------------+--------+---------------+
+```
+
+### Group By Position
+
+This query is equivalent to the "Group By One Column" example above. The position 1 refers to the first item in the SELECT list, which is `department_id`:
+```sql
+SELECT department_id, COUNT(*) AS num_employees
+FROM employees
+GROUP BY 1;
+```
+
+Output:
+```sql
++---------------+---------------+
+| department_id | num_employees |
++---------------+---------------+
+|             1 |             3 |
+|             2 |             3 |
++---------------+---------------+
+```
+
+
+### Group By Expression
+
+This query groups employees by the year they were hired and counts the number of employees hired in each year:
+```sql
+SELECT EXTRACT(YEAR FROM hire_date) AS hire_year, COUNT(*) AS num_hires
+FROM employees
+GROUP BY EXTRACT(YEAR FROM hire_date);
+```
+
+Output:
+```sql
++-----------+-----------+
+| hire_year | num_hires |
++-----------+-----------+
+|      2021 |         6 |
++-----------+-----------+
+```
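+
+### Group By Column Alias
+
+The syntax above also lists `<column_alias>` as a group item. As a minimal sketch, assuming the alias is resolved as described there, this query groups by the `hire_year` alias instead of repeating the expression and should return the same result as the "Group By Expression" example:
+```sql
+SELECT EXTRACT(YEAR FROM hire_date) AS hire_year, COUNT(*) AS num_hires
+FROM employees
+GROUP BY hire_year;
+```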
+
@@ -0,0 +1,135 @@
+---
+title: GROUP BY GROUPING SETS
+---
+
+`GROUP BY GROUPING SETS` is an extension of the [GROUP BY](./06-query-group-by.md) clause that computes multiple groupings in a single statement. Each grouping set is a set of dimension columns.
+
+`GROUP BY GROUPING SETS` is equivalent to the UNION of two or more GROUP BY operations in the same result set:
+
+- `GROUP BY GROUPING SETS((a))` is equivalent to the single grouping set operation `GROUP BY a`.
+
+- `GROUP BY GROUPING SETS((a),(b))` is equivalent to `GROUP BY a UNION ALL GROUP BY b`.
+
+## Syntax
+
+```sql
+SELECT ...
+FROM ...
+[ ... ]
+GROUP BY GROUPING SETS ( groupSet [ , groupSet [ , ... ] ] )
+[ ... ]
+```
+
+Where:
+```sql
+groupSet ::= { <column_alias> | <position> | <expr> }
+```
+
+- `<column_alias>`: Column alias appearing in the query block’s SELECT list
+
+- `<position>`: Position of an expression in the SELECT list
+
+- `<expr>`: Any expression on tables in the current scope
+
+
+## Examples
+
+Sample Data Setup:
+```sql
+-- Create a sample sales table
+CREATE TABLE sales (
+    id INT,
+    sale_date DATE,
+    product_id INT,
+    store_id INT,
+    quantity INT
+);
+
+-- Insert sample data into the sales table
+INSERT INTO sales (id, sale_date, product_id, store_id, quantity)
+VALUES (1, '2021-01-01', 101, 1, 5),
+       (2, '2021-01-01', 102, 1, 10),
+       (3, '2021-01-01', 101, 2, 15),
+       (4, '2021-01-02', 102, 1, 8),
+       (5, '2021-01-02', 101, 2, 12),
+       (6, '2021-01-02', 103, 2, 20);
+```
+
+### GROUP BY GROUPING SETS with column aliases
+
+```sql
+SELECT product_id AS pid,
+       store_id AS sid,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY GROUPING SETS((pid), (sid));
+```
+
+This query is equivalent to:
+
+```sql
+SELECT product_id AS pid,
+       NULL AS sid,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY pid
+UNION ALL
+SELECT NULL AS pid,
+       store_id AS sid,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY sid;
+```
+
+Output:
+```sql
++------+------+----------------+
+| pid  | sid  | total_quantity |
++------+------+----------------+
+|  102 | NULL |             18 |
+| NULL |    2 |             47 |
+|  101 | NULL |             32 |
+|  103 | NULL |             20 |
+| NULL |    1 |             23 |
++------+------+----------------+
+```
+
+### GROUP BY GROUPING SETS with positions
+
+```sql
+SELECT product_id,
+       store_id,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY GROUPING SETS((1), (2));
+```
+
+This query is equivalent to:
+
+```sql
+SELECT product_id,
+       NULL AS store_id,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY product_id
+UNION ALL
+SELECT NULL AS product_id,
+       store_id,
+       SUM(quantity) AS total_quantity
+FROM sales
+GROUP BY store_id;
+```
+
+Output:
+```sql
++------------+----------+----------------+
+| product_id | store_id | total_quantity |
++------------+----------+----------------+
+|        102 |     NULL |             18 |
+|       NULL |        2 |             47 |
+|        101 |     NULL |             32 |
+|        103 |     NULL |             20 |
+|       NULL |        1 |             23 |
++------------+----------+----------------+
+```
+
@@ -0,0 +1,77 @@
+---
+title: GROUP BY CUBE
+---
+
+`GROUP BY CUBE` is an extension of the [GROUP BY](./06-query-group-by.md) clause similar to [GROUP BY ROLLUP](./09-query-group-by-rollup.md). In addition to producing all the rows of a `GROUP BY ROLLUP`, `GROUP BY CUBE` adds all the "cross-tabulation" rows. Sub-total rows are rows that aggregate further; their values are derived by computing the same aggregate functions that were used to produce the grouped rows.
+
+A `CUBE` grouping is equivalent to a series of grouping sets and is essentially a shorter specification. A `CUBE` specification with N elements corresponds to `2^N` grouping sets.
+
+## Syntax
+
+```sql
+SELECT ...
+FROM ...
+[ ... ]
+GROUP BY CUBE ( groupCube [ , groupCube [ , ... ] ] )
+[ ... ]
+```
+
+Where:
+```sql
+groupCube ::= { <column_alias> | <position> | <expr> }
+```
+
+- `<column_alias>`: Column alias appearing in the query block’s SELECT list
+
+- `<position>`: Position of an expression in the SELECT list
+
+- `<expr>`: Any expression on tables in the current scope
+
+
+## Examples
+
+Let's assume we have a `sales_data` table with the following schema and sample data:
+
+```sql
+CREATE TABLE sales_data (
+  region VARCHAR(255),
+  product VARCHAR(255),
+  sales_amount INT
+);
+
+INSERT INTO sales_data (region, product, sales_amount) VALUES
+  ('North', 'WidgetA', 200),
+  ('North', 'WidgetB', 300),
+  ('South', 'WidgetA', 400),
+  ('South', 'WidgetB', 100),
+  ('West', 'WidgetA', 300),
+  ('West', 'WidgetB', 200);
+```
+
+Now, let's use the `GROUP BY CUBE` clause to get the total sales amount for each region and product, along with all possible aggregations:
+
+```sql
+SELECT region, product, SUM(sales_amount) AS total_sales
+FROM sales_data
+GROUP BY CUBE (region, product);
+```
+
+The result will be:
+```sql
++--------+---------+-------------+
+| region | product | total_sales |
++--------+---------+-------------+
+| South  | NULL    |         500 |
+| NULL   | WidgetB |         600 |
+| West   | NULL    |         500 |
+| North  | NULL    |         500 |
+| West   | WidgetB |         200 |
+| NULL   | NULL    |        1500 |
+| North  | WidgetB |         300 |
+| South  | WidgetA |         400 |
+| North  | WidgetA |         200 |
+| NULL   | WidgetA |         900 |
+| West   | WidgetA |         300 |
+| South  | WidgetB |         100 |
++--------+---------+-------------+
+```
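+
+As a sketch of the `2^N` expansion described above, the two-column `CUBE` in this example corresponds to four grouping sets. Assuming Databend accepts multi-column grouping sets and the empty grouping set `()` (neither is shown elsewhere in this document), the following query should produce the same rows, possibly in a different order:
+```sql
+SELECT region, product, SUM(sales_amount) AS total_sales
+FROM sales_data
+GROUP BY GROUPING SETS ((region, product), (region), (product), ());
+```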