Skip to content

Commit a938cdb

Browse files
committed
update headings
1 parent 65b92c2 commit a938cdb

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

docs/hub/datasets-viewer-sql-console.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -24,9 +24,9 @@ You can also use the DuckDB CLI to query the dataset via the `hf://` protocol. S
2424
</Tip>
2525

2626

27-
# Examples
27+
## Examples
2828

29-
## Leakage Detection
29+
### Leakage Detection
3030

3131
Leakage detection is the process of identifying whether data in a dataset is present in multiple splits, for example, whether the test set is present in the training set.
3232

@@ -69,7 +69,7 @@ SELECT
6969
FROM overlapping_rows, total_unique_rows;
7070
```
7171

72-
## Filtering
72+
### Filtering
7373

7474
The SQL Console makes filtering datasets really easily. For example, if you want to filter the `SkunkworksAI/reasoning-0.01` dataset for instructions and responses with a reasoning length of at least 10, you can use the following query:
7575

@@ -89,7 +89,7 @@ order by reason_len desc
8989

9090
The [bar](https://duckdb.org/docs/sql/functions/char.html#barx-min-max-width) function is a neat built-in DuckDB function that creates a bar chart of the reasoning lengths.
9191

92-
## Histogram
92+
### Histogram
9393

9494
Many dataset authors choose to include statistics about the distribution of the data in the dataset. Using the DuckDB `histogram` function, we can plot a histogram of a column's values.
9595

@@ -107,7 +107,7 @@ For example, to plot a histogram of the `reason_len` column in the `SkunkworksAI
107107
from histogram(train, len(reasoning_chains))
108108
```
109109

110-
## Regex Matching
110+
### Regex Matching
111111

112112
One of the most powerful features of DuckDB is the deep support for regular expressions. You can use the `regexp` function to match patterns in your data.
113113

0 commit comments

Comments
 (0)