Commit 2e3ca74

Merge pull request #4883 from ClickHouse/Blargian-patch-388277
Improvement: show insert progress on dbpedia.md
2 parents ae977de + 8856ded commit 2e3ca74

1 file changed: 5 additions, 1 deletion

docs/getting-started/example-datasets/dbpedia.md

Lines changed: 5 additions & 1 deletion
@@ -35,7 +35,11 @@ CREATE TABLE dbpedia
 To load the dataset from all Parquet files, run the following shell command:
 
 ```shell
-$ seq 0 25 | xargs -P1 -I{} clickhouse client -q "INSERT INTO dbpedia SELECT _id, title, text, \"text-embedding-3-large-1536-embedding\" FROM url('https://huggingface.co/api/datasets/Qdrant/dbpedia-entities-openai3-text-embedding-3-large-1536-1M/parquet/default/train/{}.parquet') SETTINGS max_http_get_redirects=5,enable_url_encoding=0;"
+for i in $(seq 0 25); do
+echo "Processing file ${i}..."
+clickhouse client -q "INSERT INTO dbpedia SELECT _id, title, text, \"text-embedding-3-large-1536-embedding\" FROM url('https://huggingface.co/api/datasets/Qdrant/dbpedia-entities-openai3-text-embedding-3-large-1536-1M/parquet/default/train/${i}.parquet') SETTINGS max_http_get_redirects=5,enable_url_encoding=0;"
+echo "File ${i} complete."
+done
 ```
 
 Alternatively, individual SQL statements can be run as shown below to load each of the 25 Parquet files:
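
Note on the added loop: it prints "File ${i} complete." whether or not the preceding `clickhouse client` call succeeded, so a failed insert would still be reported as complete. A minimal sketch of a variant that stops at the first failure is given here. The function name `load_parquet_files` and the `CLICKHOUSE_CMD` override variable are hypothetical and not part of the commit; the query and URL are copied from the diff.

```shell
# Hypothetical helper (not in the commit): load_parquet_files FIRST LAST
# Runs one INSERT per Parquet file index and stops at the first failure.
# CLICKHOUSE_CMD is an assumed override hook; it defaults to "clickhouse client".
load_parquet_files() {
  first=$1
  last=$2
  base='https://huggingface.co/api/datasets/Qdrant/dbpedia-entities-openai3-text-embedding-3-large-1536-1M/parquet/default/train'
  for i in $(seq "$first" "$last"); do
    echo "Processing file ${i}..."
    # Left unquoted on purpose so "clickhouse client" splits into command + argument.
    if ! ${CLICKHOUSE_CMD:-clickhouse client} -q "INSERT INTO dbpedia SELECT _id, title, text, \"text-embedding-3-large-1536-embedding\" FROM url('${base}/${i}.parquet') SETTINGS max_http_get_redirects=5,enable_url_encoding=0;"; then
      echo "File ${i} failed; stopping." >&2
      return 1
    fi
    echo "File ${i} complete."
  done
}
```

Under these assumptions, `load_parquet_files 0 25` would cover the same range as the committed loop, and a failed run could be resumed from the reported index, for example `load_parquet_files 7 25`.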
