File tree Expand file tree Collapse file tree 2 files changed +22
-44
lines changed
Expand file tree Collapse file tree 2 files changed +22
-44
lines changed Original file line number Diff line number Diff line change @@ -29,11 +29,30 @@ These benchmarks are derived from the [TPC-H][1] benchmark.
2929
3030## Generating Test Data
3131
32- TPC-H data can be generated using the ` tpch-gen.sh ` script, which creates a Docker image containing the TPC-DS data
33- generator.
32+ TPC-H data can be generated using [ tpchgen-rs] ( https://github.com/clflushopt/tpchgen-rs ) , a fast TPC-H data generator written in Rust.
3433
34+ ### Installation
35+
36+ Install via pip:
37+ ``` bash
38+ pip install tpchgen-cli
39+ ```
40+
41+ Or via cargo:
42+ ``` bash
43+ cargo install tpchgen-cli
44+ ```
45+
46+ ### Generating Data
47+
48+ Generate SF=1 data in Parquet format:
49+ ``` bash
50+ tpchgen-cli -s 1 --format parquet --output-dir data
51+ ```
52+
53+ For larger scale factors (e.g., SF=10):
3554``` bash
36- ./tpch-gen.sh
55+ tpchgen-cli -s 10 --format parquet --output-dir data
3756```
3857
3958Data will be generated into the ` data ` subdirectory and will not be checked in because this directory has been added
Load Diff This file was deleted.
You can’t perform that action at this time.
0 commit comments