README.md: 33 additions & 1 deletion
@@ -138,7 +138,9 @@ cargo build --release

 ### Running Tests

-Run all tests:
+#### Basic Tests
+
+Run all unit tests (fast - excludes TPC-H validation):

 ```bash
 cargo test
@@ -150,6 +152,20 @@ Run tests with output:
 cargo test -- --nocapture
 ```

+#### TPC-H Validation Integration Tests
+
+Run comprehensive TPC-H validation tests that compare distributed DataFusion against regular DataFusion. No prerequisites needed - the tests handle everything automatically!
+
+```bash
+# Run all TPC-H validation tests
+cargo test --test tpch_validation test_tpch_validation_all_queries -- --ignored --nocapture
+
+# Run single query test for debugging
+cargo test --test tpch_validation test_tpch_validation_single_query -- --ignored --nocapture
+```
+
+**Note:** TPC-H validation tests are annotated with #[ignore] to avoid slowing down `cargo test` during development. They're included in the CI pipeline and can be run manually when needed.
+

 ## Usage

 With the code now built and ready, the next step is to set up the server and execute queries. To do that, we'll need a schema and some data—so for this example, we'll use the TPC-H schema and queries.
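
The note in this hunk says the TPC-H validation tests carry `#[ignore]` and only run when `-- --ignored` is passed. A minimal sketch of that pattern is shown here for readers unfamiliar with it. The helper `same_rows` and the test name `validation_sketch` are hypothetical and not taken from the repository, and the order-insensitive comparison is only an assumption about how results from two engines might be compared.

```rust
/// Hypothetical helper (not from the repository): returns true when both
/// result sets contain the same rows, ignoring row order.
fn same_rows(expected: &[Vec<String>], actual: &[Vec<String>]) -> bool {
    let mut expected = expected.to_vec();
    let mut actual = actual.to_vec();
    expected.sort();
    actual.sort();
    expected == actual
}

/// Skipped by a plain `cargo test`; runs only when `-- --ignored` is passed.
#[test]
#[ignore]
fn validation_sketch() {
    // Stand-in rows; a real test would collect these from both engines.
    let single_node = vec![
        vec!["1".to_string(), "10.5".to_string()],
        vec!["2".to_string(), "7.0".to_string()],
    ];
    let distributed = vec![single_node[1].clone(), single_node[0].clone()];
    assert!(same_rows(&single_node, &distributed));
}
```

With this layout, `cargo test` reports the test as ignored, while `cargo test -- --ignored` executes it.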
To pre-create views that queries can reference (such as for TPC-H q15), you can use the `DFRAY_VIEWS` environment variable:
+
+```bash
+# Example: Create a view for TPC-H q15 revenue calculation
+DFRAY_VIEWS="CREATE VIEW revenue0 (supplier_no, total_revenue) AS SELECT l_suppkey, sum(l_extendedprice * (1 - l_discount)) FROM lineitem WHERE l_shipdate >= date '1996-08-01' AND l_shipdate < date '1996-08-01' + interval '3' month GROUP BY l_suppkey"
# Define views required for TPC-H queries (e.g., q15)
+export DFRAY_VIEWS="create view revenue0 (supplier_no, total_revenue) as select l_suppkey, sum(l_extendedprice * (1 - l_discount)) from lineitem where l_shipdate >= date '1996-08-01' and l_shipdate < date '1996-08-01' + interval '3' month group by l_suppkey"
+
 # Array to store worker PIDs and addresses
 declare -a WORKER_PIDS
 declare -a WORKER_ADDRESSES
@@ -146,7 +149,7 @@ for ((i=0; i<NUM_WORKERS; i++)); do