Commit a497074
bench: fix actually generate a lot of unique values in benchmark table (#17967)
* fix: actually generate a lot of unique values in benchmark table
Also added a benchmark for testing pure grouping performance on more than one column.
----
I ran this query on the data:
```
SELECT
COUNT(*) AS total_count,
COUNT(DISTINCT u64_wide) AS unique_count,
COUNT(DISTINCT u64_wide) * 1.0 / COUNT(*) AS cardinality
FROM t;
```
Before:
```
| total_count | unique_count | cardinality |
| ----------- | ------------ | ----------- |
| 65536 | 2048 | 0.03125 |
```
After:
```
| total_count | unique_count | cardinality |
| ----------- | ------------ | ----------- |
| 65536 | 65536 | 1.0 |
```
* format
* added multi group by benchmark on primitive only columns

1 parent 590ad29
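
The cardinality figures in the commit message can be reproduced outside SQL. The following is a minimal sketch, not the benchmark's actual data generator: `narrow_column` and `wide_column` are hypothetical helpers, and the wrap-around at 2048 values is only an assumption chosen to match the "Before" numbers. `cardinality` computes the same ratio as `COUNT(DISTINCT u64_wide) * 1.0 / COUNT(*)`.

```rust
use std::collections::HashSet;

/// Fraction of distinct values in `values` (0.0 for an empty slice).
/// Same ratio as COUNT(DISTINCT x) * 1.0 / COUNT(*) in the query above.
fn cardinality(values: &[u64]) -> f64 {
    if values.is_empty() {
        return 0.0;
    }
    let unique: HashSet<u64> = values.iter().copied().collect();
    unique.len() as f64 / values.len() as f64
}

/// Hypothetical generator: values repeat after `domain` rows,
/// so at most `domain` distinct values can appear.
fn narrow_column(rows: u64, domain: u64) -> Vec<u64> {
    (0..rows).map(|i| i % domain).collect()
}

/// Hypothetical generator: every row gets its own value.
fn wide_column(rows: u64) -> Vec<u64> {
    (0..rows).collect()
}

fn main() {
    let rows = 65_536;
    // Assumed domain of 2048 values, chosen to match the "Before" table.
    let before = narrow_column(rows, 2_048);
    let after = wide_column(rows);
    println!("before: cardinality = {}", cardinality(&before));
    println!("after:  cardinality = {}", cardinality(&after));
}
```

With 65536 rows, the narrow column gives 2048 / 65536 = 0.03125 and the wide column gives 1.0, the same ratios as the two tables. A grouping benchmark over the narrow column only ever sees 2048 groups, which is why the low cardinality matters for a column meant to be "wide".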
2 files changed, +38 -5 lines changed
- datafusion/core/benches
- data_utils

First file: one hunk, 32 lines added (new lines 156-187).
Second file: two hunks. Original lines 84-87 are replaced by new lines 84-88, and original line 119 is replaced by new line 120.
[Diff bodies omitted; only the line-number ranges survive.]