Commit 639ed15
Recreate group_values after spill merge to fix duplicate group keys (apache#20724)
When switching to streaming merge after spill, group_ordering is set to
Full but group_values is not recreated. The existing GroupValuesColumn<false>
uses vectorized_intern which can produce non-monotonic group indices,
violating GroupOrderingFull's assumption and causing duplicate groups
in the output.
Fix: recreate group_values with the correct streaming mode after
updating group_ordering in update_merged_stream().1 parent b7ccb53 commit 639ed15
1 file changed
+12
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1267 | 1267 | | |
1268 | 1268 | | |
1269 | 1269 | | |
| 1270 | + | |
| 1271 | + | |
| 1272 | + | |
| 1273 | + | |
| 1274 | + | |
| 1275 | + | |
| 1276 | + | |
| 1277 | + | |
| 1278 | + | |
| 1279 | + | |
| 1280 | + | |
| 1281 | + | |
1270 | 1282 | | |
1271 | 1283 | | |
1272 | 1284 | | |
| |||
0 commit comments