Commit bf85744
Recreate group_values after spill merge to fix duplicate group keys (apache#20724)
When switching to streaming merge after spill, group_ordering is set to
Full but group_values is not recreated. The existing GroupValuesColumn<false>
uses vectorized_intern which can produce non-monotonic group indices,
violating GroupOrderingFull's assumption and causing duplicate groups
in the output.
Fix: recreate group_values with the correct streaming mode after
updating group_ordering in update_merged_stream().1 parent 2947378 commit bf85744
1 file changed
+12
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1233 | 1233 | | |
1234 | 1234 | | |
1235 | 1235 | | |
| 1236 | + | |
| 1237 | + | |
| 1238 | + | |
| 1239 | + | |
| 1240 | + | |
| 1241 | + | |
| 1242 | + | |
| 1243 | + | |
| 1244 | + | |
| 1245 | + | |
| 1246 | + | |
| 1247 | + | |
1236 | 1248 | | |
1237 | 1249 | | |
1238 | 1250 | | |
| |||
0 commit comments