Commit 5b3e9ea
perf: optimize shuffle writer with buffered I/O and fix file size bug
1. Add BufWriter for buffered file I/O in shuffle writer
Wrapping File with BufWriter reduces syscalls when writing
multiple small batches to shuffle files. This affects both
the hash-partitioned shuffle path and the non-partitioned
write_stream_to_disk utility.
2. Fix file size read before writer finish
Previously, fs::metadata() was called before writer.finish(),
which could report incorrect file sizes since data may not
have been fully flushed to disk. This is especially important
now that BufWriter is used, as buffered data would not be
reflected in the file size until after finish() flushes it.
Co-Authored-By: Claude Opus 4.5 <[email protected]>1 parent 08544f4 commit 5b3e9ea
2 files changed
+7
-5
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
| 32 | + | |
32 | 33 | | |
33 | 34 | | |
34 | 35 | | |
| |||
105 | 106 | | |
106 | 107 | | |
107 | 108 | | |
108 | | - | |
| 109 | + | |
109 | 110 | | |
110 | 111 | | |
111 | 112 | | |
| |||
295 | 296 | | |
296 | 297 | | |
297 | 298 | | |
298 | | - | |
| 299 | + | |
299 | 300 | | |
300 | 301 | | |
301 | 302 | | |
| |||
323 | 324 | | |
324 | 325 | | |
325 | 326 | | |
326 | | - | |
327 | 327 | | |
| 328 | + | |
328 | 329 | | |
329 | 330 | | |
330 | 331 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| 34 | + | |
34 | 35 | | |
35 | 36 | | |
36 | 37 | | |
| |||
148 | 149 | | |
149 | 150 | | |
150 | 151 | | |
151 | | - | |
| 152 | + | |
152 | 153 | | |
153 | 154 | | |
154 | | - | |
| 155 | + | |
155 | 156 | | |
156 | 157 | | |
157 | 158 | | |
| |||
0 commit comments