Commit 4539dfb
### Rationale for this change
The documentation for [pyarrow.Table.combine_chunks](https://arrow.apache.org/docs/python/generated/pyarrow.Table.html#pyarrow.Table.combine_chunks) and [Table::CombineChunks](https://arrow.apache.org/docs/cpp/api/table.html#_CPPv4NK5arrow5Table13CombineChunksEP10MemoryPool) states: "All the underlying chunks in the ChunkedArray of each column are concatenated into zero or one chunk."
However, [this comment](https://github.com/apache/arrow/blob/d7015bd6e610b6cd6752f6cd543509bd5f8853ff/cpp/src/arrow/table.cc#L567) indicates that binary columns can be combined into multiple chunks. Multiple chunks are produced when combining into one chunk would result in a buffer overflow.
A reproducible example is given in a comment on #46633.
### What changes are included in this PR?
Change `Table::CombineChunks` and `pyarrow.Table.combine_chunks` documentation to specify that binary columns can be combined into multiple chunks.
### Are these changes tested?
No, they are only documentation changes.
### Are there any user-facing changes?
Yes, documentation changes.
* GitHub Issue: #46633
Authored-by: Akum Kang
Signed-off-by: AlenkaF
1 parent dc0f5a9, commit 4539dfb
2 files changed, +6 -0 lines changed
First changed file: 3 lines added at new lines 217-219 (diff content not captured).
Second changed file: 3 lines added at new lines 4513-4515 (diff content not captured).