-
Notifications
You must be signed in to change notification settings - Fork 1k
Undeprecate ArrowWriter::into_serialized_writer
and add docs
#8621
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -450,11 +450,11 @@ impl<W: Write + Send> ArrowWriter<W> { | |
} | ||
|
||
/// Converts this writer into a lower-level [`SerializedFileWriter`] and [`ArrowRowGroupWriterFactory`]. | ||
/// This can be useful to provide more control over how files are written. | ||
#[deprecated( | ||
since = "57.0.0", | ||
note = "Construct a `SerializedFileWriter` and `ArrowRowGroupWriterFactory` directly instead" | ||
)] | ||
/// | ||
/// Flushes any outstanding data before returning. | ||
/// | ||
/// This can be useful to provide more control over how files are written, for example | ||
/// to write columns in parallel. See the example on [`ArrowColumnWriter`]. | ||
pub fn into_serialized_writer( | ||
mut self, | ||
) -> Result<(SerializedFileWriter<W>, ArrowRowGroupWriterFactory)> { | ||
|
@@ -872,6 +872,11 @@ impl ArrowColumnWriter { | |
} | ||
|
||
/// Encodes [`RecordBatch`] to a parquet row group | ||
/// | ||
/// You can create this structure via an [`ArrowRowGroupWriterFactory`] | ||
/// | ||
/// See the example on [`ArrowColumnWriter`] for how to encode columns in parallel | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
( There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Thanks -- I tried to improve the comments in 619210d |
||
#[derive(Debug)] | ||
struct ArrowRowGroupWriter { | ||
writers: Vec<ArrowColumnWriter>, | ||
schema: SchemaRef, | ||
|
@@ -907,6 +912,10 @@ impl ArrowRowGroupWriter { | |
} | ||
|
||
/// Factory that creates new column writers for each row group in the Parquet file. | ||
/// | ||
/// You can create this structure via an [`ArrowWriter::into_serialized_writer`]. | ||
/// See the example on [`ArrowColumnWriter`] for how to encode columns in parallel | ||
#[derive(Debug)] | ||
pub struct ArrowRowGroupWriterFactory { | ||
schema: SchemaDescPtr, | ||
arrow_schema: SchemaRef, | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tried to drop a few breadcrumbs to find @adamreeve 's new example in #8582