
Module writer


Arrow IPC File and Stream Writers

§Notes

FileWriter and StreamWriter have similar interfaces. However, files produced by FileWriter are expected to be read by a reader that supports seeking (Seek).

Structs§

CompressionContext
Additional context that may be needed for compression.
DictionaryTracker
Keeps track of dictionaries that have been written, to avoid emitting the same dictionary multiple times.
EncodedData
Stores the encoded data, which is a crate::Message, and optional Arrow data
FileWriter
Arrow File Writer
IpcDataGenerator
Handles low-level details of encoding [Array] and [Schema] into the Arrow IPC Format.
IpcMetadataBuilder 🔒
Accumulates the IPC metadata produced by write_array_data.
IpcWriteMetadata 🔒
Per-message sizes produced by IpcDataGenerator::write.
IpcWriteOptions
IPC write options used to control the behaviour of the IpcDataGenerator
StreamWriter
Arrow Stream Writer
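
The idea behind DictionaryTracker can be sketched with the standard library alone: remember the last dictionary written for each dictionary id, and report whether an incoming dictionary still needs to be emitted. The names `SimpleDictionaryTracker` and `insert`, the `bool` return value, and the use of `Vec<String>` for dictionary values are assumptions made for illustration; they are not the arrow-ipc types or signatures.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a dictionary tracker (not the arrow-ipc type).
struct SimpleDictionaryTracker {
    /// Last dictionary values written for each dictionary id.
    written: HashMap<i64, Vec<String>>,
}

impl SimpleDictionaryTracker {
    fn new() -> Self {
        Self { written: HashMap::new() }
    }

    /// Records `values` under `dict_id`.
    /// Returns `true` when the dictionary must be emitted (first sighting,
    /// or different from what was last written) and `false` when an
    /// identical dictionary was already written for this id.
    fn insert(&mut self, dict_id: i64, values: &[String]) -> bool {
        let unchanged = self
            .written
            .get(&dict_id)
            .map_or(false, |previous| previous.as_slice() == values);
        if unchanged {
            return false;
        }
        self.written.insert(dict_id, values.to_vec());
        true
    }
}
```

With this sketch, writing the same dictionary twice under one id yields `true` and then `false`, so a writer built on it would emit the dictionary message only once.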

Enums§

DictionaryComparison 🔒
Describes how two dictionary arrays compare to each other.
DictionaryHandling
Controls how dictionaries are handled in Arrow IPC messages
DictionaryUpdate
Describes what kind of update took place after a call to DictionaryTracker::insert.
EncodedBuffer 🔒
A single buffer segment ready to be written to the output stream.
IpcBodySink 🔒
Destination for the raw Arrow data bytes (the IPC message body) produced by write_array_data.
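
As a rough illustration of how two dictionaries might be compared before deciding what kind of update to send, the sketch below classifies a new dictionary as identical to the old one, as an extension of it (a delta), or as unrelated. The `Comparison` enum, its variants, and the `compare` function are hypothetical; the actual variants of DictionaryComparison and DictionaryUpdate are not listed on this page and may differ.

```rust
/// Hypothetical result of comparing a previously written dictionary
/// with a new one (not the arrow-ipc enum).
#[derive(Debug, PartialEq)]
enum Comparison {
    /// Both dictionaries hold the same values in the same order.
    Equal,
    /// The new dictionary starts with every old value and appends more;
    /// the payload is the number of appended values.
    Delta(usize),
    /// Anything else: the old dictionary cannot be reused.
    NotEqual,
}

/// Compares `old` and `new` element by element.
fn compare<T: PartialEq>(old: &[T], new: &[T]) -> Comparison {
    if old == new {
        Comparison::Equal
    } else if new.len() > old.len() && new.starts_with(old) {
        Comparison::Delta(new.len() - old.len())
    } else {
        Comparison::NotEqual
    }
}
```

Under these assumptions, a writer could skip the dictionary on `Equal`, send only the appended tail on `Delta`, and resend the whole dictionary on `NotEqual`.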

Constants§

PADDING 🔒

Functions§

append_variadic_buffer_counts 🔒
buffer_need_truncate 🔒
Whether to truncate the buffer
compare_dictionaries 🔒
encode_sink_buffer 🔒
Encodes a single Arrow [Buffer] into the IPC body and records its metadata.
estimate_encoded_buffer_count 🔒
Estimates the number of EncodedBuffer segments that write_array_data will produce for a column of the given type.
get_buffer_element_width 🔒
Returns byte width for a buffer spec. Only for BufferSpec::FixedWidth.
get_byte_array_buffers 🔒
Returns the values and offsets [Buffer] for a ByteArray with offset type O
get_list_array_buffers 🔒
Similar logic as get_byte_array_buffers() but slices the child array instead of a values buffer.
get_list_view_array_buffers 🔒
Returns the offsets, sizes, and child data buffers for a ListView array.
get_or_truncate_buffer 🔒
Returns the sliced views [Buffer] for a BinaryView/Utf8View array.
has_validity_bitmap 🔒
In V4, null types have no validity bitmap. In V5 and later, null and union types have no validity bitmap. Run-end encoded types have no validity bitmap.
into_zero_offset_run_array 🔒
pad_to_alignment 🔒
Calculate an alignment boundary and return the number of bytes needed to pad to the alignment boundary
reencode_offsets 🔒
Common functionality for re-encoding offsets. Returns the new offsets as well as original start offset and length for use in slicing child data.
unslice_run_array 🔒
write_array_data 🔒
Recursively encodes array_data into its IPC representation.
write_body_buffers 🔒
write_continuation 🔒
Write a record batch to the writer, writing the message size before the message if the record batch is being written to a stream
write_message
Write a message's IPC data and buffers, returning metadata and buffer data lengths written
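
The padding and continuation steps described above can be sketched as follows. This is a simplified, std-only illustration of the Arrow IPC stream framing (a 0xFFFFFFFF continuation marker, a little-endian 32-bit metadata length, the metadata bytes, then zero padding to the alignment boundary); it is not the crate's implementation. The function `write_framed`, the constant `CONTINUATION_MARKER`, and the signature shown for `pad_to_alignment` are assumptions for illustration, and the alignment is assumed to be a power of two.

```rust
use std::io::{self, Write};

/// Marker written ahead of the metadata length in the stream format.
const CONTINUATION_MARKER: [u8; 4] = [0xff; 4];

/// Returns how many bytes must be appended to `len` to reach the next
/// multiple of `alignment`. Assumes `alignment` is a power of two.
fn pad_to_alignment(alignment: usize, len: usize) -> usize {
    let mask = alignment - 1;
    ((len + mask) & !mask) - len
}

/// Hypothetical helper: frames `metadata` with the continuation marker and
/// a length prefix, pads the result to `alignment`, and returns the total
/// number of bytes written.
fn write_framed<W: Write>(
    writer: &mut W,
    metadata: &[u8],
    alignment: usize,
) -> io::Result<usize> {
    // 4 bytes of marker plus 4 bytes of length precede the metadata.
    let prefix_len = 8;
    let padding = pad_to_alignment(alignment, prefix_len + metadata.len());
    // The recorded length covers the metadata and its trailing padding.
    let recorded_len = (metadata.len() + padding) as i32;

    writer.write_all(&CONTINUATION_MARKER)?;
    writer.write_all(&recorded_len.to_le_bytes())?;
    writer.write_all(metadata)?;
    writer.write_all(&vec![0u8; padding])?;
    Ok(prefix_len + metadata.len() + padding)
}
```

Because the padding is computed over the prefix plus the metadata, the message body that follows always starts on an aligned offset, which is the reason for counting the 8 prefix bytes in the calculation.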