Expand description
Contains writer which writes arrow data into parquet data.
ModulesΒ§
- byte_
array π - levels π
- Parquet definition and repetition levels
StructsΒ§
- Arrow
Column Chunk - The data for a single column chunk, see
ArrowColumnWriter - Arrow
Column πChunk Data - A single column chunk produced by
ArrowColumnWriter. - Arrow
Column Writer - Encodes
ArrowLeafColumntoArrowColumnChunk - Arrow
Column πWriter Factory - Creates
ArrowColumnWriterinstances - Arrow
Leaf Column - A leaf column that can be encoded by
ArrowColumnWriter - Arrow
Page πWriter - Arrow
RowGroup πWriter - Encodes [
RecordBatch] to a parquet row group - Arrow
RowGroup Writer Factory - Factory that creates new column writers for each row group in the Parquet file.
- Arrow
Writer - Encodes [
RecordBatch] to parquet - Arrow
Writer Options - Arrow-specific configuration settings for writing parquet files.
- InMemory
Page Store - The default
PageStore, holding blobs on the heap in aVec<Bytes>. - InMemory
Page Store Factory - Factory for
InMemoryPageStoreβ the default used byArrowWriter. - PageKey
- An opaque, store-allocated handle to a blob held by a
PageStore. - Page
Store Args - Context for a single
PageStoreFactory::createcall. - Streaming
Column πChunk Reader - A streaming
Readover one column chunkβs buffered pages, in final file order: the dictionary page (if any) first, then the data pages.
EnumsΒ§
TraitsΒ§
- Page
Store - A pluggable store for completed, serialized page blobs.
- Page
Store Factory - Creates a fresh
PageStorefor each column chunk.
FunctionsΒ§
- compute_
leaves - Computes the
ArrowLeafColumnfor a potentially nested [ArrayRef] - get_
bool_ πarray_ slice - get_
column_ writers Deprecated - Returns
ArrowColumnWriters for each column in a given schema - get_
decimal_ π32_ array_ slice - get_
decimal_ π64_ array_ slice - get_
decimal_ π128_ array_ slice - get_
decimal_ π256_ array_ slice - get_
float_ π16_ array_ slice - get_
fsb_ πarray_ slice - get_
interval_ πdt_ array_ slice - Returns 12-byte values representing 3 values of months, days and milliseconds (4-bytes each). An Arrow DayTime interval only stores days and millis, thus the first 4 bytes are not populated.
- get_
interval_ πym_ array_ slice - Returns 12-byte values representing 3 values of months, days and milliseconds (4-bytes each). An Arrow YearMonth interval only stores months, thus only the first 4 bytes are populated.
- write_
leaf π - write_
primitive π
Type AliasesΒ§
- Shared
Column πChunk - A shared
ArrowColumnChunkData