Expand description
ParquetRecordBatchStreamBuilder
: async
API for reading Parquet files as
[RecordBatch
]es
This can be used to decode a Parquet file in streaming fashion (without downloading the whole file at once) from a remote source, such as an object store.
See example on ParquetRecordBatchStreamBuilder::new
Modulesยง
Structsยง
- Column
Chunk ๐Iterator - Implements
PageIterator
for a single column chunk, yielding a singlePageReader
- InMemory
RowGroup ๐ - An in-memory collection of column chunks
- Metadata
Loader - An asynchronous interface to load
ParquetMetaData
from an async source - Parquet
Object Reader - Reads Parquet files in object storage using [
ObjectStore
]. - Parquet
Record Batch Stream - An asynchronous
Stream
of [RecordBatch
] constructed usingParquetRecordBatchStreamBuilder
to read parquet files. - Reader
Factory ๐ ReaderFactory
is used byParquetRecordBatchStream
to createParquetRecordBatchReader
Enumsยง
- Column
Chunk ๐Data - An in-memory column chunk
- Stream
State ๐
Traitsยง
- Async
File Reader - The asynchronous interface used by
ParquetRecordBatchStream
to read parquet files - Metadata
Fetch - A data source that can be used with
MetadataLoader
to loadParquetMetaData
Functionsยง
- fetch_
parquet_ metadata Deprecated - Fetches parquet metadata
Type Aliasesยง
- Parquet
Record Batch Stream Builder - A builder for reading parquet files from an
async
source asParquetRecordBatchStream
- Read
Result ๐