Module async_reader


async API for reading Parquet files as [RecordBatch]es

See the crate-level documentation for more details.

See the example on ParquetRecordBatchStreamBuilder::new.

Modules

metadata 🔒
store 🔒

Structs

ColumnChunkIterator 🔒
Implements PageIterator for a single column chunk, yielding a single PageReader
InMemoryRowGroup 🔒
An in-memory collection of column chunks
ParquetObjectReader
Reads Parquet files in object storage using [ObjectStore].
ParquetRecordBatchStream
An asynchronous Stream of [RecordBatch] constructed using ParquetRecordBatchStreamBuilder to read Parquet files.
ReaderFactory 🔒
ReaderFactory is used by ParquetRecordBatchStream to create ParquetRecordBatchReader

Enums

ColumnChunkData 🔒
An in-memory column chunk
StreamState 🔒

Traits

AsyncFileReader
The asynchronous interface used by ParquetRecordBatchStream to read Parquet files
MetadataFetch
A data source that can be used with ParquetMetaDataReader to load ParquetMetaData
MetadataSuffixFetch
A data source that can be used with ParquetMetaDataReader to load ParquetMetaData via suffix requests, without knowing the file size
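To illustrate why a suffix request is enough to locate the metadata without knowing the file size, here is a minimal sketch using only the standard library. It relies on the Parquet footer layout, where an unencrypted file ends with the serialized metadata, a 4-byte little-endian metadata length, and the magic bytes `PAR1`. The function names `metadata_len_from_suffix` and `metadata_from_suffix` are hypothetical helpers for this illustration and are not part of the parquet crate; the crate's own logic lives in ParquetMetaDataReader.

```rust
/// Magic bytes that terminate an unencrypted Parquet file.
const PARQUET_MAGIC: &[u8; 4] = b"PAR1";
/// Size of the footer tail: 4-byte little-endian metadata length + 4-byte magic.
const FOOTER_TAIL_LEN: usize = 8;

/// Hypothetical helper: given the last bytes of a file (a "suffix"),
/// return the length in bytes of the serialized metadata.
fn metadata_len_from_suffix(suffix: &[u8]) -> Result<usize, String> {
    if suffix.len() < FOOTER_TAIL_LEN {
        return Err(format!(
            "need at least {} bytes, got {}",
            FOOTER_TAIL_LEN,
            suffix.len()
        ));
    }
    // The footer tail is always the final 8 bytes of the suffix.
    let tail = &suffix[suffix.len() - FOOTER_TAIL_LEN..];
    if tail[4..] != PARQUET_MAGIC[..] {
        return Err("missing PAR1 magic at end of file".to_string());
    }
    let len = u32::from_le_bytes([tail[0], tail[1], tail[2], tail[3]]);
    Ok(len as usize)
}

/// Hypothetical helper: return the serialized metadata bytes if the suffix
/// already contains them, or `None` if a larger suffix must be fetched.
fn metadata_from_suffix(suffix: &[u8]) -> Result<Option<&[u8]>, String> {
    let metadata_len = metadata_len_from_suffix(suffix)?;
    let needed = metadata_len
        .checked_add(FOOTER_TAIL_LEN)
        .ok_or_else(|| "metadata length overflows usize".to_string())?;
    if suffix.len() < needed {
        // The caller would issue a second suffix request of `needed` bytes.
        return Ok(None);
    }
    let end = suffix.len() - FOOTER_TAIL_LEN;
    Ok(Some(&suffix[end - metadata_len..end]))
}
```

In this sketch a caller would first fetch a small suffix, read the metadata length from its last 8 bytes, and fetch a longer suffix only if the first one turned out to be too short. At no point is the total file size required.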

Type Aliases

ParquetRecordBatchStreamBuilder
A builder for reading Parquet files from an async source as ParquetRecordBatchStream
ReadResult 🔒
Returns a ReaderFactory and an optional ParquetRecordBatchReader for the next row group