Module async_reader

Source
Expand description

ParquetRecordBatchStreamBuilder: async API for reading Parquet files as [RecordBatch]es

This can be used to decode a Parquet file in streaming fashion (without downloading the whole file at once) from a remote source, such as an object store.

See example on ParquetRecordBatchStreamBuilder::new

Modulesยง

metadata ๐Ÿ”’
store ๐Ÿ”’

Structsยง

ColumnChunkIterator ๐Ÿ”’
Implements PageIterator for a single column chunk, yielding a single PageReader
InMemoryRowGroup ๐Ÿ”’
An in-memory collection of column chunks
MetadataLoader
An asynchronous interface to load ParquetMetaData from an async source
ParquetObjectReader
Reads Parquet files in object storage using [ObjectStore].
ParquetRecordBatchStream
An asynchronous Streamof [RecordBatch] constructed using ParquetRecordBatchStreamBuilder to read parquet files.
ReaderFactory ๐Ÿ”’
ReaderFactory is used by ParquetRecordBatchStream to create ParquetRecordBatchReader

Enumsยง

ColumnChunkData ๐Ÿ”’
An in-memory column chunk
StreamState ๐Ÿ”’

Traitsยง

AsyncFileReader
The asynchronous interface used by ParquetRecordBatchStream to read parquet files
MetadataFetch
A data source that can be used with MetadataLoader to load ParquetMetaData

Functionsยง

fetch_parquet_metadataDeprecated
Fetches parquet metadata

Type Aliasesยง

ParquetRecordBatchStreamBuilder
A builder for reading parquet files from an async source as ParquetRecordBatchStream
ReadResult ๐Ÿ”’