pyarrow.FileReader

class pyarrow.FileReader(source, footer_offset=None)[source]

Class for reading Arrow record batch data from the Arrow binary file format

Parameters:
  • source (str, pyarrow.NativeFile, or file-like Python object) – Either a file path, or a readable file object
  • footer_offset (int, default None) – If the file is embedded in some larger file, this is the byte offset to the very end of the file data
__init__(source, footer_offset=None)[source]

Methods

__init__(source[, footer_offset])
get_batch(self, int i)
get_record_batch _FileReader.get_batch(self, int i)
read_all(self) Read all record batches as a pyarrow.Table

Attributes

num_record_batches