pyarrow.ipc.RecordBatchStreamReader
-
class pyarrow.ipc.RecordBatchStreamReader(source)
Bases: pyarrow.lib._RecordBatchStreamReader, pyarrow.ipc._ReadPandasOption
Reader for the Arrow streaming binary format.
- Parameters
source (bytes/buffer-like, pyarrow.NativeFile, or file-like Python object) – Either an in-memory buffer, or a readable file object.
Methods
__init__(source) – Initialize self.
get_next_batch(self)
read_all(self) – Read all record batches as a pyarrow.Table.
read_next_batch(self) – Read next RecordBatch from the stream.
read_pandas(**options) – Read contents of stream to a pandas.DataFrame.
Attributes
schema
-
get_next_batch(self)
-
read_all(self)
Read all record batches as a pyarrow.Table.
-
read_next_batch(self)
Read next RecordBatch from the stream.
- Raises
StopIteration – At end of stream.
-
read_pandas(**options)
Read contents of stream to a pandas.DataFrame.
Reads all record batches as a pyarrow.Table, then converts it to a pandas.DataFrame using Table.to_pandas.
- Parameters
**options – Arguments to forward to Table.to_pandas.
- Returns
df (pandas.DataFrame)
-
schema