pyarrow.input_stream¶
- pyarrow.input_stream(source, compression='detect', buffer_size=None)¶
Create an Arrow input stream.
- Parameters:
- source
str
,Path
, buffer, or file-like object The source to open for reading.
- compression
str
optional, default ‘detect’ The compression algorithm to use for on-the-fly decompression. If “detect” and source is a file path, then compression will be chosen based on the file extension. If None, no compression will be applied. Otherwise, a well-known algorithm name must be supplied (e.g. “gzip”).
- buffer_size
int
, defaultNone
If None or 0, no buffering will happen. Otherwise the size of the temporary read buffer.
- source
Examples
Create a readable BufferReader (NativeFile) from a Buffer or a memoryview object:
>>> import pyarrow as pa >>> buf = memoryview(b"some data") >>> with pa.input_stream(buf) as stream: ... stream.read(4) ... b'some'
Create a readable OSFile (NativeFile) from a string or file path:
>>> import gzip >>> with gzip.open('example.gz', 'wb') as f: ... f.write(b'some data') ... 9 >>> with pa.input_stream('example.gz') as stream: ... stream.read() ... b'some data'
Create a readable PythonFile (NativeFile) from a a Python file object:
>>> with open('example.txt', mode='w') as f: ... f.write('some text') ... 9 >>> with pa.input_stream('example.txt') as stream: ... stream.read(6) ... b'some t'