pyarrow.parquet.read_pandas

pyarrow.parquet.read_pandas(source, columns=None, nthreads=1, metadata=None)

Read a Table from Parquet format, including the DataFrame index values if they are recorded in the file metadata.

Parameters:
  • source (str or pyarrow.io.NativeFile) – Location of Parquet dataset. If a string is passed, it can be a single file name. To pass Python file objects or byte buffers, see pyarrow.io.PythonFileInterface or pyarrow.io.BufferReader.
  • columns (list) – If not None, only these columns will be read from the file.
  • nthreads (int, default 1) – Number of columns to read in parallel. Requires that the underlying file source is thread-safe.
  • metadata (FileMetaData) – Existing file metadata to use, if it has been computed separately.
Returns:

pyarrow.Table – Content of the file as a Table of Columns, including DataFrame indexes as Columns.