pyarrow.parquet.read_table(source, columns=None, nthreads=1, metadata=None, use_pandas_metadata=False)

Read a Table from Parquet format

  • source (str or – Location of Parquet dataset. If a string is passed, it can be a single file name or a directory name. For passing Python file objects or byte buffers, see or
  • columns (list) – If not None, only these columns will be read from the file. A column name may be a prefix of a nested field, e.g. ‘a’ will select ‘a.b’, ‘a.c’, and ‘a.d.e’
  • nthreads (int, default 1) – Number of columns to read in parallel. Requires that the underlying file source is thread-safe
  • metadata (FileMetaData) – File metadata to reuse, if it has already been computed separately
  • use_pandas_metadata (boolean, default False) – If True and file has custom pandas schema metadata, ensure that index columns are also loaded

pyarrow.Table – Content of the file as a table (of columns)