pyarrow.dataset.ParquetFileFormat

class pyarrow.dataset.ParquetFileFormat(read_options=None, default_fragment_scan_options=None, **kwargs)

Bases: pyarrow._dataset.FileFormat

FileFormat for Parquet

Parameters
  • read_options (ParquetReadOptions) – Read options for the file.

  • default_fragment_scan_options (ParquetFragmentScanOptions) – Scan Options for the file.

  • **kwargs (dict) – Additional options for read option or scan option.

__init__(*args, **kwargs)

Initialize self. See help(type(self)) for accurate signature.

Methods

__init__(*args, **kwargs)

Initialize self.

equals(self, ParquetFileFormat other)

inspect(self, file[, filesystem])

Infer the schema of a file.

make_fragment(self, file[, filesystem, …])

make_write_options(self, **kwargs)

Attributes

default_extname

default_fragment_scan_options

read_options

default_extname
default_fragment_scan_options
equals(self, ParquetFileFormat other)
inspect(self, file, filesystem=None)

Infer the schema of a file.

make_fragment(self, file, filesystem=None, Expression partition_expression=None, row_groups=None)
make_write_options(self, **kwargs)
read_options