pyarrow.dataset.CsvFileFormat

class pyarrow.dataset.CsvFileFormat(ParseOptions parse_options=None, default_fragment_scan_options=None, ConvertOptions convert_options=None, ReadOptions read_options=None)

Bases: pyarrow._dataset.FileFormat

FileFormat for CSV files.

Parameters
  • parse_options (ParseOptions) – Options regarding CSV parsing.

  • convert_options (ConvertOptions) – Options regarding value conversion.

  • read_options (ReadOptions) – General read options.

  • default_fragment_scan_options (CsvFragmentScanOptions) – Default options used when scanning fragments.

Methods

__init__(*args, **kwargs)

Initialize self.

equals(self, CsvFileFormat other)

inspect(self, file[, filesystem])

Infer the schema of a file.

make_fragment(self, file[, filesystem])

Make a FileFragment of this FileFormat.

make_write_options(self, **kwargs)

Attributes

default_extname

default_fragment_scan_options

parse_options

default_extname

default_fragment_scan_options

equals(self, CsvFileFormat other)

inspect(self, file, filesystem=None)

Infer the schema of a file.

make_fragment(self, file, filesystem=None, Expression partition_expression=None)

Make a FileFragment of this FileFormat. The filter may not reference fields absent from the provided schema. If no schema is provided then one will be inferred.

make_write_options(self, **kwargs)

parse_options