Module org.apache.arrow.dataset
Class CsvFragmentScanOptions
java.lang.Object
org.apache.arrow.dataset.scanner.csv.CsvFragmentScanOptions
- All Implemented Interfaces:
FragmentScanOptions
-
Constructor Summary
ConstructorsConstructorDescriptionCsvFragmentScanOptions
(CsvConvertOptions convertOptions, Map<String, String> readOptions, Map<String, String> parseOptions) CSV scan options, map to CPP struct CsvFragmentScanOptions. -
Method Summary
Modifier and TypeMethodDescriptionFile format.String[]
This is an internal function to invoke by serializer.
-
Constructor Details
-
CsvFragmentScanOptions
public CsvFragmentScanOptions(CsvConvertOptions convertOptions, Map<String, String> readOptions, Map<String, String> parseOptions) CSV scan options, map to CPP struct CsvFragmentScanOptions. The key in config map is the field name of mapping cpp structCurrently, multi-valued options (which are std::vector values in C++) only support having a single value set. For example, for the null_values option, only one string can be set as the null value.
- Parameters:
convertOptions
- similar to CsvFragmentScanOptions#convert_options in CPP, the ArrowSchema represents column_types, convert data option such as null value recognition.readOptions
- similar to CsvFragmentScanOptions#read_options in CPP, specify how to read the file such as block_sizeparseOptions
- similar to CsvFragmentScanOptions#parse_options in CPP, parse file option such as delimiter
-
-
Method Details
-
fileFormat
File format.- Specified by:
fileFormat
in interfaceFragmentScanOptions
- Returns:
- file format.
-
serialize
This is an internal function to invoke by serializer. Serialize this class to string array and then called by JNI call.- Specified by:
serialize
in interfaceFragmentScanOptions
- Returns:
- string array as Map JNI bridge format.
-
getConvertOptions
-
getReadOptions
-
getParseOptions
-