Skip to contents

Hive partitioning embeds field names and values in path segments, such as "/year=2019/month=2/data.parquet".

Usage

hive_partition(..., null_fallback = NULL, segment_encoding = "uri")

Arguments

...

named list of data types, passed to schema()

null_fallback

character to be used in place of missing values (NA or NULL) in partition columns. Default is "__HIVE_DEFAULT_PARTITION__", which is what Hive uses.

segment_encoding

Decode partition segments after splitting paths. Default is "uri" (URI-decode segments). May also be "none" (leave as-is).

Value

A HivePartitioning, or a HivePartitioningFactory if calling hive_partition() with no arguments.

Details

Because fields are named in the path segments, order of fields passed to hive_partition() does not matter.

Examples

hive_partition(year = int16(), month = int8())
#> HivePartitioning