Hive partitioning embeds field names and values in path segments, such as "/year=2019/month=2/data.parquet".
hive_partition(..., null_fallback = NULL, segment_encoding = "uri")
named list of data types, passed to schema()
character to be used in place of missing values (NA
or NULL
)
in partition columns. Default is "__HIVE_DEFAULT_PARTITION__"
,
which is what Hive uses.
Decode partition segments after splitting paths.
Default is "uri"
(URI-decode segments). May also be "none"
(leave as-is).
A HivePartitioning, or a HivePartitioningFactory
if
calling hive_partition()
with no arguments.
Because fields are named in the path segments, order of fields passed to
hive_partition()
does not matter.