pyarrow.compute.hash_distinct¶
-
pyarrow.compute.
hash_distinct
(array, group_id_array, *, memory_pool=None, options=None, mode='only_valid')¶ Keep the distinct values in each group.
Whether nulls/values are kept is controlled by CountOptions. NaNs and signed zeroes are not normalized.
- Parameters
array (Array-like or scalar-like) – Argument to compute function
group_id_array (Array-like or scalar-like) – Argument to compute function
memory_pool (pyarrow.MemoryPool, optional) – If not passed, will allocate memory from the default memory pool.
options (pyarrow.compute.CountOptions, optional) – Parameters altering compute function semantics.
mode (optional) – Parameter for CountOptions constructor. Either options or mode can be passed, but not both at the same time.