pyarrow.compute.tdigest

pyarrow.compute.tdigest(array, *, memory_pool=None, options=None, q=0.5, delta=100, buffer_size=500, skip_nulls=True, min_count=0)

Approximate quantiles of a numeric array with T-Digest algorithm.

By default, 0.5 quantile (median) is returned. Nulls and NaNs are ignored. An array of nulls is returned if there is no valid data point.

Parameters
  • array (Array-like) – Argument to compute function

  • memory_pool (pyarrow.MemoryPool, optional) – If not passed, will allocate memory from the default memory pool.

  • options (pyarrow.compute.TDigestOptions, optional) – Parameters altering compute function semantics.

  • q (optional) – Parameter for TDigestOptions constructor. Either options or q can be passed, but not both at the same time.

  • delta (optional) – Parameter for TDigestOptions constructor. Either options or delta can be passed, but not both at the same time.

  • buffer_size (optional) – Parameter for TDigestOptions constructor. Either options or buffer_size can be passed, but not both at the same time.

  • skip_nulls (optional) – Parameter for TDigestOptions constructor. Either options or skip_nulls can be passed, but not both at the same time.

  • min_count (optional) – Parameter for TDigestOptions constructor. Either options or min_count can be passed, but not both at the same time.