Struct RunArray

pub struct RunArray<R>where
    R: RunEndIndexType,{
    data_type: DataType,
    run_ends: RunEndBuffer<<R as ArrowPrimitiveType>::Native>,
    values: Arc<dyn Array>,
}

Expand description

An array of run-end encoded values.

This encoding is variation on run-length encoding (RLE) and is good for representing data containing the same values repeated consecutively.

A RunArray consists of a run_ends buffer and a values array of equivalent lengths. The run_ends buffer stores the indexes at which the run ends. The values array stores the corresponding value of each run. The below example illustrates how a logical array is represented by a RunArray:

┌ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─┐
  ┌─────────────────┐  ┌─────────┐       ┌─────────────────┐
│ │        A        │  │    2    │ │     │        A        │
  ├─────────────────┤  ├─────────┤       ├─────────────────┤
│ │        D        │  │    3    │ │     │        A        │    run length of 'A' = runs_ends[0] - 0 = 2
  ├─────────────────┤  ├─────────┤       ├─────────────────┤
│ │        B        │  │    6    │ │     │        D        │    run length of 'D' = run_ends[1] - run_ends[0] = 1
  └─────────────────┘  └─────────┘       ├─────────────────┤
│        values          run_ends  │     │        B        │
                                         ├─────────────────┤
└ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─┘     │        B        │
                                         ├─────────────────┤
               RunArray                  │        B        │    run length of 'B' = run_ends[2] - run_ends[1] = 3
              length = 3                 └─────────────────┘

                                            Logical array
                                               Contents

Fields§

§data_type: DataType§run_ends: RunEndBuffer<<R as ArrowPrimitiveType>::Native>§values: Arc<dyn Array>

Implementations§

§

impl<R> RunArray<R>
where R: RunEndIndexType,

pub fn logical_len(run_ends: &PrimitiveArray<R>) -> usize

Calculates the logical length of the array encoded by treating the run_ends array as if it were a RunEndBuffer.

pub fn try_new( run_ends: &PrimitiveArray<R>, values: &dyn Array, ) -> Result<RunArray<R>, ArrowError>

Attempts to create a RunArray using the given run_ends and values.

§Errors

If run_ends and values have different lengths
If run_ends has any null values
If run_ends doesn’t consist of strictly increasing positive integers

pub fn run_ends(&self) -> &RunEndBuffer<<R as ArrowPrimitiveType>::Native>

Returns a reference to the RunEndBuffer.

pub fn values(&self) -> &Arc<dyn Array>

Returns a reference to the values array.

Any slicing of this RunArray array is not applied to the returned values here and must be handled separately.

pub fn get_start_physical_index(&self) -> usize

Returns the physical index at which the array slice starts.

See RunEndBuffer::get_start_physical_index.

pub fn get_end_physical_index(&self) -> usize

Returns the physical index at which the array slice ends.

See RunEndBuffer::get_end_physical_index.

pub fn downcast<V>(&self) -> Option<TypedRunArray<'_, R, V>>
where V: 'static,

Downcast this RunArray to a TypedRunArray

use arrow_array::{Array, ArrayAccessor, RunArray, StringArray, types::Int32Type};

let orig = [Some("a"), Some("b"), None];
let run_array = RunArray::<Int32Type>::from_iter(orig);
let typed = run_array.downcast::<StringArray>().unwrap();
assert_eq!(typed.value(0), "a");
assert_eq!(typed.value(1), "b");
assert!(typed.values().is_null(2));

pub fn get_physical_index(&self, logical_index: usize) -> usize

Calls RunEndBuffer::get_physical_index.

The result is arbitrary if logical_index >= self.len()

pub fn get_physical_indices( &self, logical_indices: &[I], ) -> Result<Vec<usize>, ArrowError>
where I: ArrowNativeType,

Returns the physical indices corresponding to the provided logical indices.

See RunEndBuffer::get_physical_indices for more details.

pub fn slice(&self, offset: usize, length: usize) -> RunArray<R>

Returns a zero-copy slice of this array with the indicated offset and length.

§Panics

Specified slice (offset + length) exceeds existing length

Trait Implementations§

§

impl<T> Array for RunArray<T>
where T: RunEndIndexType,

§

fn as_any(&self) -> &(dyn Any + 'static)

Returns the array as Any so that it can be downcasted to a specific implementation. Read more

§

fn to_data(&self) -> ArrayData

Returns the underlying data of this array

§

fn into_data(self) -> ArrayData

Returns the underlying data of this array Read more

§

fn data_type(&self) -> &DataType

Returns a reference to the DataType of this array. Read more

§

fn slice(&self, offset: usize, length: usize) -> Arc<dyn Array>

Returns a zero-copy slice of this array with the indicated offset and length. Read more

§

fn len(&self) -> usize

Returns the length (i.e., number of elements) of this array. Read more

§

fn is_empty(&self) -> bool

Returns whether this array is empty. Read more

§

fn shrink_to_fit(&mut self)

Shrinks the capacity of any exclusively owned buffer as much as possible Read more

§

fn offset(&self) -> usize

Returns the offset into the underlying data used by this array(-slice). Note that the underlying data can be shared by many arrays. This defaults to 0. Read more

§

fn nulls(&self) -> Option<&NullBuffer>

Returns the null buffer of this array if any. Read more

§

fn logical_nulls(&self) -> Option<NullBuffer>

Returns a potentially computed NullBuffer that represents the logical null values of this array, if any. Read more

§

fn is_nullable(&self) -> bool

Returns false if the array is guaranteed to not contain any logical nulls Read more

§

fn get_buffer_memory_size(&self) -> usize

Returns the total number of bytes of memory pointed to by this array. The buffers store bytes in the Arrow memory format, and include the data as well as the validity map. Note that this does not always correspond to the exact memory usage of an array, since multiple arrays can share the same buffers or slices thereof.

§

fn get_array_memory_size(&self) -> usize

Returns the total number of bytes of memory occupied physically by this array. This value will always be greater than returned by get_buffer_memory_size() and includes the overhead of the data structures that contain the pointers to the various buffers.

§

fn is_null(&self, index: usize) -> bool

Returns whether the element at index is null according to Array::nulls Read more

§

fn is_valid(&self, index: usize) -> bool

Returns whether the element at index is not null, the opposite of Self::is_null. Read more

§

fn null_count(&self) -> usize

Returns the total number of physical null values in this array. Read more

§

fn logical_null_count(&self) -> usize

Returns the total number of logical null values in this array. Read more

§

impl<R> Clone for RunArray<R>
where R: RunEndIndexType,

§

fn clone(&self) -> RunArray<R>

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

§

impl<R> Debug for RunArray<R>
where R: RunEndIndexType,

§

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

Formats the value using the given formatter. Read more

§

impl<R> From<ArrayData> for RunArray<R>
where R: RunEndIndexType,

§

fn from(data: ArrayData) -> RunArray<R>

Converts to this type from the input type.

§

impl<R> From<RunArray<R>> for ArrayData
where R: RunEndIndexType,

§

fn from(array: RunArray<R>) -> ArrayData

Converts to this type from the input type.

§

impl<'a, T> FromIterator<&'a str> for RunArray<T>
where T: RunEndIndexType,

Constructs a RunArray from an iterator of strings.

§Example:

use arrow_array::{RunArray, PrimitiveArray, StringArray, types::Int16Type};

let test = vec!["a", "a", "b", "c"];
let array: RunArray<Int16Type> = test.into_iter().collect();
assert_eq!(
    "RunArray {run_ends: [2, 3, 4], values: StringArray\n[\n  \"a\",\n  \"b\",\n  \"c\",\n]}\n",
    format!("{:?}", array)
);

§

fn from_iter(iter: I) -> RunArray<T>
where I: IntoIterator<Item = &'a str>,

Creates a value from an iterator. Read more

§

impl<'a, T> FromIterator<Option<&'a str>> for RunArray<T>
where T: RunEndIndexType,

Constructs a RunArray from an iterator of optional strings.

§Example:

use arrow_array::{RunArray, PrimitiveArray, StringArray, types::Int16Type};

let test = vec!["a", "a", "b", "c", "c"];
let array: RunArray<Int16Type> = test
    .iter()
    .map(|&x| if x == "b" { None } else { Some(x) })
    .collect();
assert_eq!(
    "RunArray {run_ends: [2, 3, 5], values: StringArray\n[\n  \"a\",\n  null,\n  \"c\",\n]}\n",
    format!("{:?}", array)
);