arrow_array::array::struct_array

Struct StructArray

Source
pub struct StructArray {
    len: usize,
    data_type: DataType,
    nulls: Option<NullBuffer>,
    fields: Vec<ArrayRef>,
}
Expand description

An array of structs

Each child (called field) is represented by a separate array.

§Comparison with RecordBatch

Both RecordBatch and StructArray represent a collection of columns / arrays with the same length.

However, there are a couple of key differences:

  • StructArray can be nested within other Array, including itself
  • RecordBatch can contain top-level metadata on its associated [Schema][arrow_schema::Schema]
  • StructArray can contain top-level nulls, i.e. null
  • RecordBatch can only represent nulls in its child columns, i.e. {"field": null}

StructArray is therefore a more general data container than RecordBatch, and as such code that needs to handle both will typically share an implementation in terms of StructArray and convert to/from RecordBatch as necessary.

From implementations are provided to facilitate this conversion, however, converting from a StructArray containing top-level nulls to a RecordBatch will panic, as there is no way to preserve them.

§Example: Create an array from a vector of fields

use std::sync::Arc;
use arrow_array::{Array, ArrayRef, BooleanArray, Int32Array, StructArray};
use arrow_schema::{DataType, Field};

let boolean = Arc::new(BooleanArray::from(vec![false, false, true, true]));
let int = Arc::new(Int32Array::from(vec![42, 28, 19, 31]));

let struct_array = StructArray::from(vec![
    (
        Arc::new(Field::new("b", DataType::Boolean, false)),
        boolean.clone() as ArrayRef,
    ),
    (
        Arc::new(Field::new("c", DataType::Int32, false)),
        int.clone() as ArrayRef,
    ),
]);
assert_eq!(struct_array.column(0).as_ref(), boolean.as_ref());
assert_eq!(struct_array.column(1).as_ref(), int.as_ref());
assert_eq!(4, struct_array.len());
assert_eq!(0, struct_array.null_count());
assert_eq!(0, struct_array.offset());

Fields§

§len: usize§data_type: DataType§nulls: Option<NullBuffer>§fields: Vec<ArrayRef>

Implementations§

Source§

impl StructArray

Source

pub fn new( fields: Fields, arrays: Vec<ArrayRef>, nulls: Option<NullBuffer>, ) -> Self

Create a new StructArray from the provided parts, panicking on failure

§Panics

Panics if Self::try_new returns an error

Source

pub fn try_new( fields: Fields, arrays: Vec<ArrayRef>, nulls: Option<NullBuffer>, ) -> Result<Self, ArrowError>

Create a new StructArray from the provided parts, returning an error on failure

§Errors

Errors if

  • fields.len() != arrays.len()
  • fields[i].data_type() != arrays[i].data_type()
  • arrays[i].len() != arrays[j].len()
  • arrays[i].len() != nulls.len()
  • !fields[i].is_nullable() && !nulls.contains(arrays[i].nulls())
Source

pub fn new_null(fields: Fields, len: usize) -> Self

Create a new StructArray of length len where all values are null

Source

pub unsafe fn new_unchecked( fields: Fields, arrays: Vec<ArrayRef>, nulls: Option<NullBuffer>, ) -> Self

Create a new StructArray from the provided parts without validation

§Safety

Safe if Self::new would not panic with the given arguments

Source

pub fn new_empty_fields(len: usize, nulls: Option<NullBuffer>) -> Self

Create a new StructArray containing no fields

§Panics

If len != nulls.len()

Source

pub fn into_parts(self) -> (Fields, Vec<ArrayRef>, Option<NullBuffer>)

Deconstruct this array into its constituent parts

Source

pub fn column(&self, pos: usize) -> &ArrayRef

Returns the field at pos.

Source

pub fn num_columns(&self) -> usize

Return the number of fields in this struct array

Source

pub fn columns(&self) -> &[ArrayRef]

Returns the fields of the struct array

Source

pub fn column_names(&self) -> Vec<&str>

Return field names in this struct array

Source

pub fn fields(&self) -> &Fields

Returns the [Fields] of this StructArray

Source

pub fn column_by_name(&self, column_name: &str) -> Option<&ArrayRef>

Return child array whose field name equals to column_name

Note: A schema can currently have duplicate field names, in which case the first field will always be selected. This issue will be addressed in ARROW-11178

Source

pub fn slice(&self, offset: usize, len: usize) -> Self

Returns a zero-copy slice of this array with the indicated offset and length.

Trait Implementations§

Source§

impl Array for StructArray

Source§

fn as_any(&self) -> &dyn Any

Returns the array as Any so that it can be downcasted to a specific implementation. Read more
Source§

fn to_data(&self) -> ArrayData

Returns the underlying data of this array
Source§

fn into_data(self) -> ArrayData

Returns the underlying data of this array Read more
Source§

fn data_type(&self) -> &DataType

Returns a reference to the [DataType] of this array. Read more
Source§

fn slice(&self, offset: usize, length: usize) -> ArrayRef

Returns a zero-copy slice of this array with the indicated offset and length. Read more
Source§

fn len(&self) -> usize

Returns the length (i.e., number of elements) of this array. Read more
Source§

fn is_empty(&self) -> bool

Returns whether this array is empty. Read more
Source§

fn shrink_to_fit(&mut self)

Shrinks the capacity of any exclusively owned buffer as much as possible Read more
Source§

fn offset(&self) -> usize

Returns the offset into the underlying data used by this array(-slice). Note that the underlying data can be shared by many arrays. This defaults to 0. Read more
Source§

fn nulls(&self) -> Option<&NullBuffer>

Returns the null buffer of this array if any. Read more
Source§

fn logical_null_count(&self) -> usize

Returns the total number of logical null values in this array. Read more
Source§

fn get_buffer_memory_size(&self) -> usize

Returns the total number of bytes of memory pointed to by this array. The buffers store bytes in the Arrow memory format, and include the data as well as the validity map. Note that this does not always correspond to the exact memory usage of an array, since multiple arrays can share the same buffers or slices thereof.
Source§

fn get_array_memory_size(&self) -> usize

Returns the total number of bytes of memory occupied physically by this array. This value will always be greater than returned by get_buffer_memory_size() and includes the overhead of the data structures that contain the pointers to the various buffers.
Source§

fn logical_nulls(&self) -> Option<NullBuffer>

Returns a potentially computed [NullBuffer] that represents the logical null values of this array, if any. Read more
Source§

fn is_null(&self, index: usize) -> bool

Returns whether the element at index is null according to Array::nulls Read more
Source§

fn is_valid(&self, index: usize) -> bool

Returns whether the element at index is not null, the opposite of Self::is_null. Read more
Source§

fn null_count(&self) -> usize

Returns the total number of physical null values in this array. Read more
Source§

fn is_nullable(&self) -> bool

Returns false if the array is guaranteed to not contain any logical nulls Read more
Source§

impl Clone for StructArray

Source§

fn clone(&self) -> StructArray

Returns a copy of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for StructArray

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl From<&StructArray> for RecordBatch

Source§

fn from(struct_array: &StructArray) -> Self

Converts to this type from the input type.
Source§

impl From<(Vec<(Arc<Field>, Arc<dyn Array>)>, Buffer)> for StructArray

Source§

fn from(pair: (Vec<(FieldRef, ArrayRef)>, Buffer)) -> Self

Converts to this type from the input type.
Source§

impl From<ArrayData> for StructArray

Source§

fn from(data: ArrayData) -> Self

Converts to this type from the input type.
Source§

impl From<RecordBatch> for StructArray

Source§

fn from(value: RecordBatch) -> Self

Converts to this type from the input type.
Source§

impl From<StructArray> for ArrayData

Source§

fn from(array: StructArray) -> Self

Converts to this type from the input type.
Source§

impl From<StructArray> for RecordBatch

Source§

fn from(value: StructArray) -> Self

Converts to this type from the input type.
Source§

impl From<Vec<(Arc<Field>, Arc<dyn Array>)>> for StructArray

Source§

fn from(v: Vec<(FieldRef, ArrayRef)>) -> Self

Converts to this type from the input type.
Source§

impl Index<&str> for StructArray

Source§

fn index(&self, name: &str) -> &Self::Output

Get a reference to a column’s array by name.

Note: A schema can currently have duplicate field names, in which case the first field will always be selected. This issue will be addressed in ARROW-11178

§Panics

Panics if the name is not in the schema.

Source§

type Output = Arc<dyn Array>

The returned type after indexing.
Source§

impl PartialEq for StructArray

Source§

fn eq(&self, other: &Self) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl TryFrom<Vec<(&str, Arc<dyn Array>)>> for StructArray

Source§

fn try_from(values: Vec<(&str, ArrayRef)>) -> Result<Self, ArrowError>

builds a StructArray from a vector of names and arrays.

Source§

type Error = ArrowError

The type returned in the event of a conversion error.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dst: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst. Read more
Source§

impl<T> Datum for T
where T: Array,

Source§

fn get(&self) -> (&dyn Array, bool)

Returns the value for this Datum and a boolean indicating if the value is scalar
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.