8 Using PyArrow from R

8.1 Introduction

For more information on using setting up and installing PyArrow to use in R, see the “Apache Arrow in Python and R with reticulate” vignette.

8.2 Create an Arrow object using PyArrow in R

You want to use PyArrow to create an Arrow object in an R session.

8.2.1 Solution

library(reticulate)
pa <- import("pyarrow")
pyarrow_scalar <- pa$scalar(42)
pyarrow_scalar
## <pyarrow.DoubleScalar: 42.0>

8.3 Call a PyArrow function from R

You want to call a PyArrow function from your R session.

8.3.1 Solution

table_1 <- arrow_table(mtcars[1:5,])
table_2 <- arrow_table(mtcars[11:15,])

pa$concat_tables(tables = list(table_1, table_2)) %>%
  collect()
## # A tibble: 10 × 11
##      mpg   cyl  disp    hp  drat    wt  qsec    vs    am  gear  carb
##    <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
##  1  21       6  160    110  3.9   2.62  16.5     0     1     4     4
##  2  21       6  160    110  3.9   2.88  17.0     0     1     4     4
##  3  22.8     4  108     93  3.85  2.32  18.6     1     1     4     1
##  4  21.4     6  258    110  3.08  3.22  19.4     1     0     3     1
##  5  18.7     8  360    175  3.15  3.44  17.0     0     0     3     2
##  6  17.8     6  168.   123  3.92  3.44  18.9     1     0     4     4
##  7  16.4     8  276.   180  3.07  4.07  17.4     0     0     3     3
##  8  17.3     8  276.   180  3.07  3.73  17.6     0     0     3     3
##  9  15.2     8  276.   180  3.07  3.78  18       0     0     3     3
## 10  10.4     8  472    205  2.93  5.25  18.0     0     0     3     4