Flight is a general-purpose client-server framework for high performance transport of large datasets over network interfaces, built as part of the Apache Arrow project.
Flight allows for highly efficient data transfer as it:
- removes the need for serialization during data transfer
- allows for parallel data streaming
- is highly optimized to take advantage of Arrow’s columnar format.
The arrow package provides methods for connecting to Flight RPC servers to send and receive data.
It should be noted that the Flight implementation in the R package depends on PyArrow which is called via reticulate. This is quite different from the other capabilities in the R package, nearly all of which are all implemented directly.
You want to connect to a Flight server running on a specified host and port.
For an example of how to set up a Flight server from R, see the Flight vignette.
You want to send data that you have in memory to a Flight server
You want to see what paths are available on a Flight server.
You want to retrieve data on a Flight server from a specified path.
# Connect to the Flight server <- flight_connect(host = "127.0.0.1", port = 8089) local_client # Retrieve data flight_get( local_client,"pollution_data" )
# Table # 153 rows x 6 columns # $Ozone <int32> # $Solar.R <int32> # $Wind <double> # $Temp <int32> # $Month <int32> # $Day <int32> # # See $metadata for additional Schema metadata