Arrow Flight RPC¶
Arrow Flight is an RPC framework for high-performance data services based on Arrow data, and is built on top of gRPC and the IPC format.
Flight is organized around streams of Arrow record batches, being either downloaded from or uploaded to another service. A set of metadata methods offers discovery and introspection of streams, as well as the ability to implement application-specific methods.
Methods and message wire formats are defined by Protobuf, enabling interoperability with clients that may support gRPC and Arrow separately, but not Flight. However, Flight implementations include further optimizations to avoid overhead in usage of Protobuf (mostly around avoiding excessive memory copies).
RPC Methods and Request Patterns¶
Flight defines a set of RPC methods for uploading/downloading data, retrieving metadata about a data stream, listing available data streams, and for implementing application-specific RPC methods. A Flight service implements some subset of these methods, while a Flight client can call any of these methods.
Data streams are identified by descriptors (the FlightDescriptor
message), which are either a path or an arbitrary binary command. For
instance, the descriptor may encode a SQL query, a path to a file on a
distributed file system, or even a pickled Python object; the
application can use this message as it sees fit.
Thus, one Flight client can connect to any service and perform basic operations. To facilitate this, Flight services are expected to support some common request patterns, described next. Of course, applications may ignore compatibility and simply treat the Flight RPC methods as low-level building blocks for their own purposes.
See Protocol Buffer Definitions for full details on the methods and messages involved.
Downloading Data¶
A client that wishes to download the data would:
Construct or acquire a
FlightDescriptor
for the data set they are interested in.A client may know what descriptor they want already, or they may use methods like
ListFlights
to discover them.Call
GetFlightInfo(FlightDescriptor)
to get aFlightInfo
message.Flight does not require that data live on the same server as metadata. Hence,
FlightInfo
contains details on where the data is located, so the client can go fetch the data from an appropriate server. This is encoded as a series ofFlightEndpoint
messages insideFlightInfo
. Each endpoint represents some location that contains a subset of the response data.An endpoint contains a list of locations (server addresses) where this data can be retrieved from, and a
Ticket
, an opaque binary token that the server will use to identify the data being requested.If
FlightInfo.ordered
is true, this signals there is some order between data from different endpoints. Clients should produce the same results as if the data returned from each of the endpoints was concatenated, in order, from front to back.If
FlightInfo.ordered
is false, the client may return data from any of the endpoints in arbitrary order. Data from any specific endpoint must be returned in order, but the data from different endpoints may be interleaved to allow parallel fetches.Note that since some clients may ignore
FlightInfo.ordered
, if ordering is important and client support can not be ensured, servers should return a single endpoint.The response also contains other metadata, like the schema, and optionally an estimate of the dataset size.
Consume each endpoint returned by the server.
To consume an endpoint, the client should connect to one of the locations in the endpoint, then call
DoGet(Ticket)
with the ticket in the endpoint. This will give the client a stream of Arrow record batches.If the server wishes to indicate that the data is on the local server and not a different location, then it can return an empty list of locations. The client can then reuse the existing connection to the original server to fetch data. Otherwise, the client must connect to one of the indicated locations.
In this way, the locations inside an endpoint can also be thought of as performing look-aside load balancing or service discovery functions. And the endpoints can represent data that is partitioned or otherwise distributed.
The client must consume all endpoints to retrieve the complete data set. The client can consume endpoints in any order, or even in parallel, or distribute the endpoints among multiple machines for consumption; this is up to the application to implement. The client can also use
FlightInfo.ordered
. See the previous item for details ofFlightInfo.ordered
.Each endpoint may have expiration time (
FlightEndpoint.expiration_time
). If an endpoint has expiration time, the client can get data multiple times byDoGet
until the expiration time is reached. Otherwise, it is application-defined whetherDoGet
requests may be retried. The expiration time is represented as google.protobuf.Timestamp.If the expiration time is short, the client may be able to extend the expiration time by
RenewFlightEndpoint
action. The client need to useDoAction
withRenewFlightEndpoint
action type to extend the expiration time.Action.body
must beRenewFlightEndpointRequest
that hasFlightEndpoint
to be renewed.The client may be able to cancel the returned
FlightInfo
byCancelFlightInfo
action. The client need to useDoAction
withCancelFlightInfo
action type to cancel theFlightInfo
.
Uploading Data¶
To upload data, a client would:
Construct or acquire a
FlightDescriptor
, as before.Call
DoPut(FlightData)
and upload a stream of Arrow record batches.The
FlightDescriptor
is included with the first message so the server can identify the dataset.
DoPut
allows the server to send response messages back to the
client with custom metadata. This can be used to implement things like
resumable writes (e.g. the server can periodically send a message
indicating how many rows have been committed so far).
Exchanging Data¶
Some use cases may require uploading and downloading data within a
single call. While this can be emulated with multiple calls, this may
be difficult if the application is stateful. For instance, the
application may wish to implement a call where the client uploads data
and the server responds with a transformation of that data; this would
require being stateful if implemented using DoGet
and
DoPut
. Instead, DoExchange
allows this to be implemented as a
single call. A client would:
Construct or acquire a
FlightDescriptor
, as before.Call
DoExchange(FlightData)
.The
FlightDescriptor
is included with the first message, as withDoPut
. At this point, both the client and the server may simultaneously stream data to the other side.
Authentication¶
Flight supports a variety of authentication methods that applications can customize for their needs.
- “Handshake” authentication
This is implemented in two parts. At connection time, the client calls the
Handshake
RPC method, and the application-defined authentication handler can exchange any number of messages with its counterpart on the server. The handler then provides a binary token. The Flight client will then include this token in the headers of all future calls, which is validated by the server authentication handler.Applications may use any part of this; for instance, they may ignore the initial handshake and send an externally acquired token (e.g. a bearer token) on each call, or they may establish trust during the handshake and not validate a token for each call, treating the connection as stateful (a “login” pattern).
Warning
Unless a token is validated on every call, this pattern is not secure, especially in the presence of a layer 7 load balancer, as is common with gRPC, or if gRPC transparently reconnects the client.
- Header-based/middleware-based authentication
Clients may include custom headers with calls. Custom middleware can then be implemented to validate and accept/reject calls on the server side.
- Mutual TLS (mTLS)
The client provides a certificate during connection establishment which is verified by the server. The application does not need to implement any authentication code, but must provision and distribute certificates.
This may only be available in certain implementations, and is only available when TLS is also enabled.
Some Flight implementations may expose the underlying gRPC API as well, in which case any authentication method supported by gRPC is available.
Transport Implementations¶
Flight is primarily defined in terms of its Protobuf and gRPC specification below, but Arrow implementations may also support alternative transports (see Flight RPC). In that case, implementations should use the following URI schemes for the given transport implementations:
Transport |
URI Scheme |
---|---|
gRPC (plaintext) |
grpc: or grpc+tcp: |
gRPC (TLS) |
grpc+tls: |
gRPC (Unix domain socket) |
grpc+unix: |
UCX (plaintext) |
ucx: |
Error Handling¶
Arrow Flight defines its own set of error codes. The implementation differs between languages (e.g. in C++, Unimplemented is a general Arrow error status while it’s a Flight-specific exception in Java), but the following set is exposed:
Error Code |
Description |
---|---|
UNKNOWN |
An unknown error. The default if no other error applies. |
INTERNAL |
An error internal to the service implementation occurred. |
INVALID_ARGUMENT |
The client passed an invalid argument to the RPC. |
TIMED_OUT |
The operation exceeded a timeout or deadline. |
NOT_FOUND |
The requested resource (action, data stream) was not found. |
ALREADY_EXISTS |
The resource already exists. |
CANCELLED |
The operation was cancelled (either by the client or the server). |
UNAUTHENTICATED |
The client is not authenticated. |
UNAUTHORIZED |
The client is authenticated, but does not have permissions for the requested operation. |
UNIMPLEMENTED |
The RPC is not implemented. |
UNAVAILABLE |
The server is not available. May be emitted by the client for connectivity reasons. |
External Resources¶
Protocol Buffer Definitions¶
1/*
2 * Licensed to the Apache Software Foundation (ASF) under one
3 * or more contributor license agreements. See the NOTICE file
4 * distributed with this work for additional information
5 * regarding copyright ownership. The ASF licenses this file
6 * to you under the Apache License, Version 2.0 (the
7 * "License"); you may not use this file except in compliance
8 * with the License. You may obtain a copy of the License at
9 * <p>
10 * http://www.apache.org/licenses/LICENSE-2.0
11 * <p>
12 * Unless required by applicable law or agreed to in writing, software
13 * distributed under the License is distributed on an "AS IS" BASIS,
14 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
15 * See the License for the specific language governing permissions and
16 * limitations under the License.
17 */
18
19syntax = "proto3";
20import "google/protobuf/timestamp.proto";
21
22option java_package = "org.apache.arrow.flight.impl";
23option go_package = "github.com/apache/arrow/go/arrow/flight/internal/flight";
24option csharp_namespace = "Apache.Arrow.Flight.Protocol";
25
26package arrow.flight.protocol;
27
28/*
29 * A flight service is an endpoint for retrieving or storing Arrow data. A
30 * flight service can expose one or more predefined endpoints that can be
31 * accessed using the Arrow Flight Protocol. Additionally, a flight service
32 * can expose a set of actions that are available.
33 */
34service FlightService {
35
36 /*
37 * Handshake between client and server. Depending on the server, the
38 * handshake may be required to determine the token that should be used for
39 * future operations. Both request and response are streams to allow multiple
40 * round-trips depending on auth mechanism.
41 */
42 rpc Handshake(stream HandshakeRequest) returns (stream HandshakeResponse) {}
43
44 /*
45 * Get a list of available streams given a particular criteria. Most flight
46 * services will expose one or more streams that are readily available for
47 * retrieval. This api allows listing the streams available for
48 * consumption. A user can also provide a criteria. The criteria can limit
49 * the subset of streams that can be listed via this interface. Each flight
50 * service allows its own definition of how to consume criteria.
51 */
52 rpc ListFlights(Criteria) returns (stream FlightInfo) {}
53
54 /*
55 * For a given FlightDescriptor, get information about how the flight can be
56 * consumed. This is a useful interface if the consumer of the interface
57 * already can identify the specific flight to consume. This interface can
58 * also allow a consumer to generate a flight stream through a specified
59 * descriptor. For example, a flight descriptor might be something that
60 * includes a SQL statement or a Pickled Python operation that will be
61 * executed. In those cases, the descriptor will not be previously available
62 * within the list of available streams provided by ListFlights but will be
63 * available for consumption for the duration defined by the specific flight
64 * service.
65 */
66 rpc GetFlightInfo(FlightDescriptor) returns (FlightInfo) {}
67
68 /*
69 * For a given FlightDescriptor, get the Schema as described in Schema.fbs::Schema
70 * This is used when a consumer needs the Schema of flight stream. Similar to
71 * GetFlightInfo this interface may generate a new flight that was not previously
72 * available in ListFlights.
73 */
74 rpc GetSchema(FlightDescriptor) returns (SchemaResult) {}
75
76 /*
77 * Retrieve a single stream associated with a particular descriptor
78 * associated with the referenced ticket. A Flight can be composed of one or
79 * more streams where each stream can be retrieved using a separate opaque
80 * ticket that the flight service uses for managing a collection of streams.
81 */
82 rpc DoGet(Ticket) returns (stream FlightData) {}
83
84 /*
85 * Push a stream to the flight service associated with a particular
86 * flight stream. This allows a client of a flight service to upload a stream
87 * of data. Depending on the particular flight service, a client consumer
88 * could be allowed to upload a single stream per descriptor or an unlimited
89 * number. In the latter, the service might implement a 'seal' action that
90 * can be applied to a descriptor once all streams are uploaded.
91 */
92 rpc DoPut(stream FlightData) returns (stream PutResult) {}
93
94 /*
95 * Open a bidirectional data channel for a given descriptor. This
96 * allows clients to send and receive arbitrary Arrow data and
97 * application-specific metadata in a single logical stream. In
98 * contrast to DoGet/DoPut, this is more suited for clients
99 * offloading computation (rather than storage) to a Flight service.
100 */
101 rpc DoExchange(stream FlightData) returns (stream FlightData) {}
102
103 /*
104 * Flight services can support an arbitrary number of simple actions in
105 * addition to the possible ListFlights, GetFlightInfo, DoGet, DoPut
106 * operations that are potentially available. DoAction allows a flight client
107 * to do a specific action against a flight service. An action includes
108 * opaque request and response objects that are specific to the type action
109 * being undertaken.
110 */
111 rpc DoAction(Action) returns (stream Result) {}
112
113 /*
114 * A flight service exposes all of the available action types that it has
115 * along with descriptions. This allows different flight consumers to
116 * understand the capabilities of the flight service.
117 */
118 rpc ListActions(Empty) returns (stream ActionType) {}
119
120}
121
122/*
123 * The request that a client provides to a server on handshake.
124 */
125message HandshakeRequest {
126
127 /*
128 * A defined protocol version
129 */
130 uint64 protocol_version = 1;
131
132 /*
133 * Arbitrary auth/handshake info.
134 */
135 bytes payload = 2;
136}
137
138message HandshakeResponse {
139
140 /*
141 * A defined protocol version
142 */
143 uint64 protocol_version = 1;
144
145 /*
146 * Arbitrary auth/handshake info.
147 */
148 bytes payload = 2;
149}
150
151/*
152 * A message for doing simple auth.
153 */
154message BasicAuth {
155 string username = 2;
156 string password = 3;
157}
158
159message Empty {}
160
161/*
162 * Describes an available action, including both the name used for execution
163 * along with a short description of the purpose of the action.
164 */
165message ActionType {
166 string type = 1;
167 string description = 2;
168}
169
170/*
171 * A service specific expression that can be used to return a limited set
172 * of available Arrow Flight streams.
173 */
174message Criteria {
175 bytes expression = 1;
176}
177
178/*
179 * An opaque action specific for the service.
180 */
181message Action {
182 string type = 1;
183 bytes body = 2;
184}
185
186/*
187 * The request of the CancelFlightInfo action.
188 *
189 * The request should be stored in Action.body.
190 */
191message CancelFlightInfoRequest {
192 FlightInfo info = 1;
193}
194
195/*
196 * The request of the RenewFlightEndpoint action.
197 *
198 * The request should be stored in Action.body.
199 */
200message RenewFlightEndpointRequest {
201 FlightEndpoint endpoint = 1;
202}
203
204/*
205 * An opaque result returned after executing an action.
206 */
207message Result {
208 bytes body = 1;
209}
210
211/*
212 * The result of a cancel operation.
213 *
214 * This is used by CancelFlightInfoResult.status.
215 */
216enum CancelStatus {
217 // The cancellation status is unknown. Servers should avoid using
218 // this value (send a NOT_FOUND error if the requested query is
219 // not known). Clients can retry the request.
220 CANCEL_STATUS_UNSPECIFIED = 0;
221 // The cancellation request is complete. Subsequent requests with
222 // the same payload may return CANCELLED or a NOT_FOUND error.
223 CANCEL_STATUS_CANCELLED = 1;
224 // The cancellation request is in progress. The client may retry
225 // the cancellation request.
226 CANCEL_STATUS_CANCELLING = 2;
227 // The query is not cancellable. The client should not retry the
228 // cancellation request.
229 CANCEL_STATUS_NOT_CANCELLABLE = 3;
230}
231
232/*
233 * The result of the CancelFlightInfo action.
234 *
235 * The result should be stored in Result.body.
236 */
237message CancelFlightInfoResult {
238 CancelStatus status = 1;
239}
240
241/*
242 * Wrap the result of a getSchema call
243 */
244message SchemaResult {
245 // The schema of the dataset in its IPC form:
246 // 4 bytes - an optional IPC_CONTINUATION_TOKEN prefix
247 // 4 bytes - the byte length of the payload
248 // a flatbuffer Message whose header is the Schema
249 bytes schema = 1;
250}
251
252/*
253 * The name or tag for a Flight. May be used as a way to retrieve or generate
254 * a flight or be used to expose a set of previously defined flights.
255 */
256message FlightDescriptor {
257
258 /*
259 * Describes what type of descriptor is defined.
260 */
261 enum DescriptorType {
262
263 // Protobuf pattern, not used.
264 UNKNOWN = 0;
265
266 /*
267 * A named path that identifies a dataset. A path is composed of a string
268 * or list of strings describing a particular dataset. This is conceptually
269 * similar to a path inside a filesystem.
270 */
271 PATH = 1;
272
273 /*
274 * An opaque command to generate a dataset.
275 */
276 CMD = 2;
277 }
278
279 DescriptorType type = 1;
280
281 /*
282 * Opaque value used to express a command. Should only be defined when
283 * type = CMD.
284 */
285 bytes cmd = 2;
286
287 /*
288 * List of strings identifying a particular dataset. Should only be defined
289 * when type = PATH.
290 */
291 repeated string path = 3;
292}
293
294/*
295 * The access coordinates for retrieval of a dataset. With a FlightInfo, a
296 * consumer is able to determine how to retrieve a dataset.
297 */
298message FlightInfo {
299 // The schema of the dataset in its IPC form:
300 // 4 bytes - an optional IPC_CONTINUATION_TOKEN prefix
301 // 4 bytes - the byte length of the payload
302 // a flatbuffer Message whose header is the Schema
303 bytes schema = 1;
304
305 /*
306 * The descriptor associated with this info.
307 */
308 FlightDescriptor flight_descriptor = 2;
309
310 /*
311 * A list of endpoints associated with the flight. To consume the
312 * whole flight, all endpoints (and hence all Tickets) must be
313 * consumed. Endpoints can be consumed in any order.
314 *
315 * In other words, an application can use multiple endpoints to
316 * represent partitioned data.
317 *
318 * If the returned data has an ordering, an application can use
319 * "FlightInfo.ordered = true" or should return the all data in a
320 * single endpoint. Otherwise, there is no ordering defined on
321 * endpoints or the data within.
322 *
323 * A client can read ordered data by reading data from returned
324 * endpoints, in order, from front to back.
325 *
326 * Note that a client may ignore "FlightInfo.ordered = true". If an
327 * ordering is important for an application, an application must
328 * choose one of them:
329 *
330 * * An application requires that all clients must read data in
331 * returned endpoints order.
332 * * An application must return the all data in a single endpoint.
333 */
334 repeated FlightEndpoint endpoint = 3;
335
336 // Set these to -1 if unknown.
337 int64 total_records = 4;
338 int64 total_bytes = 5;
339
340 /*
341 * FlightEndpoints are in the same order as the data.
342 */
343 bool ordered = 6;
344}
345
346/*
347 * A particular stream or split associated with a flight.
348 */
349message FlightEndpoint {
350
351 /*
352 * Token used to retrieve this stream.
353 */
354 Ticket ticket = 1;
355
356 /*
357 * A list of URIs where this ticket can be redeemed via DoGet().
358 *
359 * If the list is empty, the expectation is that the ticket can only
360 * be redeemed on the current service where the ticket was
361 * generated.
362 *
363 * If the list is not empty, the expectation is that the ticket can
364 * be redeemed at any of the locations, and that the data returned
365 * will be equivalent. In this case, the ticket may only be redeemed
366 * at one of the given locations, and not (necessarily) on the
367 * current service.
368 *
369 * In other words, an application can use multiple locations to
370 * represent redundant and/or load balanced services.
371 */
372 repeated Location location = 2;
373
374 /*
375 * Expiration time of this stream. If present, clients may assume
376 * they can retry DoGet requests. Otherwise, it is
377 * application-defined whether DoGet requests may be retried.
378 */
379 google.protobuf.Timestamp expiration_time = 3;
380}
381
382/*
383 * A location where a Flight service will accept retrieval of a particular
384 * stream given a ticket.
385 */
386message Location {
387 string uri = 1;
388}
389
390/*
391 * An opaque identifier that the service can use to retrieve a particular
392 * portion of a stream.
393 *
394 * Tickets are meant to be single use. It is an error/application-defined
395 * behavior to reuse a ticket.
396 */
397message Ticket {
398 bytes ticket = 1;
399}
400
401/*
402 * A batch of Arrow data as part of a stream of batches.
403 */
404message FlightData {
405
406 /*
407 * The descriptor of the data. This is only relevant when a client is
408 * starting a new DoPut stream.
409 */
410 FlightDescriptor flight_descriptor = 1;
411
412 /*
413 * Header for message data as described in Message.fbs::Message.
414 */
415 bytes data_header = 2;
416
417 /*
418 * Application-defined metadata.
419 */
420 bytes app_metadata = 3;
421
422 /*
423 * The actual batch of Arrow data. Preferably handled with minimal-copies
424 * coming last in the definition to help with sidecar patterns (it is
425 * expected that some implementations will fetch this field off the wire
426 * with specialized code to avoid extra memory copies).
427 */
428 bytes data_body = 1000;
429}
430
431/**
432 * The response message associated with the submission of a DoPut.
433 */
434message PutResult {
435 bytes app_metadata = 1;
436}