DataFusion Logo

Apache Arrow DataFusion

Star Fork

DataFusion is a very fast, extensible query engine for building high-quality data-centric systems in Rust, using the Apache Arrow in-memory format.

DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community.

The example usage section in the user guide and the datafusion-examples code in the crate contain information on using DataFusion.

Please see the developer’s guide for contributing and communication for getting in touch with us.