Welcome to the User Guide for the Python bindings of Arrow DataFusion. This guide aims to provide an introduction to DataFusion through various examples and highlight the most effective ways of using it.


DataFusion is a Python library and, as such, can be installed via pip from PyPI.

pip install datafusion

You can verify the installation by running:

In [1]: import datafusion

In [2]: datafusion.__version__
Out[2]: '33.0.0'