Parquet

To read a Parquet file into a DataFrame, use the SessionContext.read_parquet() function.

from datafusion import SessionContext

ctx = SessionContext()
df = ctx.read_parquet("file.parquet")

Alternatively, register the file as a named table with SessionContext.register_parquet() and then retrieve it by that name:

ctx.register_parquet("file", "file.parquet")
df = ctx.table("file")