ParquetΒΆ
It is quite simple to read a parquet file using the SessionContext.read_parquet()
function.
from datafusion import SessionContext
ctx = SessionContext()
df = ctx.read_parquet("file.parquet")
An alternative is to use SessionContext.register_parquet()
ctx.register_parquet("file", "file.parquet")
df = ctx.table("file")