Crate parquet_index

Source
Expand description

Binary that prints the [page index] of a parquet file

ยงInstall

parquet-layout can be installed using cargo:

cargo install parquet --features=cli

After this parquet-index should be available:

parquet-index XYZ.parquet COLUMN_NAME

The binary can also be built from the source code and run as follows:

cargo run --features=cli --bin parquet-index XYZ.parquet COLUMN_NAME

[page index]: https://github.com/apache/parquet-format/blob/master/PageIndex.md

Structsยง

Args ๐Ÿ”’

Functionsยง

compute_row_counts ๐Ÿ”’
Computes the number of rows in each page within a column chunk
main ๐Ÿ”’
print_index ๐Ÿ”’
Prints index information for a single column chunk