lp.read_parquet

Reads a Parquet file and returns a LazyFrame.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| path | `str` | Path to the Parquet file. | required |
| columns | `list[str] \| None` | List of column names to read from the file. If None, all columns are read. | None |
| include_file_row_number | `bool` | If True, includes a column with the row number from the file. | False |
| include_filename | `bool` | If True, includes a column with the filename. | False |
| use_hive_partitioning | `bool` | If True, enables Hive partitioning. | False |
| union_by_name | `bool` | If True, unions files by column name. | False |
| compression | `str \| None` | Compression type to use when reading the Parquet file. | None |
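
The use_hive_partitioning and union_by_name flags are described only briefly above, so the following is a minimal, standard-library sketch of the two ideas as they are commonly understood: Hive-style paths carry `key=value` directory segments that become column values, and a union by name aligns rows on column names rather than on position. The helper names `hive_partition_values` and `union_by_name` are hypothetical illustrations, not part of lazy_pandas, and this is not a description of how the library implements either flag.

```python
from pathlib import PurePosixPath


def hive_partition_values(path: str) -> dict[str, str]:
    """Collect key=value directory segments from a Hive-style path (illustrative only)."""
    values: dict[str, str] = {}
    # Only directory parts are inspected; the file name itself is skipped.
    for part in PurePosixPath(path).parent.parts:
        key, sep, value = part.partition("=")
        if sep and key:
            values[key] = value
    return values


def union_by_name(*tables: list[dict]) -> list[dict]:
    """Stack rows from several tables, aligning columns by name (illustrative only)."""
    # Record column names in first-seen order across all tables.
    columns: list[str] = []
    for table in tables:
        for row in table:
            for name in row:
                if name not in columns:
                    columns.append(name)
    # A column missing from a row is filled with None.
    return [
        {name: row.get(name) for name in columns}
        for table in tables
        for row in table
    ]


# Hypothetical path and rows, used only to exercise the helpers.
print(hive_partition_values("sales/year=2024/month=01/part-0.parquet"))
print(union_by_name([{"a": 1, "b": 2}], [{"b": 3, "c": 4}]))
```

In this sketch the partition values are kept as strings and missing columns become None; a real reader may infer types or handle missing columns differently.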

Returns:

| Name | Type | Description |
| --- | --- | --- |
| LazyFrame | `LazyFrame` | A LazyFrame containing the data from the Parquet file. |

Example:

import lazy_pandas as lp
df = lp.read_parquet('data.parquet', columns=['column1', 'column2'])
df.head()