Pyarrow lazyframes not collected properly in latest version #20370
Labels: bug, needs triage, python
Checks
Reproducible example
Log output
Issue description
In the new version of Polars, scanning cloud folders through pyarrow with `scan_pyarrow_dataset` no longer works as expected. I believe this might be related to issue #20279, but I'm not certain. In any case, here is a simple and reproducible example.
Expected behavior
Calling `df.select(pl.len()).collect()` on a LazyFrame read through pyarrow should correctly evaluate the length of the df, instead of returning the whole df.
Installed versions