Skip to content

Filter pushdown? #163

@eitsupi

Description

@eitsupi

I am wondering if there is a way to filter and read Parquet datasets in Julia.
https://arrow.apache.org/blog/2021/12/03/arrow-duckdb/

For example, it would be great to have a written example of the following query from pyarrow translated into Julia.

import pyarrow.dataset as ds
dataset = ds.dataset(base / "parquet_dataset", format="parquet")
dataset.to_table(filter=ds.field('a') >= 7).to_pandas()

https://arrow.apache.org/docs/python/dataset.html#filtering-data

I am hardly familiar with Julia and apologize if this is not the right place to post this.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions