Open
Description
With PyArrow and Pandas, a user can select a set of columns to return and filter by a column that is not in this set. For example:
import pandas as pd
frame = pd.read_parquet('Npix=0.parquet', columns=['ra', 'dec'], filters=[('yMeanPSFMag', '<', 15)])
However, when I try to do the same with lsdb.read_hats and the HATS server, I always get an empty DataFrame.
Could we make column selection a two-step process: first load the union of the columns named in columns and in filters, apply the filter, and then drop the columns that appeared only in filters?
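To make the proposal concrete, here is a rough sketch of the two-step selection in plain pandas. It is not how lsdb or the HATS server are implemented. `select_with_filters` and its `load_columns` callable are hypothetical names invented for this example, and only simple comparison filters combined with AND are handled.

```python
import operator

import pandas as pd

# Comparison operators handled by this sketch (a subset of what PyArrow accepts).
_OPS = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "==": operator.eq,
    "!=": operator.ne,
}


def select_with_filters(load_columns, columns, filters):
    """Load columns plus filter columns, filter, then keep only `columns`.

    `load_columns` is a hypothetical callable that takes a list of column
    names and returns a DataFrame; it stands in for the real reader.
    """
    filter_columns = [name for name, _, _ in filters]
    # Step 1: load the union of both column lists, in order, without duplicates.
    to_load = list(dict.fromkeys([*columns, *filter_columns]))
    frame = load_columns(to_load)

    # Step 2: apply every filter (combined with AND) while the columns are present.
    mask = pd.Series(True, index=frame.index)
    for name, op, value in filters:
        mask &= _OPS[op](frame[name], value)

    # Step 3: return only the columns the user asked for.
    return frame.loc[mask, columns]


# Small in-memory table standing in for a Parquet file (made-up values).
table = pd.DataFrame(
    {
        "ra": [10.0, 20.0, 30.0],
        "dec": [-5.0, 0.0, 5.0],
        "yMeanPSFMag": [14.2, 16.1, 14.9],
    }
)
result = select_with_filters(
    lambda cols: table[cols],
    columns=["ra", "dec"],
    filters=[("yMeanPSFMag", "<", 15)],
)
```

Here `result` contains only `ra` and `dec`, for the rows whose `yMeanPSFMag` is below 15, even though `yMeanPSFMag` was not requested in `columns`.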