-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Is your feature request related to a problem or challenge?
There is a common operation in libraries such as PySpark to fill NaN values across an entire DataFrame (or limit by columns). It would be useful to have a similar feature in DataFusion and datafusion-python.
Describe the solution you'd like
If I have a dataframe with a bunch of null values in different columns, I would want to replace all NaNs in those columns with the provided value IF it can be cast to the column's type. Otherwise no-op should happen. Also the user should be able to limit which columns this applies to.
Describe alternatives you've considered
Additional context
This is a repost from apache/datafusion-python#922, prompted by this PR comment
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request