-
Notifications
You must be signed in to change notification settings - Fork 5
Request: optional order_columns parameter of get_dataframe() #124
Description
- ML Launchpad version: 1.0.0
- Model Type used: Python
- DataSource type(s) used: n/a
- Python version: 3.x
- Operating System: all
Description
Right now, if I want to ensure column ordering in a dataframe, I need to pass the df through mllaunchpad.order_columns in my own code.
This makes sense where my code wrangles multiple dataframes in a way that would make the resulting ordering non-predictable from the initial ordering, and I just want to order the columns before I pass the data into my model.
While this works, having to call order_columns in three functions might appear tedious in use cases that pass one dataframe from the datasource straight into the model. Some users might prefer to just call data_sources['bla'].get_dataframe(order_columns=True).
Some users might also prefer to have an option order_columns: true (default false) for the datasources in the configuration file.
Not sure what is the best way to go forward here -- suggestions are welcome!
(issue added for a user)