Preprocessing Feature Improvements

The MLjar is a great framework, and I suggest customizing the preprocessing steps.

Currently, the preprocessing is encapsulated by the ModelFramework class and is unreachable for the programmer to modify. I'd like to add [dimensionality reduction](https://scikit-learn.org/stable/api/sklearn.decomposition.html), [oversampling](https://imbalanced-learn.org/stable/over_sampling.html), and [undersampling](https://imbalanced-learn.org/stable/under_sampling.html) methods to the preprocessing steps. I suggest making the preprocessing class accessible to outsiders and passing it through to the ModelFramework. This way, the preprocessing class could be a "manager" for all the preprocessing steps. The developers could then add custom/additional preprocessing steps to the pipeline. It would also be great to integrate the parameters to be optimized on those preprocessing methods (e.g., number of components for the dimensionality reduction method).

If there might already be a solution, I'm eager to hear about it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Preprocessing Feature Improvements #797

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Preprocessing Feature Improvements #797

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions