Currently, the splitting of training data (into a validation split or cross-validation folds) happens in prepare_data_for_training(), which is defined by each model.
It should be moved out of the models and into saber.Saber, i.e., it should be model-agnostic.
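
A rough sketch of what a model-agnostic split could look like. The function name split_indices and its parameters (k_folds, validation_split, seed) are hypothetical and not taken from the existing code. It works purely on example indices, so it needs no knowledge of any particular model.

```python
import random
from typing import List, Tuple

# One split is a pair: (train indices, validation indices).
Split = Tuple[List[int], List[int]]


def split_indices(n_examples: int,
                  k_folds: int = 0,
                  validation_split: float = 0.1,
                  seed: int = 0) -> List[Split]:
    """Hypothetical helper: split example indices without touching any model.

    Returns k_folds splits if k_folds > 1, otherwise a single
    train/validation split sized by validation_split.
    """
    if n_examples < 2:
        raise ValueError("need at least two examples to split")
    if k_folds > n_examples:
        raise ValueError("k_folds cannot exceed the number of examples")

    # Shuffle once with a fixed seed so the splits are reproducible.
    indices = list(range(n_examples))
    random.Random(seed).shuffle(indices)

    if k_folds > 1:
        # Cross-validation: every fold is held out exactly once.
        folds = [indices[i::k_folds] for i in range(k_folds)]
        splits = []
        for i, valid in enumerate(folds):
            train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
            splits.append((train, valid))
        return splits

    # Single validation split: hold out at least one example.
    n_valid = max(1, int(n_examples * validation_split))
    return [(indices[n_valid:], indices[:n_valid])]
```

Under this assumption, saber.Saber would compute the splits once and pass the selected examples to each model, so prepare_data_for_training() would no longer need its own splitting logic.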