Hey, is it possible to generate the balanced synthesized data even though the realtabformer model is trained on imbalanced data (the proportion is even up to 4 to 96). How do I do that?
CTGAN, TVAE, and even be_great are able to do this simply by:
model.sample(n_samples, start_col=target_col, start_col_dist={'Yes':0.5, 'No':0.5})