Hello,thanks for your great work. I have offline preprocess internvl_sft_1.2M data,but how do I use it to fast my training ?