Thanks for sharing!
I want to use my Chinese data for "Instruction Backtranslation". Do I need to modify only Seed Data and Unlabelled Data?
In addition, does "quality" in Seed Data need to be marked by itself? Because Seed Data of is the SFT data that has been manually checked, I do not know whether your Seed Data has other processing. At the same time, how can I provide "Unlabelled Data"?