我尝试在DRIVE上进行训练,epoch设置为200,batch_size设置为512;在train.sh进行训练,test.sh生成pred,再通过eval将生成的pred和label得到的结果比较差,能帮忙分析下原因吗 <img width="784" height="298" alt="Image" src="https://github.com/user-attachments/assets/96bf07d6-439f-426a-9052-9b85bc95d67f" /> <img width="332" height="164" alt="Image" src="https://github.com/user-attachments/assets/732ada03-b2b4-4512-b79d-ab3eb74322e2" />