When I use the DeepSeek-R1 reasoning model for classification evaluation, the results are inconsistent. The DeepSeek-Chat model does not have this problem. What is the reason for this?