You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enhance majority voting plugin and update eval config
Upgrades the majority voting plugin with category-aware answer extraction, adaptive temperature control, improved normalization, response quality filtering, and smart fallback strategies. Updates the default test-time compute configuration in eval_optillmbench.py to use 5 candidates instead of 8 for fairer comparison and memory efficiency, and revises related reporting logic and documentation.
0 commit comments