Abduhu1

Abdullah Andrabi Abduhu1

Pinned Loading

Evaluating-SLMs-on-JEEBench Evaluating-SLMs-on-JEEBench Public

A comprehensive analysis of seven state-of-the-art SLMs on JEEBench, a rigorous benchmark for mathematical and scientific reasoning. This project explores the impact of zero-shot, few-shot, and Cha…

Python 1