In this line, when you transform the messages into a textual prompt, you may need to add the arg `add_generation_prompt=True` https://github.com/PremiLab-Math/MathCheck/blob/fc0454ed5333b5e33d70760a03fc8c5d33814491/scripts/text_vllm_inference.py#L96