Hi, thanks for the great work! I’ve been training a model following your setup. I’d like to know: what’s a good range for val/test_score/JudgeLRM? Just want to better understand how my results compare. Thanks in advance!