v0.2.1
Feature
- Test-based Evaluation with LLM-as-a-judge (#225) (0f1f0f8)
- Add a code_interpreter tool (#232) (b03c964)
Fix
- Add simple lock to hf generation to prevent using incorrect weights (#237) (6b2a527)
- Collection of small fixes (#238) (2120112)
- Fix unused litellm import (#246) (633bfd7)
- Minor updates to answer relevance (#245) (bde9b4d)
- Pre-commit file selection (#243) (e70d307)