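This multi-model evaluation is great. Could it be extended to support different data sources? At the moment it appears to support only the Falcon Chinese dataset #2943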
Evan-Luo-Engnieer
started this conversation in
Ideas
Replies: 1 comment
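Not supported yet, though we can consider it later. BIRD and Spider can only be run on their DEV datasets. If you need this now, you can also adapt your dataset's format yourself on top of the current evaluation framework.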
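As a rough illustration of what "adapting the dataset format" could look like, here is a minimal sketch that maps records from different datasets onto one common shape through a configurable field mapping. Everything in it is hypothetical: `EvalExample`, `FIELD_MAPS`, `normalize`, and `load_dataset` are names made up for this example and are not part of the evaluation framework, and the source field names in the mappings are assumptions about the dataset JSON files that should be checked against the actual data.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict, Iterable, List


@dataclass(frozen=True)
class EvalExample:
    """Common record shape (hypothetical, not the framework's own type)."""
    question: str
    gold_sql: str
    db_id: str


# Hypothetical configuration: target field -> source field in the raw JSON.
# The source field names are assumptions; verify them against your files.
FIELD_MAPS: Dict[str, Dict[str, str]] = {
    "spider_dev": {"question": "question", "gold_sql": "query", "db_id": "db_id"},
    "bird_dev": {"question": "question", "gold_sql": "SQL", "db_id": "db_id"},
}


def normalize(
    records: Iterable[Dict[str, Any]], field_map: Dict[str, str]
) -> List[EvalExample]:
    """Convert raw records to EvalExample using the given field mapping."""
    return [
        EvalExample(**{target: str(rec[source]) for target, source in field_map.items()})
        for rec in records
    ]


def load_dataset(name: str, path: str) -> List[EvalExample]:
    """Load a JSON list of records and normalize it by dataset name."""
    if name not in FIELD_MAPS:
        raise KeyError(f"no field mapping configured for dataset {name!r}")
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    return normalize(records, FIELD_MAPS[name])
```

With a layout like this, supporting another dataset would mostly mean adding one entry to the mapping, provided the new dataset is also a JSON list of flat records.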
I did some searching, and this could be made configurable. For example, English datasets such as the following could be added: