Skip to content

Commit d3d769b

Browse files
authored
evalplus: add llama 3.1 8b
1 parent 05c4824 commit d3d769b

File tree

2 files changed

+13
-1
lines changed

2 files changed

+13
-1
lines changed

leaderboard.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -93,7 +93,7 @@ <h1 class="text-nowrap mt-5">🏆 EvalPlus Leaderboard 🏆</h1>
9393
<h3 class="fw-light text-nowrap">
9494
<small id="warning">EvalPlus evaluates AI Coders with rigorous tests.<br /></small>
9595
</h3>
96-
<p>📢 Beyond correctness, how's their code efficiency? Checkout <a href="./evalperf.html">🚀EvalPerf</a>!</p>
96+
<p>📢 News: Beyond correctness, how's their code efficiency? Checkout <a href="./evalperf.html">🚀EvalPerf</a>!</p>
9797
<div class="d-flex flex-row justify-content-center gap-3">
9898
<a href="https://github.com/evalplus/evalplus"><img
9999
src="https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white"

results.json

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -527,6 +527,18 @@
527527
"prompted": false,
528528
"size": 8.0
529529
},
530+
"Llama3.1-8B-instruct": {
531+
"link": "https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct",
532+
"open-data": "NONE",
533+
"pass@1": {
534+
"humaneval": 69.5,
535+
"humaneval+": 62.8,
536+
"mbpp": 66.4,
537+
"mbpp+": 54.0
538+
},
539+
"prompted": true,
540+
"size": 8.0
541+
},
530542
"Llama3-8B-instruct": {
531543
"link": "https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct",
532544
"open-data": "NONE",

0 commit comments

Comments
 (0)