Commit ad506a1

remove duplicate tags/groups (#3343)
Parent: d5ddccd

21 files changed, 44 additions and 44 deletions

lm_eval/tasks/longbench2/_longbench2.yaml

Lines changed: 5 additions & 5 deletions
@@ -1,10 +1,10 @@
 group: longbench2
 task:
-  - longbench2_history
-  - longbench2_incontext
-  - longbench2_multi
-  - longbench2_single
-  - longbench2_structured
+  - longbench2_history_tasks
+  - longbench2_incontext_tasks
+  - longbench2_multi_tasks
+  - longbench2_single_tasks
+  - longbench2_structured_tasks
   - longbench2_code
 aggregate_metric_list:
   - metric: acc

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_multi
+  - longbench2_tasks
+  - longbench2_multi_tasks
 task: longbench2_academic_multi
 dataset_name: academic_multi

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_single
+  - longbench2_tasks
+  - longbench2_single_tasks
 task: longbench2_academic_single
 dataset_name: academic_single

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_history
+  - longbench2_tasks
+  - longbench2_history_tasks
 task: longbench2_agent_history
 dataset_name: agent_history_qa

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_single
+  - longbench2_tasks
+  - longbench2_single_tasks
 task: longbench2_detective
 dataset_name: detective

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_history
+  - longbench2_tasks
+  - longbench2_history_tasks
 task: longbench2_dialogue_history
 dataset_name: dialogue_history_qa

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_single
+  - longbench2_tasks
+  - longbench2_single_tasks
 task: longbench2_event_order
 dataset_name: event_ordering

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_multi
+  - longbench2_tasks
+  - longbench2_multi_tasks
 task: longbench2_fin_multi
 dataset_name: financial_multi

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_single
+  - longbench2_tasks
+  - longbench2_single_tasks
 task: longbench2_fin_single
 dataset_name: financial_single

Lines changed: 2 additions & 2 deletions
@@ -1,6 +1,6 @@
 include: _longbench_common_yaml
 tag:
-  - longbench2
-  - longbench2_multi
+  - longbench2_tasks
+  - longbench2_multi_tasks
 task: longbench2_govt_multi
 dataset_name: government_multi
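
Reviewer note: the hunks above suggest that the same name (for example `longbench2`) was previously used both as a `group` and as a `tag`, and that the rename to `*_tasks` removes that overlap. That reading is an inference from the diff, not something the commit states. Under that assumption, a minimal sketch of a check for such collisions could look like the following. The function name `find_name_collisions` is hypothetical, and the dicts are hand-written stand-ins for the YAML files, not anything loaded through lm_eval.

```python
from typing import Iterable, Mapping


def find_name_collisions(configs: Iterable[Mapping]) -> list[str]:
    """Return names that appear both as a `group` and as a `tag`.

    Hypothetical helper: each config is a plain dict mirroring one YAML file.
    """
    groups: set[str] = set()
    tags: set[str] = set()
    for cfg in configs:
        group = cfg.get("group")
        if isinstance(group, str):
            groups.add(group)
        # `tag` may be a single string or a list of strings.
        tag = cfg.get("tag") or []
        if isinstance(tag, str):
            tag = [tag]
        tags.update(tag)
    return sorted(groups & tags)


# Stand-ins for the group file and one task file before the commit.
before = [
    {"group": "longbench2", "task": ["longbench2_multi", "longbench2_code"]},
    {"tag": ["longbench2", "longbench2_multi"], "task": "longbench2_academic_multi"},
]

# The same two files after the rename shown in the diff.
after = [
    {"group": "longbench2", "task": ["longbench2_multi_tasks", "longbench2_code"]},
    {"tag": ["longbench2_tasks", "longbench2_multi_tasks"], "task": "longbench2_academic_multi"},
]

print(find_name_collisions(before))  # ['longbench2']
print(find_name_collisions(after))   # []
```

The check only compares `group` names against `tag` names. It does not verify that every entry in a group's `task` list resolves to an existing task or tag.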
