Skip to content

Commit c5e1685

Browse files
committed
Addressing comments
1 parent 4f2f6ac commit c5e1685

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

CHANGELOG.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,6 @@
66

77
### Tasks
88

9-
- Added `BalancedCOPA` benchmark task (balanced variant of COPA from `pkavumba/balanced-copa` on HuggingFace)
10-
119
### Metrics
1210

1311
### General

src/eval_framework/tasks/benchmarks/balancedcopa.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,8 @@ def _split_dataset_into_train_and_val(self, dataset: DatasetDict) -> DatasetDict
3737
# We split the train data into train and validation splits so that
3838
# the validation split matches the validation split of the original COPA dataset.
3939
# These magic numbers of the ids below were arrived at after manual inspection of the dataset.
40+
# Numbers 401-500 correspond to the validation split of the original COPA dataset.
41+
# Numbers 1401-1500 correspond to the mirrored version of the val split.
4042
# The sanity of this version is maintained by the HF_REVISION above.
4143
dataset["validation"], dataset["train"] = split_dataset_by_id_ranges(
4244
dataset["train"], "id", [(401, 500), (1401, 1500)]

0 commit comments

Comments
 (0)