Skip to content

Commit 9fbe48c

Browse files
authored
fix longbech citation (#3061)
* fix longbech citation
1 parent e20ef72 commit 9fbe48c

File tree

1 file changed

+3
-9
lines changed

1 file changed

+3
-9
lines changed

lm_eval/tasks/longbench/README.md

Lines changed: 3 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,23 +1,17 @@
1-
# Task-name
1+
# LongBench
22

33
### Paper
44

5-
Title: `LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks`
5+
Title: `LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding`
66

7-
Abstract: `This paper introduces LongBench v2, a benchmark designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 consists of 503 challenging multiple-choice questions, with contexts ranging from 8k to 2M words, across six major task categories: single-document QA, multi-document QA, long in-context learning, long-dialogue history understanding, code repository understanding, and long structured data understanding.`
7+
Abstract: `In this paper, we introduce LongBench, the first bilingual, multi-task benchmark for long context understanding, enabling a more rigorous evaluation of long context understanding. LongBench comprises 21 datasets across 6 task categories in both English and Chinese, with an average length of 6,711 words (English) and 13,386 characters (Chinese). These tasks cover key long-text application areas including single-doc QA, multi-doc QA, summarization, few-shot learning, synthetic tasks, and code completion. All datasets in LongBench are standardized into a unified format, allowing for effortless automatic evaluation of LLMs`
88

99
Homepage: `https://github.com/THUDM/LongBench`
1010

1111

1212
### Citation
1313

1414
```
15-
@article{bai2024longbench2,
16-
title={LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks},
17-
author={Yushi Bai and Shangqing Tu and Jiajie Zhang and Hao Peng and Xiaozhi Wang and Xin Lv and Shulin Cao and Jiazheng Xu and Lei Hou and Yuxiao Dong and Jie Tang and Juanzi Li},
18-
journal={arXiv preprint arXiv:2412.15204},
19-
year={2024}
20-
}
2115
@inproceedings{bai2024longbench,
2216
title = "{L}ong{B}ench: A Bilingual, Multitask Benchmark for Long Context Understanding",
2317
author = "Bai, Yushi and Lv, Xin and Zhang, Jiajie and Lyu, Hongchang and

0 commit comments

Comments
 (0)