Skip to content

Commit d15b66d

Browse files
authored
Update README.md
1 parent e3e8430 commit d15b66d

File tree

1 file changed

+15
-1
lines changed

1 file changed

+15
-1
lines changed

README.md

Lines changed: 15 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -214,4 +214,18 @@ The evaluation result contains additional scoring fields:
214214
- `key_validation_score`: Score from validating expected keys in JSON output (for non-renderable outputs)
215215
- `raw_output_eval`: Array of boolean values indicating whether each raw output metric was satisfied
216216
- `raw_output_score`: Score from the raw output evaluation
217-
- `final_eval_score`: Overall evaluation score between 0 and 1
217+
- `final_eval_score`: Overall evaluation score between 0 and 1
218+
219+
## Citation
220+
Please cite us with the following bibtex:
221+
```
222+
@misc{yang2025structeval,
223+
title={StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs},
224+
author={Jialin Yang and Dongfu Jiang and Lipeng He and Sherman Siu and Yuxuan Zhang and Disen Liao and Zhuofeng Li and Huaye Zeng and Yiming Jia and Haozhe Wang and Benjamin Schneider and Chi Ruan and Wentao Ma and Zhiheng Lyu and Yifei Wang and Yi Lu and Quy Duc Do and Ziyan Jiang and Ping Nie and Wenhu Chen},
225+
year={2025},
226+
eprint={2505.20139},
227+
archivePrefix={arXiv},
228+
primaryClass={cs.SE},
229+
doi={10.48550/arXiv.2505.20139}
230+
}
231+
```

0 commit comments

Comments
 (0)