๐Ÿ“ Publications

* indicates equal contribution, and โ€  indicates corresponding author.

ACL 2025 findings
sym

Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models

Shuliang Liu$^{*}$, Xinze Li$^{*}$, Zhenghao Liu$^โ€ $,Yukun Yan,Cheng Yang,Zheni Zeng,Zhiyuan Liu,Maosong Sun,Ge Yu ๐Ÿ“ƒPaper | ๐Ÿ“„PDF

  • This paper introduces the Judge-Consistency (ConsJudge) method, which aims to enhance LLMs to generate more accurate evaluations for RAG models.