Publications
* indicates equal contribution, and † indicates corresponding author.
ACL 2025 Findings

Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models
Shuliang Liu$^{*}$, Xinze Li$^{*}$, Zhenghao Liu$^†$, Yukun Yan, Cheng Yang, Zheni Zeng, Zhiyuan Liu, Maosong Sun, Ge Yu Paper | PDF
- This paper introduces Judge-Consistency (ConsJudge), a method that aims to help LLMs produce more accurate evaluations of RAG models.