Qingchen Yu

profile_photo

I am a Ph.D. student in Artificial Intelligence at Beihang University. My research focuses mainly on large language models.

Email: zhgyqc[at]163[dot]com

GitHub | Twitter | DINQ
Google Scholar | CV | Hugging Face

Education

Beihang University, Beijing, China
Ph.D. Student in Artificial Intelligence, 2025.09 - Present
Shanghai University, Shanghai, China
M.Mgt. in Management Science and Engineering, 2022.09 - 2025.04
Henan University of Economics and Law, Zhengzhou, China
B.Mgt. in E-commerce, 2018.09 - 2022.06

Selected Publications

* Contributed Equally; Corresponding Author

guessarena_framework

GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning
Qingchen Yu*, Zifan Zheng*, Ding Chen*, Simin Niu, Bo Tang, et al.
ACL, 2025
ACL Anthology | arXiv | GitHub | Poster

xfinder_framework

xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Qingchen Yu*, Zifan Zheng*, Shichao Song*, Zhiyu Li, Feiyu Xiong, et al.
ICLR, 2025
OpenReview | arXiv | GitHub | Models | Dataset | WeChat Article | Poster

xverify_framework

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Ding Chen*, Qingchen Yu*, Pengyuan Wang*, Wentao Zhang, Bo Tang, et al.
arXiv, 2025
arXiv | GitHub | Models | WeChat Article | X Thread

turtle_framework

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles
Qingchen Yu*, Shichao Song*, Ke Fang*, Yunfeng Shi, Zifan Zheng, et al.
arXiv, 2024
IEEE Xplore | arXiv | GitHub | Dataset | Demo | Blog Post | Poster

grimoire_framework

Grimoire is All You Need for Enhancing Large Language Models
Ding Chen*, Shichao Song*, Qingchen Yu, Zhiyu Li, Wenjin Wang, et al.
arXiv, 2024
arXiv | GitHub | WeChat Article

Honors and Awards

  • Outstanding Graduate Award of Henan (2022)
  • National Scholarship for Undergraduate Students (2021)
  • National Second Prize in the China Undergraduate Mathematical Contest in Modeling (2020)

Academic Services

  • Reviewer for ICLR, CVPR, ECCV, ACM TIST, TMLR
  • Volunteer for COSCon'25