Li, Ruosen

3 publications

NeurIPS 2024 IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering Ruosen Li, Ruochen Li, Barry Wang, Xinya Du

NeurIPS 2024 MEQA: A Benchmark for Multi-Hop Event-Centric Question Answering with Explanations Ruosen Li, Zimu Wang, Son Quoc Tran, Lei Xia, Xinya Du

TMLR 2024 PRD: Peer Rank and Discussion Improve Large Language Model Based Evaluations Ruosen Li, Teerth Patel, Xinya Du