Shen, Minghe

1 publication

ICLR 2026. Noisy but Valid: Robust Statistical Evaluation of LLMs with Imperfect Judges. Chen Feng, Minghe Shen, Ananth Balashankar, Carsten Gerner-Beuerle, Miguel R. D. Rodrigues.