ML Anthology
Authors
Search
About
Shen, Minghe
1 publications
ICLR
2026
Noisy but Valid: Robust Statistical Evaluation of LLMs with Imperfect Judges
Chen Feng
,
Minghe Shen
,
Ananth Balashankar
,
Carsten Gerner-Beuerle
,
Miguel R. D. Rodrigues