Frick, Evan

3 publications

ICML 2025 From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and Benchbuilder Pipeline Tianle Li, Wei-Lin Chiang, Evan Frick, Lisa Dunlap, Tianhao Wu, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica
ICLR 2025 How to Evaluate Reward Models for RLHF Evan Frick, Tianle Li, Connor Chen, Wei-Lin Chiang, Anastasios Nikolas Angelopoulos, Jiantao Jiao, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica
ICML 2025 Prompt-to-Leaderboard: Prompt-Adaptive LLM Evaluations Evan Frick, Connor Chen, Joseph Tennyson, Tianle Li, Wei-Lin Chiang, Anastasios Nikolas Angelopoulos, Ion Stoica