ML Anthology
Authors
Search
About
Wei, Kevin
3 publications
TMLR
2025
Infrastructure for AI Agents
Alan Chan
,
Kevin Wei
,
Sihao Huang
,
Nitarshan Rajkumar
,
Elija Perrier
,
Seth Lazar
,
Gillian K Hadfield
,
Markus Anderljung
ICLRW
2025
Model Evaluations Need Rigorous and Transparent Human Baselines
Kevin Wei
,
Patricia Paskov
,
Sunishchal Dev
,
Michael J Byun
,
Anka Reuel
,
Xavier Roberts-Gaal
,
Rachel Calcott
,
Evie Coxon
,
Chinmay Deshpande
ICML
2025
Position: Human Baselines in Model Evaluations Need Rigor and Transparency (With Recommendations & Reporting Checklist)
Kevin Wei
,
Patricia Paskov
,
Sunishchal Dev
,
Michael J Byun
,
Anka Reuel
,
Xavier Roberts-Gaal
,
Rachel Calcott
,
Evie Coxon
,
Chinmay Deshpande