Xia, Emily

2 publications

ICML 2025 Putnam-AXIOM: A Functional & Static Benchmark for Measuring Higher Level Mathematical Reasoning in LLMs Aryan Gulati, Brando Miranda, Eric Chen, Emily Xia, Kai Fronsdal, Bruno De Moraes Dumont, Sanmi Koyejo
NeurIPSW 2024 Putnam-AXIOM: A Functional and Static Benchmark for Measuring Higher Level Mathematical Reasoning Aryan Gulati, Brando Miranda, Eric Chen, Emily Xia, Kai Fronsdal, Bruno de Moraes Dumont, Sanmi Koyejo