Smith, Virginia
54 publications
ICMLW
2024
Learning to Reason by Failing: Offline RL on Sub-Optimal Rollouts Scales Synthetic Data by 8x
NeurIPS
2024
On the Benefits of Public Representations for Private Transfer Learning Under Distribution Shift
NeurIPS
2024
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
NeurIPS
2023
Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift
NeurIPSW
2022
To Federate or Not to Federate: Incentivizing Client Participation in Federated Learning