Strauss, Hubert

1 publication

NeurIPS 2025 What Makes a Reward Model a Good Teacher? An Optimization Perspective Noam Razin, Zixuan Wang, Hubert Strauss, Stanley Wei, Jason D. Lee, Sanjeev Arora