Laidlaw, Cassidy
21 publications
ICLR
2025
Iterative Label Refinement Matters More than Preference Optimization Under Weak Supervision
ICLR
2024
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
21 publications