Gandhi, Kanishk
11 publications
NeurIPSW
2024
Policy Dreamer: Diverse Public Policy Generation via Elicitation and Simulation of Human Preferences
NeurIPS
2024
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles Without Preference Labels