Christiano, Paul F

5 publications

NeurIPS 2022 Training Language Models to Follow Instructions with Human Feedback Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F Christiano, Jan Leike, Ryan Lowe
NeurIPS 2020 Learning to Summarize with Human Feedback Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul F Christiano
NeurIPS 2017 Deep Reinforcement Learning from Human Preferences Paul F Christiano, Jan Leike, Tom Brown, Miljan Martic, Shane Legg, Dario Amodei
COLT 2016 Provably Manipulation-Resistant Reputation Systems Paul F. Christiano
COLT 2014 Open Problem: Online Local Learning Paul F. Christiano