Uesato, Jonathan

9 publications

NeurIPS 2022 Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models Maribeth Rauh, John Mellor, Jonathan Uesato, Po-Sen Huang, Johannes Welbl, Laura Weidinger, Sumanth Dathathri, Amelia Glaese, Geoffrey Irving, Iason Gabriel, William Isaac, Lisa Anne Hendricks
NeurIPS 2021 Make Sure You're Unsure: A Framework for Verifying Probabilistic Specifications Leonard Berrada, Sumanth Dathathri, Krishnamurthy Dvijotham, Robert Stanforth, Rudy R Bunel, Jonathan Uesato, Sven Gowal, M. Pawan Kumar
NeurIPS 2020 Enabling Certification of Verification-Agnostic Networks via Memory-Efficient Semidefinite Programming Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy R Bunel, Shreya Shankar, Jacob Steinhardt, Ian Goodfellow, Percy Liang, Pushmeet Kohli
ICLR 2020 Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control Tsui-Wei Weng, Krishnamurthy Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli
NeurIPS 2019 Are Labels Required for Improving Adversarial Robustness? Jean-Baptiste Alayrac, Jonathan Uesato, Po-Sen Huang, Alhussein Fawzi, Robert Stanforth, Pushmeet Kohli
ICLR 2019 Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishnamurthy Dvijotham, Nicolas Heess, Pushmeet Kohli
ICLR 2019 Verification of Non-Linear Specifications for Neural Networks Chongli Qin, Krishnamurthy Dvijotham, Brendan O'Donoghue, Rudy Bunel, Robert Stanforth, Sven Gowal, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli
ICML 2018 Adversarial Risk and the Dangers of Evaluating Against Weak Attacks Jonathan Uesato, Brendan O’Donoghue, Pushmeet Kohli, Aaron Oord
ICML 2017 RobustFill: Neural Program Learning Under Noisy I/O Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli