Uesato, Jonathan

9 publications

NeurIPS 2022 Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models Maribeth Rauh, John Mellor, Jonathan Uesato, Po-Sen Huang, Johannes Welbl, Laura Weidinger, Sumanth Dathathri, Amelia Glaese, Geoffrey Irving, Iason Gabriel, William Isaac, Lisa Anne Hendricks

NeurIPS 2021 Make Sure You're Unsure: A Framework for Verifying Probabilistic Specifications Leonard Berrada, Sumanth Dathathri, Krishnamurthy Dvijotham, Robert Stanforth, Rudy R Bunel, Jonathan Uesato, Sven Gowal, M. Pawan Kumar

NeurIPS 2020 Enabling Certification of Verification-Agnostic Networks via Memory-Efficient Semidefinite Programming Sumanth Dathathri, Krishnamurthy Dvijotham, Alexey Kurakin, Aditi Raghunathan, Jonathan Uesato, Rudy R Bunel, Shreya Shankar, Jacob Steinhardt, Ian Goodfellow, Percy Liang, Pushmeet Kohli

ICLR 2020 Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control Tsui-Wei Weng, Krishnamurthy Dvijotham, Jonathan Uesato, Kai Xiao, Sven Gowal, Robert Stanforth, Pushmeet Kohli

NeurIPS 2019 Are Labels Required for Improving Adversarial Robustness? Jean-Baptiste Alayrac, Jonathan Uesato, Po-Sen Huang, Alhussein Fawzi, Robert Stanforth, Pushmeet Kohli

ICLR 2019 Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishnamurthy Dvijotham, Nicolas Heess, Pushmeet Kohli

ICLR 2019 Verification of Non-Linear Specifications for Neural Networks Chongli Qin, Krishnamurthy Dvijotham, Brendan O'Donoghue, Rudy Bunel, Robert Stanforth, Sven Gowal, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli

ICML 2018 Adversarial Risk and the Dangers of Evaluating Against Weak Attacks Jonathan Uesato, Brendan O'Donoghue, Pushmeet Kohli, Aaron van den Oord

ICML 2017 RobustFill: Neural Program Learning Under Noisy I/O Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli