Bahdanau, Dzmitry

14 publications

TMLR 2025 LLMs Can Learn Self-Restraint Through Iterative Self-Reflection Alexandre Piché, Aristides Milios, Dzmitry Bahdanau, Christopher Pal

ICLRW 2025 NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild Shikhar Murty, Hao Zhu, Dzmitry Bahdanau, Christopher D Manning

ICLRW 2024 Self-Evaluation and Self-Prompting to Improve the Reliability of LLMs Alexandre Piché, Aristides Milios, Dzmitry Bahdanau, Christopher Pal

TMLR 2023 StarCoder: May the Source Be with You! Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Joel Lamy-Poirier, Joao Monteiro, Nicolas Gontier, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Ben Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy, Jason T Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Urvashi Bhattacharyya, Wenhao Yu, Sasha Luccioni, Paulo Villegas, Fedor Zhdanov, Tony Lee, Nadav Timor, Jennifer Ding, Claire S Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro Von Werra, Harm de Vries

TMLR 2023 The Stack: 3 TB of Permissively Licensed Source Code Denis Kocetkov, Raymond Li, Loubna Ben Allal, Jia Li, Chenghao Mou, Yacine Jernite, Margaret Mitchell, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Dzmitry Bahdanau, Leandro Von Werra, Harm de Vries

NeurIPS 2021 Systematic Generalization with Edge Transformers Leon Bergen, Timothy O'Donnell, Dzmitry Bahdanau

AAAI 2020 Combating False Negatives in Adversarial Imitation Learning (Student Abstract) Konrad Zolna, Chitwan Saharia, Léonard Boussioux, David Yu-Tung Hui, Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Yoshua Bengio

ICLR 2019 BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, Thien Huu Nguyen, Yoshua Bengio

ICLR 2019 Learning to Understand Goal Specifications by Modelling Reward Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Arian Hosseini, Pushmeet Kohli, Edward Grefenstette

ICLR 2019 Systematic Generalization: What Is Required and Can It Be Learned? Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

ICLR 2017 An Actor-Critic Algorithm for Sequence Prediction Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron C. Courville, Yoshua Bengio

ICML 2017 Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-Control Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck

NeurIPS 2015 Attention-Based Models for Speech Recognition Jan K Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, Yoshua Bengio

ICLR 2015 Neural Machine Translation by Jointly Learning to Align and Translate Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio