Gur, Izzeddin

22 publications

ICLR 2025. Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models. Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Aviral Kumar, Rishabh Agarwal, Sridhar Thiagarajan, Craig Boutilier, Aleksandra Faust
ICLR 2024. A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis. Izzeddin Gur, Hiroki Furuta, Austin V Huang, Mustafa Safdari, Yutaka Matsuo, Douglas Eck, Aleksandra Faust
TMLR 2024. Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. Avi Singh, John D Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron T Parisi, Abhishek Kumar, Alexander A Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Fathy Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura A Culp, Lechao Xiao, Maxwell Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel
TMLR 2024. Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web. Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur
ICLRW 2024. Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web. Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur
NeurIPS 2024. Geometric-Averaged Preference Optimization for Soft Preference Labels. Hiroki Furuta, Kuang-Huei Lee, Shixiang Shane Gu, Yutaka Matsuo, Aleksandra Faust, Heiga Zen, Izzeddin Gur
ICLR 2024. Multimodal Web Navigation with Instruction-Finetuned Foundation Models. Hiroki Furuta, Kuang-Huei Lee, Ofir Nachum, Yutaka Matsuo, Aleksandra Faust, Shixiang Shane Gu, Izzeddin Gur
ICML 2024. Scaling Exponents Across Parameterizations and Optimizers. Katie E Everett, Lechao Xiao, Mitchell Wortsman, Alexander A Alemi, Roman Novak, Peter J Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee, Jeffrey Pennington
ICLR 2024. Small-Scale Proxies for Large-Scale Transformer Training Instabilities. Mitchell Wortsman, Peter J Liu, Lechao Xiao, Katie E Everett, Alexander A Alemi, Ben Adlam, John D Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith
ICML 2023. CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica
ICLRW 2023. Instruction-Finetuned Foundation Models for Multimodal Web Navigation. Hiroki Furuta, Ofir Nachum, Kuang-Huei Lee, Yutaka Matsuo, Shixiang Shane Gu, Izzeddin Gur
NeurIPSW 2023. Language Model Agents Suffer from Compositional Generalization in Web Automation. Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur
ICLRW 2023. Understanding HTML with Large Language Models. Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin V Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust
NeurIPSW 2022. CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica
NeurIPSW 2022. CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica
UAI 2022. Fast Inference and Transfer of Compositional Task Structures for Few-Shot Task Generalization. Sungryull Sohn, Hyunjae Woo, Jongwook Choi, Lyubing Qiang, Izzeddin Gur, Aleksandra Faust, Honglak Lee
CVPR 2022. Less Is More: Generating Grounded Navigation Instructions from Landmarks. Su Wang, Ceslee Montgomery, Jordi Orbay, Vighnesh Birodkar, Aleksandra Faust, Izzeddin Gur, Natasha Jaques, Austin Waters, Jason Baldridge, Peter Anderson
NeurIPS 2021. Environment Generation for Zero-Shot Compositional Reinforcement Learning. Izzeddin Gur, Natasha Jaques, Yingjie Miao, Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust
NeurIPSW 2021. Fast Inference and Transfer of Compositional Task for Few-Shot Task Generalization. Sungryull Sohn, Hyunjae Woo, Jongwook Choi, Lyubing Qiang, Izzeddin Gur, Aleksandra Faust, Honglak Lee
ICMLW 2021. SparseDice: Imitation Learning for Temporally Sparse Data via Regularization. Alberto Camacho, Izzeddin Gur, Marcin Lukasz Moczulski, Ofir Nachum, Aleksandra Faust
NeurIPSW 2021. Targeted Environment Design from Offline Data. Izzeddin Gur, Ofir Nachum, Aleksandra Faust
ICLR 2019. Learning to Navigate the Web. Izzeddin Gur, Ulrich Rueckert, Aleksandra Faust, Dilek Hakkani-Tur