Boutilier, Craig

155 publications

ICLR 2025 Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Aviral Kumar, Rishabh Agarwal, Sridhar Thiagarajan, Craig Boutilier, Aleksandra Faust
ICML 2025 Preference Adaptive and Sequential Text-to-Image Generation Ofir Nabati, Guy Tennenholtz, Chih-Wei Hsu, Moonkyung Ryu, Deepak Ramachandran, Yinlam Chow, Xiang Li, Craig Boutilier
ICLR 2024 Demystifying Embedding Spaces Using Large Language Models Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier
NeurIPS 2024 Density-Based User Representation Using Gaussian Process Regression for Multi-Interest Personalized Retrieval Haolun Wu, Ofer Meshi, Masrour Zoghi, Fernando Diaz, Xue Liu, Craig Boutilier, Maryam Karimzadehgan
NeurIPS 2024 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, Chih-Wei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier
ICMLW 2024 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, Chih-Wei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier
NeurIPS 2024 Embedding-Aligned Language Models Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Lior Shani, Ethan Liang, Craig Boutilier
IJCAI 2024 Model-Free Preference Elicitation Carlos Martin, Craig Boutilier, Ofer Meshi, Tuomas Sandholm
AAAI 2024 Recommender Ecosystems: A Mechanism Design Perspective on Holistic Modeling and Optimization Craig Boutilier, Martin Mladenov, Guy Tennenholtz
ICLR 2023 A Mixture-of-Expert Approach to RL-Based Dialogue Management Yinlam Chow, Azamat Tulepbergenov, Ofir Nachum, Dhawal Gupta, Moonkyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier
NeurIPS 2023 DPOK: Reinforcement Learning for Fine-Tuning Text-to-Image Diffusion Models Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee
NeurIPSW 2023 Model-Free Preference Elicitation Carlos Martin, Craig Boutilier, Ofer Meshi
NeurIPS 2023 Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management Dhawal Gupta, Yinlam Chow, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier
ICMLW 2023 Preference Elicitation for Music Recommendations Ofer Meshi, Jon Feldman, Li Yang, Ben Scheetz, Yanli Cai, Mohammadhossein Bateni, Corbyn Salisbury, Vikram Aggarwal, Craig Boutilier
ICML 2023 Reinforcement Learning with History Dependent Dynamic Contexts Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier
AISTATS 2022 Thompson Sampling with a Mixture Prior Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier
NeurIPSW 2022 A Mixture-of-Expert Approach to RL-Based Dialogue Management Yinlam Chow, Azamat Tulepbergenov, Ofir Nachum, Dhawal Gupta, Moonkyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier
IJCAI 2022 IMO3: Interactive Multi-Objective Off-Policy Optimization Nan Wang, Hongning Wang, Maryam Karimzadehgan, Branislav Kveton, Craig Boutilier
AAAI 2022 Subjective Attributes in Conversational Recommendation Systems: Challenges and Opportunities Filip Radlinski, Craig Boutilier, Deepak Ramachandran, Ivan Vendrov
ICML 2021 Meta-Thompson Sampling Branislav Kveton, Mikhail Konobeev, Manzil Zaheer, Chih-Wei Hsu, Martin Mladenov, Craig Boutilier, Csaba Szepesvári
IJCAI 2020 BRPO: Batch Residual Policy Optimization Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed H. Chi, Craig Boutilier
ICLR 2020 CAQL: Continuous Action Q-Learning Moonkyung Ryu, Yinlam Chow, Ross Anderson, Christian Tjandraatmadja, Craig Boutilier
ICML 2020 ConQUR: Mitigating Delusional Bias in Deep Q-Learning Dijia Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier
NeurIPS 2020 Differentiable Meta-Learning of Bandit Policies Craig Boutilier, Chih-Wei Hsu, Branislav Kveton, Martin Mladenov, Csaba Szepesvári, Manzil Zaheer
AAAI 2020 Gradient-Based Optimization for Bayesian Preference Elicitation Ivan Vendrov, Tyler Lu, Qingqing Huang, Craig Boutilier
NeurIPS 2020 Latent Bandits Revisited Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Craig Boutilier
ICML 2020 Optimizing Long-Term Social Welfare in Recommender Systems: A Constrained Matching Approach Martin Mladenov, Elliot Creager, Omer Ben-Porat, Kevin Swersky, Richard Zemel, Craig Boutilier
AISTATS 2020 Randomized Exploration in Generalized Linear Bandits Branislav Kveton, Manzil Zaheer, Csaba Szepesvári, Lihong Li, Mohammad Ghavamzadeh, Craig Boutilier
IJCAI 2019 Advantage Amplification in Slowly Evolving Latent-State Environments Martin Mladenov, Ofer Meshi, Jayden Ooi, Dale Schuurmans, Craig Boutilier
UAI 2019 Perturbed-History Exploration in Stochastic Linear Bandits Branislav Kveton, Csaba Szepesvári, Mohammad Ghavamzadeh, Craig Boutilier
IJCAI 2019 Perturbed-History Exploration in Stochastic Multi-Armed Bandits Branislav Kveton, Csaba Szepesvári, Mohammad Ghavamzadeh, Craig Boutilier
IJCAI 2019 SlateQ: A Tractable Decomposition for Reinforcement Learning with Recommendation Sets Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Tushar Chandra, Craig Boutilier
NeurIPS 2018 Data Center Cooling Using Model-Predictive Control Nevena Lazic, Craig Boutilier, Tyler Lu, Eehern Wong, Binz Roy, MK Ryu, Greg Imwalle
NeurIPS 2018 Non-Delusional Q-Learning and Value-Iteration Tyler Lu, Dale Schuurmans, Craig Boutilier
IJCAI 2018 Planning and Learning with Stochastic Action Sets Craig Boutilier, Alon Cohen, Avinatan Hassidim, Yishay Mansour, Ofer Meshi, Martin Mladenov, Dale Schuurmans
IJCAI 2017 Logistic Markov Decision Processes Martin Mladenov, Craig Boutilier, Dale Schuurmans, Ofer Meshi, Gal Elidan, Tyler Lu
IJCAI 2017 Multiple-Profile Prediction-of-Use Games Andrew Perrault, Craig Boutilier
UAI 2016 Budget Allocation Using Weakly Coupled, Constrained Markov Decision Processes Craig Boutilier, Tyler Lu
IJCAI 2015 Approximately Stable Pricing for Coordinated Purchasing of Electricity Andrew Perrault, Craig Boutilier
AAAI 2015 The Pricing War Continues: On Competitive Multi-Item Pricing Omer Lev, Joel Oren, Craig Boutilier, Jeffrey S. Rosenschein
AAAI 2015 Value-Directed Compression of Large-Scale Assignment Problems Tyler Lu, Craig Boutilier
AAAI 2014 A Game-Theoretic Analysis of Catalog Optimization Joel Oren, Nina Narodytska, Craig Boutilier
JMLR 2014 Effective Sampling and Learning for Mallows Models with Pairwise-Preference Data Tyler Lu, Craig Boutilier
AAAI 2014 Preference Elicitation and Interview Minimization in Stable Matchings Joanna Drummond, Craig Boutilier
AAAI 2014 Regret-Based Optimization and Preference Elicitation for Stackelberg Security Games with Uncertainty Thanh Hong Nguyen, Amulya Yadav, Bo An, Milind Tambe, Craig Boutilier
AAAI 2014 Robust Winners and Winner Determination Policies Under Candidate Uncertainty Craig Boutilier, Jérôme Lang, Joel Oren, Héctor Palacios
IJCAI 2013 Analysis and Optimization of Multi-Dimensional Percentile Mechanisms Xin Sui, Craig Boutilier, Tuomas Sandholm
IJCAI 2013 Efficient Vote Elicitation Under Candidate Uncertainty Joel Oren, Yuval Filmus, Craig Boutilier
IJCAI 2013 Elicitation and Approximately Stable Matching with Partial Preferences Joanna Drummond, Craig Boutilier
IJCAI 2013 Multi-Dimensional Single-Peaked Consistency and Its Approximations Xin Sui, Alex Francois-Nienaber, Craig Boutilier
IJCAI 2013 Multi-Winner Social Choice with Incomplete Preferences Tyler Lu, Craig Boutilier
AAAI 2013 On the Value of Using Group Discounts Under Price Competition Reshef Meir, Tyler Lu, Moshe Tennenholtz, Craig Boutilier
AAAI 2012 A Dynamic Rationalization of Distance Rationalizability Craig Boutilier, Ariel D. Procaccia
ICML 2012 Active Learning for Matching Problems Laurent Charlin, Richard S. Zemel, Craig Boutilier
UAI 2012 Bayesian Vote Manipulation: Optimal Strategies and Impact on Welfare Tyler Lu, Pingzhong Tang, Ariel D. Procaccia, Craig Boutilier
UAI 2011 A Framework for Optimizing Paper Matching Laurent Charlin, Richard S. Zemel, Craig Boutilier
IJCAI 2011 Budgeted Social Choice: From Consensus to Personalized Decision Making Tyler Lu, Craig Boutilier
AAAI 2011 Efficiency and Privacy Tradeoffs in Mechanism Design Xin Sui, Craig Boutilier
IJCAI 2011 Eliciting Additive Reward Functions for Markov Decision Processes Kevin Regan, Craig Boutilier
ICML 2011 Learning Mallows Models with Pairwise Preferences Tyler Lu, Craig Boutilier
AAAI 2011 Recommendation Sets and Choice Queries: There Is No Exploration/Exploitation Tradeoff! Paolo Viappiani, Craig Boutilier
IJCAI 2011 Robust Approximation and Incremental Elicitation in Voting Protocols Tyler Lu, Craig Boutilier
IJCAI 2011 Robust Online Optimization of Reward-Uncertain MDPs Kevin Regan, Craig Boutilier
AAAI 2010 Automated Channel Abstraction for Advertising Auctions William E. Walsh, Craig Boutilier, Tuomas Sandholm, Rob Shields, George L. Nemhauser, David C. Parkes
NeurIPS 2010 Optimal Bayesian Recommendation Sets and Myopically Optimal Choice Query Sets Paolo Viappiani, Craig Boutilier
AAAI 2010 Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies Kevin Regan, Craig Boutilier
AAAI 2010 Simultaneous Elicitation of Preference Features and Utility Craig Boutilier, Kevin Regan, Paolo Viappiani
IJCAI 2009 IJCAI 2009, Proceedings of the 21st International Joint Conference on Artificial Intelligence, Pasadena, California, USA, July 11-17, 2009 Craig Boutilier
ICML 2009 Online Feature Elicitation in Interactive Optimization Craig Boutilier, Kevin Regan, Paolo Viappiani
UAI 2009 Regret-Based Reward Elicitation for Markov Decision Processes Kevin Regan, Craig Boutilier
AAAI 2008 Computing Reserve Prices and Identifying the Value Distribution in Real-World Auctions with Market Disruptions William E. Walsh, David C. Parkes, Tuomas Sandholm, Craig Boutilier
AAAI 2008 Expressive Banner Ad Auctions and Model-Based Online Optimization for Clearing Craig Boutilier, David C. Parkes, Tuomas Sandholm, William E. Walsh
UAI 2008 Toward Experiential Utility Elicitation for Interface Customization Bowen Hui, Craig Boutilier
IJCAI 2007 Automated Design of Multistage Mechanisms Tuomas Sandholm, Vincent Conitzer, Craig Boutilier
IJCAI 2007 Coalitional Bargaining with Agent Type Uncertainty Georgios Chalkiadakis, Craig Boutilier
AAAI 2007 Computing Optimal Subsets Maxim Binshtok, Ronen I. Brafman, Solomon Eyal Shimony, Ajay Mani, Craig Boutilier
IJCAI 2007 Mechanism Design with Partial Revelation Nathanael Hyafil, Craig Boutilier
UAI 2007 Minimax Regret Based Elicitation of Generalized Additive Utilities Darius Braziunas, Craig Boutilier
AAAI 2007 Partial Revelation Automated Mechanism Design Nathanael Hyafil, Craig Boutilier
UAI 2006 Practical Linear Value-Approximation Techniques for First-Order MDPs Scott Sanner, Craig Boutilier
AAAI 2006 Preference Elicitation and Generalized Additive Utility Darius Braziunas, Craig Boutilier
AAAI 2006 Regret-Based Incremental Partial Revelation Mechanisms Nathanael Hyafil, Craig Boutilier
IJCAI 2005 A Decision-Theoretic Approach to Task Assistance for Persons with Dementia Jennifer Boger, Pascal Poupart, Jesse Hoey, Craig Boutilier, Geoff R. Fernie, Alex Mihailidis
UAI 2005 Approximate Linear Programming for First-Order MDPs Scott Sanner, Craig Boutilier
UAI 2005 Local Utility Elicitation in GAI Models Darius Braziunas, Craig Boutilier
AAAI 2005 New Approaches to Optimization and Utility Elicitation in Autonomic Computing Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh
IJCAI 2005 Regret-Based Utility Elicitation in Constraint-Based Decision Problems Craig Boutilier, Relu Patrascu, Pascal Poupart, Dale Schuurmans
JAIR 2004 CP-Nets: A Tool for Representing and Reasoning with Conditional Ceteris Paribus Preference Statements Craig Boutilier, Ronen I. Brafman, Carmel Domshlak, Holger H. Hoos, David Poole
AAAI 2004 Eliciting Bid Taker Non-Price Preferences in (Combinatorial) Auctions Craig Boutilier, Tuomas Sandholm, Rob Shields
UAI 2004 Regret Minimizing Equilibria and Mechanisms for Games with Strict Type Uncertainty Nathanael Hyafil, Craig Boutilier
AAAI 2004 Stochastic Local Search for POMDP Controllers Darius Braziunas, Craig Boutilier
NeurIPS 2004 VDCBPI: An Approximate Scalable Algorithm for Large POMDPs Pascal Poupart, Craig Boutilier
IJCAI 2003 A Bayesian Approach to Imitation in Reinforcement Learning Bob Price, Craig Boutilier
JAIR 2003 Accelerating Reinforcement Learning Through Implicit Imitation Bob Price, Craig Boutilier
UAI 2003 Active Collaborative Filtering Craig Boutilier, Richard S. Zemel, Benjamin M. Marlin
AISTATS 2003 An Active Approach to Collaborative Filtering Richard S. Zemel, Craig Boutilier
NeurIPS 2003 Bounded Finite State Controllers Pascal Poupart, Craig Boutilier
UAI 2003 Cooperative Negotiation in Autonomic Systems Using Incremental Utility Elicitation Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh
IJCAI 2003 Incremental Utility Elicitation with the Minimax Regret Decision Criterion Tianhan Wang, Craig Boutilier
IJCAI 2003 On the Foundations of Expected Expected Utility Craig Boutilier
IJCAI 2003 Towards Cooperative Negotiation for Decentralized Resource Allocation in Autonomic Computing Systems Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, William E. Walsh
AAAI 2002 A POMDP Formulation of Preference Elicitation Problems Craig Boutilier
AAAI 2002 Greedy Linear Value-Approximation for Factored Markov Decision Processes Relu Patrascu, Pascal Poupart, Dale Schuurmans, Craig Boutilier, Carlos Guestrin
AAAI 2002 Piecewise Linear Value Function Approximation for Factored MDPs Pascal Poupart, Craig Boutilier, Relu Patrascu, Dale Schuurmans
AAAI 2002 Solving Concisely Expressed Combinatorial Auction Problems Craig Boutilier
NeurIPS 2002 Value-Directed Compression of POMDPs Pascal Poupart, Craig Boutilier
IJCAI 2001 Bidding Languages for Combinatorial Auctions Craig Boutilier, Holger H. Hoos
JAIR 2001 Partial-Order Planning with Concurrent Interacting Actions Craig Boutilier, Ronen I. Brafman
IJCAI 2001 Symbolic Dynamic Programming for First-Order MDPs Craig Boutilier, Raymond Reiter, Bob Price
UAI 2001 UCP-Networks: A Directed Graphical Representation of Conditional Utilities Craig Boutilier, Fahiem Bacchus, Ronen I. Brafman
UAI 2001 Value-Directed Sampling Methods for POMDPs Pascal Poupart, Luis E. Ortiz, Craig Boutilier
UAI 2001 Vector-Space Analysis of Belief-State Approximation for POMDPs Pascal Poupart, Craig Boutilier
NeurIPS 2000 APRICODD: Approximate Policy Construction Using Decision Diagrams Robert St-Aubin, Jesse Hoey, Craig Boutilier
UAI 2000 Approximately Optimal Monitoring of Plan Preconditions Craig Boutilier
AAAI 2000 Decision Making Under Uncertainty: Operations Research Meets AI (Again) Craig Boutilier
AAAI 2000 Decision-Theoretic, High-Level Agent Programming in the Situation Calculus Craig Boutilier, Raymond Reiter, Mikhail Soutchanski, Sebastian Thrun
AAAI 2000 Solving Combinatorial Auctions Using Stochastic Local Search Holger H. Hoos, Craig Boutilier
UAI 2000 UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30 - July 3, 2000 Craig Boutilier, Moisés Goldszmidt
UAI 2000 Value-Directed Belief State Approximation for POMDPs Pascal Poupart, Craig Boutilier
UAI 1999 Continuous Value Function Approximation for Sequential Bidding Policies Craig Boutilier, Moisés Goldszmidt, Bikash Sabata
JAIR 1999 Decision-Theoretic Planning: Structural Assumptions and Computational Leverage Craig Boutilier, Thomas L. Dean, Steve Hanks
ICML 1999 Implicit Imitation in Multiagent Reinforcement Learning Bob Price, Craig Boutilier
UAI 1999 Reasoning with Conditional Ceteris Paribus Preference Statements Craig Boutilier, Ronen I. Brafman, Holger H. Hoos, David Poole
UAI 1999 SPUDD: Stochastic Planning Using Decision Diagrams Jesse Hoey, Robert St-Aubin, Alan J. Hu, Craig Boutilier
IJCAI 1999 Sequential Auctions for the Allocation of Resources with Complementarities Craig Boutilier, Moisés Goldszmidt, Bikash Sabata
IJCAI 1999 Sequential Optimality and Coordination in Multiagent Systems Craig Boutilier
AAAI 1998 Belief Revision with Unreliable Observations Craig Boutilier, Nir Friedman, Joseph Y. Halpern
UAI 1998 Hierarchical Solution of Markov Decision Processes Using Macro-Actions Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kaelbling, Thomas L. Dean, Craig Boutilier
AAAI 1998 Solving Very Large Weakly Coupled Markov Decision Processes Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, Leonid Peshkin, Leslie Pack Kaelbling, Thomas L. Dean, Craig Boutilier
UAI 1998 Structured Reachability Analysis for Markov Decision Processes Craig Boutilier, Ronen I. Brafman, Christopher W. Geib
AAAI 1998 The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems Caroline Claus, Craig Boutilier
UAI 1997 Correlated Action Effects in Decision Theoretic Regression Craig Boutilier
AAAI 1997 Planning with Concurrent Interacting Actions Craig Boutilier, Ronen I. Brafman
IJCAI 1997 Prioritized Goal Decomposition of Markov Decision Processes: Toward a Synthesis of Classical and Decision Theoretic Planning Craig Boutilier, Ronen I. Brafman, Christopher W. Geib
UAI 1997 Structured Arc Reversal and Simulation of Dynamic Probabilistic Networks Adrian Y. W. Cheuk, Craig Boutilier
AAAI 1997 Structured Solution Methods for Non-Markovian Decision Processes Fahiem Bacchus, Craig Boutilier, Adam J. Grove
ICML 1996 Approximate Value Trees in Structured Dynamic Programming Craig Boutilier, Richard Dearden
AAAI 1996 Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations Craig Boutilier, David Poole
UAI 1996 Context-Specific Independence in Bayesian Networks Craig Boutilier, Nir Friedman, Moisés Goldszmidt, Daphne Koller
UAI 1996 Learning Conventions in Multiagent Stochastic Domains Using Likelihood Estimates Craig Boutilier
AAAI 1996 Rewarding Behaviors Fahiem Bacchus, Craig Boutilier, Adam J. Grove
IJCAI 1995 Exploiting Structure in Policy Construction Craig Boutilier, Richard Dearden, Moisés Goldszmidt
IJCAI 1995 Generalized Update: Belief Change in Dynamic Settings Craig Boutilier
IJCAI 1995 Process-Oriented Planning and Average-Reward Optimality Craig Boutilier, Martin L. Puterman
UAI 1994 Integrating Planning and Execution in Stochastic Domains Richard Dearden, Craig Boutilier
AAAI 1994 Using Abstractions for Decision-Theoretic Planning with Time Constraints Craig Boutilier, Richard Dearden
AAAI 1993 Abduction as Belief Revision: A Model of Preferred Explanations Craig Boutilier, Verónica Becher
IJCAI 1993 Revision Sequences and Nested Conditionals Craig Boutilier
AAAI 1993 Revision by Conditional Beliefs Craig Boutilier, Moisés Goldszmidt
UAI 1993 The Probability of a Possibility: Adding Uncertainty to Default Rules Craig Boutilier
AAAI 1992 A Logic for Revision and Subjunctive Queries Craig Boutilier
UAI 1992 Modal Logics for Qualitative Possibility and Beliefs Craig Boutilier
IJCAI 1991 Inaccessible Worlds and Irrelevance: Preliminary Report Craig Boutilier
AAAI 1990 Conditional Logics of Normality as Modal Systems Craig Boutilier
IJCAI 1989 A Semantical Approach to Stable Inheritance Reasoning Craig Boutilier