Wen, Zheng

48 publications

ICMLW 2024 RLHF and IIA: Perverse Incentives Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy
UAI 2023 Approximate Thompson Sampling via Epistemic Neural Networks Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
TMLR 2023 Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale Botao Hao, Rahul Jain, Dengwang Tang, Zheng Wen
TMLR 2023 Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy
NeurIPS 2023 Epistemic Neural Networks Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
ICML 2023 Leveraging Demonstrations to Improve Online Learning: Quality Matters Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen
FnTML 2023 Reinforcement Learning, Bit by Bit Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen
NeurIPS 2022 An Analysis of Ensemble Sampling Chao Qin, Zheng Wen, Xiuyuan Lu, Benjamin Van Roy
UAI 2022 Evaluating High-Order Predictive Distributions in Deep Learning Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy
ICLR 2022 Neural Contextual Bandits with Deep Representation and Shallow Exploration Pan Xu, Zheng Wen, Handong Zhao, Quanquan Gu
ICLRW 2022 Teamwork Reinforcement Learning with Concave Utilities Zheng Yu, Junyu Zhang, Zheng Wen, Andrea Tacchetti, Mengdi Wang, Ian Gemp
NeurIPS 2022 The Neural Testbed: Evaluating Joint Predictions Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Dieterich Lawson, Botao Hao, Brendan O'Donoghue, Benjamin Van Roy
ICML 2021 Joint Online Learning and Decision-Making via Dual Mirror Descent Alfonso Lobos, Paul Grigas, Zheng Wen
ICML 2020 Budgeted Online Influence Maximization Pierre Perrault, Jennifer Healey, Zheng Wen, Michal Valko
ICML 2020 Graphical Models Meet Bandits: A Variational Thompson Sampling Approach Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel
ICLR 2020 Hypermodels for Exploration Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy
AISTATS 2020 Nested-Wasserstein Self-Imitation Learning for Sequence Generation Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin
NeurIPS 2020 On Efficiency in Hierarchical Reinforcement Learning Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy, Satinder P. Singh
AAAI 2020 Stochastic Online Learning with Probabilistic Graph Feedback Shuai Li, Wei Chen, Zheng Wen, Kwong-Sak Leung
ICML 2020 Structured Policy Iteration for Linear Quadratic Regulator Youngsuk Park, Ryan Rossi, Zheng Wen, Gang Wu, Handong Zhao
NeurIPS 2019 Bootstrapping Upper Confidence Bound Botao Hao, Yasin Abbasi Yadkori, Zheng Wen, Guang Cheng
UAI 2019 Cascading Linear Submodular Bandits: Accounting for Position Bias and Diversity in Online Learning to Rank Gaurush Hiranandani, Harvineet Singh, Prakhar Gupta, Iftikhar Ahamath Burhanuddin, Zheng Wen, Branislav Kveton
AISTATS 2019 Conservative Exploration Using Interleaving Sumeet Katariya, Branislav Kveton, Zheng Wen, Vamsi K. Potluru
JMLR 2019 Deep Exploration via Randomized Value Functions Ian Osband, Benjamin Van Roy, Daniel J. Russo, Zheng Wen
ICML 2019 Garbage in, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits Branislav Kveton, Csaba Szepesvári, Sharan Vaswani, Zheng Wen, Tor Lattimore, Mohammad Ghavamzadeh
AISTATS 2019 Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit Yang Cao, Zheng Wen, Branislav Kveton, Yao Xie
AISTATS 2019 Scalable Thompson Sampling via Optimal Transport Ruiyi Zhang, Zheng Wen, Changyou Chen, Chen Fang, Tong Yu, Lawrence Carin
FnTML 2018 A Tutorial on Thompson Sampling Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen
NeurIPS 2018 Scalar Posterior Sampling with Applications Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Nikos Vlassis
ECML-PKDD 2018 SpectralLeader: Online Spectral Learning for Single Topic Models Tong Yu, Branislav Kveton, Zheng Wen, Hung Bui, Ole J. Mengshoel
IJCAI 2017 Bernoulli Rank-1 Bandits for Click Feedback Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Claire Vernade, Zheng Wen
ICML 2017 Model-Independent Online Learning for Influence Maximization Sharan Vaswani, Branislav Kveton, Zheng Wen, Mohammad Ghavamzadeh, Laks V. S. Lakshmanan, Mark Schmidt
NeurIPS 2017 Online Influence Maximization Under Independent Cascade Model with Semi-Bandit Feedback Zheng Wen, Branislav Kveton, Michal Valko, Sharan Vaswani
ICML 2017 Online Learning to Rank in Stochastic Click Models Masrour Zoghi, Tomas Tunys, Mohammad Ghavamzadeh, Branislav Kveton, Csaba Szepesvári, Zheng Wen
AISTATS 2017 Stochastic Rank-1 Bandits Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Claire Vernade, Zheng Wen
UAI 2016 Cascading Bandits for Large-Scale Recommendation Problems Shi Zong, Hao Ni, Kenny Sung, Nan Rosemary Ke, Zheng Wen, Branislav Kveton
ICML 2016 DCM Bandits: Learning to Rank with Multiple Clicks Sumeet Katariya, Branislav Kveton, Csaba Szepesvári, Zheng Wen
ICML 2016 Generalization and Exploration via Randomized Value Functions Ian Osband, Benjamin Van Roy, Zheng Wen
ICML 2015 Cascading Bandits: Learning to Rank in the Cascade Model Branislav Kveton, Csaba Szepesvári, Zheng Wen, Azin Ashkan
NeurIPS 2015 Combinatorial Cascading Bandits Branislav Kveton, Zheng Wen, Azin Ashkan, Csaba Szepesvári
ICML 2015 Efficient Learning in Large-Scale Combinatorial Semi-Bandits Zheng Wen, Branislav Kveton, Azin Ashkan
IJCAI 2015 Optimal Greedy Diversity for Recommendation Azin Ashkan, Branislav Kveton, Shlomo Berkovsky, Zheng Wen
AISTATS 2015 Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits Branislav Kveton, Zheng Wen, Azin Ashkan, Csaba Szepesvári
AAAI 2014 Large-Scale Optimistic Adaptive Submodularity Victor Gabillon, Branislav Kveton, Zheng Wen, Brian Eriksson, S. Muthukrishnan
UAI 2014 Matroid Bandits: Fast Combinatorial Optimization with Learning Branislav Kveton, Zheng Wen, Azin Ashkan, Hoda Eydgahi, Brian Eriksson
NeurIPS 2013 Adaptive Submodular Maximization in Bandit Setting Victor Gabillon, Branislav Kveton, Zheng Wen, Brian Eriksson, S. Muthukrishnan
NeurIPS 2013 Efficient Exploration and Value Function Generalization in Deterministic Systems Zheng Wen, Benjamin Van Roy
ICML 2013 Sequential Bayesian Search Zheng Wen, Branislav Kveton, Brian Eriksson, Sandilya Bhamidipati