ML Anthology
Authors
Search
About
Wen, Zheng
48 publications
ICMLW
2024
RLHF and IIA: Perverse Incentives
Wanqiao Xu
,
Shi Dong
,
Xiuyuan Lu
,
Grace Lam
,
Zheng Wen
,
Benjamin Van Roy
UAI
2023
Approximate Thompson Sampling via Epistemic Neural Networks
Ian Osband
,
Zheng Wen
,
Seyed Mohammad Asghari
,
Vikranth Dwaracherla
,
Morteza Ibrahimi
,
Xiuyuan Lu
,
Benjamin Van Roy
TMLR
2023
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale
Botao Hao
,
Rahul Jain
,
Dengwang Tang
,
Zheng Wen
TMLR
2023
Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Vikranth Dwaracherla
,
Zheng Wen
,
Ian Osband
,
Xiuyuan Lu
,
Seyed Mohammad Asghari
,
Benjamin Van Roy
NeurIPS
2023
Epistemic Neural Networks
Ian Osband
,
Zheng Wen
,
Seyed Mohammad Asghari
,
Vikranth Dwaracherla
,
Morteza Ibrahimi
,
Xiuyuan Lu
,
Benjamin Van Roy
ICML
2023
Leveraging Demonstrations to Improve Online Learning: Quality Matters
Botao Hao
,
Rahul Jain
,
Tor Lattimore
,
Benjamin Van Roy
,
Zheng Wen
FnTML
2023
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
,
Benjamin Van Roy
,
Vikranth Dwaracherla
,
Morteza Ibrahimi
,
Ian Osband
,
Zheng Wen
NeurIPS
2022
An Analysis of Ensemble Sampling
Chao Qin
,
Zheng Wen
,
Xiuyuan Lu
,
Benjamin Van Roy
UAI
2022
Evaluating High-Order Predictive Distributions in Deep Learning
Ian Osband
,
Zheng Wen
,
Seyed Mohammad Asghari
,
Vikranth Dwaracherla
,
Xiuyuan Lu
,
Benjamin Van Roy
ICLR
2022
Neural Contextual Bandits with Deep Representation and Shallow Exploration
Pan Xu
,
Zheng Wen
,
Handong Zhao
,
Quanquan Gu
ICLRW
2022
Teamwork Reinforcement Learning with Concave Utilities
Zheng Yu
,
Junyu Zhang
,
Zheng Wen
,
Andrea Tacchetti
,
Mengdi Wang
,
Ian Gemp
NeurIPS
2022
The Neural Testbed: Evaluating Joint Predictions
Ian Osband
,
Zheng Wen
,
Seyed Mohammad Asghari
,
Vikranth Dwaracherla
,
Xiuyuan Lu
,
Morteza Ibrahimi
,
Dieterich Lawson
,
Botao Hao
,
Brendan O'Donoghue
,
Benjamin Van Roy
ICML
2021
Joint Online Learning and Decision-Making via Dual Mirror Descent
Alfonso Lobos
,
Paul Grigas
,
Zheng Wen
ICML
2020
Budgeted Online Influence Maximization
Pierre Perrault
,
Jennifer Healey
,
Zheng Wen
,
Michal Valko
ICML
2020
Graphical Models Meet Bandits: A Variational Thompson Sampling Approach
Tong Yu
,
Branislav Kveton
,
Zheng Wen
,
Ruiyi Zhang
,
Ole J. Mengshoel
ICLR
2020
Hypermodels for Exploration
Vikranth Dwaracherla
,
Xiuyuan Lu
,
Morteza Ibrahimi
,
Ian Osband
,
Zheng Wen
,
Benjamin Van Roy
AISTATS
2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
,
Changyou Chen
,
Zhe Gan
,
Zheng Wen
,
Wenlin Wang
,
Lawrence Carin
NeurIPS
2020
On Efficiency in Hierarchical Reinforcement Learning
Zheng Wen
,
Doina Precup
,
Morteza Ibrahimi
,
Andre Barreto
,
Benjamin Van Roy
,
Satinder P. Singh
AAAI
2020
Stochastic Online Learning with Probabilistic Graph Feedback
Shuai Li
,
Wei Chen
,
Zheng Wen
,
Kwong-Sak Leung
ICML
2020
Structured Policy Iteration for Linear Quadratic Regulator
Youngsuk Park
,
Ryan Rossi
,
Zheng Wen
,
Gang Wu
,
Handong Zhao
NeurIPS
2019
Bootstrapping Upper Confidence Bound
Botao Hao
,
Yasin Abbasi Yadkori
,
Zheng Wen
,
Guang Cheng
UAI
2019
Cascading Linear Submodular Bandits: Accounting for Position Bias and Diversity in Online Learning to Rank
Gaurush Hiranandani
,
Harvineet Singh
,
Prakhar Gupta
,
Iftikhar Ahamath Burhanuddin
,
Zheng Wen
,
Branislav Kveton
AISTATS
2019
Conservative Exploration Using Interleaving
Sumeet Katariya
,
Branislav Kveton
,
Zheng Wen
,
Vamsi K. Potluru
JMLR
2019
Deep Exploration via Randomized Value Functions
Ian Osband
,
Benjamin Van Roy
,
Daniel J. Russo
,
Zheng Wen
ICML
2019
Garbage in, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Branislav Kveton
,
Csaba Szepesvari
,
Sharan Vaswani
,
Zheng Wen
,
Tor Lattimore
,
Mohammad Ghavamzadeh
AISTATS
2019
Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit
Yang Cao
,
Zheng Wen
,
Branislav Kveton
,
Yao Xie
AISTATS
2019
Scalable Thompson Sampling via Optimal Transport
Ruiyi Zhang
,
Zheng Wen
,
Changyou Chen
,
Chen Fang
,
Tong Yu
,
Lawrence Carin
FnTML
2018
A Tutorial on Thompson Sampling
Daniel Russo
,
Benjamin Van Roy
,
Abbas Kazerouni
,
Ian Osband
,
Zheng Wen
NeurIPS
2018
Scalar Posterior Sampling with Applications
Georgios Theocharous
,
Zheng Wen
,
Yasin Abbasi Yadkori
,
Nikos Vlassis
ECML-PKDD
2018
SpectralLeader: Online Spectral Learning for Single Topic Models
Tong Yu
,
Branislav Kveton
,
Zheng Wen
,
Hung Bui
,
Ole J. Mengshoel
IJCAI
2017
Bernoulli Rank-1 Bandits for Click Feedback
Sumeet Katariya
,
Branislav Kveton
,
Csaba Szepesvári
,
Claire Vernade
,
Zheng Wen
ICML
2017
Model-Independent Online Learning for Influence Maximization
Sharan Vaswani
,
Branislav Kveton
,
Zheng Wen
,
Mohammad Ghavamzadeh
,
Laks V. S. Lakshmanan
,
Mark Schmidt
NeurIPS
2017
Online Influence Maximization Under Independent Cascade Model with Semi-Bandit Feedback
Zheng Wen
,
Branislav Kveton
,
Michal Valko
,
Sharan Vaswani
ICML
2017
Online Learning to Rank in Stochastic Click Models
Masrour Zoghi
,
Tomas Tunys
,
Mohammad Ghavamzadeh
,
Branislav Kveton
,
Csaba Szepesvari
,
Zheng Wen
AISTATS
2017
Stochastic Rank-1 Bandits
Sumeet Katariya
,
Branislav Kveton
,
Csaba Szepesvári
,
Claire Vernade
,
Zheng Wen
UAI
2016
Cascading Bandits for Large-Scale Recommendation Problems
Shi Zong
,
Hao Ni
,
Kenny Sung
,
Nan Rosemary Ke
,
Zheng Wen
,
Branislav Kveton
ICML
2016
DCM Bandits: Learning to Rank with Multiple Clicks
Sumeet Katariya
,
Branislav Kveton
,
Csaba Szepesvari
,
Zheng Wen
ICML
2016
Generalization and Exploration via Randomized Value Functions
Ian Osband
,
Benjamin Van Roy
,
Zheng Wen
ICML
2015
Cascading Bandits: Learning to Rank in the Cascade Model
Branislav Kveton
,
Csaba Szepesvari
,
Zheng Wen
,
Azin Ashkan
NeurIPS
2015
Combinatorial Cascading Bandits
Branislav Kveton
,
Zheng Wen
,
Azin Ashkan
,
Csaba Szepesvari
ICML
2015
Efficient Learning in Large-Scale Combinatorial Semi-Bandits
Zheng Wen
,
Branislav Kveton
,
Azin Ashkan
IJCAI
2015
Optimal Greedy Diversity for Recommendation
Azin Ashkan
,
Branislav Kveton
,
Shlomo Berkovsky
,
Zheng Wen
AISTATS
2015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits
Branislav Kveton
,
Zheng Wen
,
Azin Ashkan
,
Csaba Szepesvári
AAAI
2014
Large-Scale Optimistic Adaptive Submodularity
Victor Gabillon
,
Branislav Kveton
,
Zheng Wen
,
Brian Eriksson
,
S. Muthukrishnan
UAI
2014
Matroid Bandits: Fast Combinatorial Optimization with Learning
Branislav Kveton
,
Zheng Wen
,
Azin Ashkan
,
Hoda Eydgahi
,
Brian Eriksson
NeurIPS
2013
Adaptive Submodular Maximization in Bandit Setting
Victor Gabillon
,
Branislav Kveton
,
Zheng Wen
,
Brian Eriksson
,
S. Muthukrishnan
NeurIPS
2013
Efficient Exploration and Value Function Generalization in Deterministic Systems
Zheng Wen
,
Benjamin Van Roy
ICML
2013
Sequential Bayesian Search
Zheng Wen
,
Branislav Kveton
,
Brian Eriksson
,
Sandilya Bhamidipati