Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation
Abstract
Multi-Armed Bandit (MAB) framework has been successfully applied in many web applications. However, many complex real-world applications that involve multiple content recommendations cannot fit into the traditional MAB setting. To address this issue, we consider an ordered combinatorial semi-bandit problem where the learner recommends S actions from a base set of K actions, and displays the results in S (out of M) different positions. The aim is to maximize the cumulative reward with respect to the best possible subset and positions in hindsight. By the adaptation of a minimum-cost maximum-flow network, a practical algorithm based on Thompson sampling is derived for the (contextual) combinatorial problem, thus resolving the problem of computational intractability.With its potential to work with whole-page recommendation and any probabilistic models, to illustrate the effectiveness of our method, we focus on Gaussian process optimization and a contextual setting where click-through rate is predicted using logistic regression. We demonstrate the algorithms’ performance on synthetic Gaussian process problems and on large-scale news article recommendation datasets from Yahoo! Front Page Today Module.
Cite
Text
Wang et al. "Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10939Markdown
[Wang et al. "Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/wang2017aaai-efficient/) doi:10.1609/AAAI.V31I1.10939BibTeX
@inproceedings{wang2017aaai-efficient,
title = {{Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation}},
author = {Wang, Yingfei and Ouyang, Hua and Wang, Chu and Chen, Jianhui and Asamov, Tsvetan and Chang, Yi},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2017},
pages = {2746-2753},
doi = {10.1609/AAAI.V31I1.10939},
url = {https://mlanthology.org/aaai/2017/wang2017aaai-efficient/}
}