Learning Diverse Rankings with Multi-Armed Bandits

Radlinski, Filip; Kleinberg, Robert; Joachims, Thorsten

doi:10.1145/1390156.1390255

Learning Diverse Rankings with Multi-Armed Bandits

Filip Radlinski, Robert Kleinberg, Thorsten Joachims

ICML 2008 pp. 784-791

doi:10.1145/1390156.1390255 /icml/2008/radlinski2008icml-learning/

Abstract

Algorithms for learning to rank Web documents usually assume a document's relevance is independent of other documents. This leads to learned ranking functions that produce rankings with redundant results. In contrast, user studies have shown that diversity at high ranks is often preferred. We present two new learning algorithms that directly learn a diverse ranking of documents based on users' clicking behavior. We show that these algorithms minimize abandonment, or alternatively, maximize the probability that a relevant document is found in the top k positions of a ranking. We show that one of our algorithms asymptotically achieves the best possible payoff obtainable in polynomial time even as user's interests change. The other performs better empirically when user interests are static, and is still theoretically near-optimal in that case.

PDF ICML Semantic Scholar

Cite

Text

Radlinski et al. "Learning Diverse Rankings with Multi-Armed Bandits." International Conference on Machine Learning, 2008. doi:10.1145/1390156.1390255

Markdown

[Radlinski et al. "Learning Diverse Rankings with Multi-Armed Bandits." International Conference on Machine Learning, 2008.](https://mlanthology.org/icml/2008/radlinski2008icml-learning/) doi:10.1145/1390156.1390255

BibTeX

@inproceedings{radlinski2008icml-learning,
  title     = {{Learning Diverse Rankings with Multi-Armed Bandits}},
  author    = {Radlinski, Filip and Kleinberg, Robert and Joachims, Thorsten},
  booktitle = {International Conference on Machine Learning},
  year      = {2008},
  pages     = {784-791},
  doi       = {10.1145/1390156.1390255},
  url       = {https://mlanthology.org/icml/2008/radlinski2008icml-learning/}
}