Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator That Effectively Utilizes Online Interventions (Extended Abstract)

Abstract

State-of-the-art Learning to Rank (LTR) methods for optimizing ranking systems based on user interactions are divided into online approaches – that learn by direct interaction – and counterfactual approaches – that learn from historical interactions. We propose a novel intervention-aware estimator to bridge this online/counterfactual division. The estimator corrects for the effect of position bias, trust bias, and item-selection bias by using corrections based on the behavior of the logging policy and on online interventions: changes to the logging policy made during the gathering of click data. Our experimental results show that, unlike existing counterfactual LTR methods, the intervention-aware estimator can greatly benefit from online interventions. To the best of our knowledge, this is the first method that is shown to be highly effective in both online and counterfactual scenarios.

Cite

Text

Oosterhuis and de Rijke. "Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator That Effectively Utilizes Online Interventions (Extended Abstract)." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/656

Markdown

[Oosterhuis and de Rijke. "Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator That Effectively Utilizes Online Interventions (Extended Abstract)." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/oosterhuis2021ijcai-unifying/) doi:10.24963/IJCAI.2021/656

BibTeX

@inproceedings{oosterhuis2021ijcai-unifying,
  title     = {{Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator That Effectively Utilizes Online Interventions (Extended Abstract)}},
  author    = {Oosterhuis, Harrie and de Rijke, Maarten},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {4809-4813},
  doi       = {10.24963/IJCAI.2021/656},
  url       = {https://mlanthology.org/ijcai/2021/oosterhuis2021ijcai-unifying/}
}