Time Series Clustering for Enhanced Dynamic Allocation in A/B Testing

Abstract

An A/B-Test is a method for evaluating online experiments on target items and observing which A/B/C/... variations are better through log reports and statistical analysis of the rewards earned by each variation. Recent advancements in A/B-Tests through reinforcement learning encompass dynamic allocation employing multiarmed bandits (MAB). MABs provides A/B-Tests with fast identification of the best variation (A or B) and helps limit the loss of the test i.e. the cost of exploring low-reward variation. When partial information is available before assigning variations, dynamic allocation is extended to the contextual multiarmed bandit problem (CMAB). Current state-of-the-art approaches for empirically estimating the context-dependent reward function for each variation demonstrate strong performance in limiting test loss and personalized tests. However, few studies have addressed this problem in the context of variable-sized time series. This paper presents a new reinforcement learning methodology to handle A/B-Tests with variable-sized time series as context information. We provide two new methods that obtain a minimization of the cumulative regret with a soft computational cost. This paper also provides numerical results on real A/B-Test datasets, in addition to public data, to demonstrate an improvement over traditional methods.

Cite

Text

Claeys et al. "Time Series Clustering for Enhanced Dynamic Allocation in A/B Testing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70378-2_22

Markdown

[Claeys et al. "Time Series Clustering for Enhanced Dynamic Allocation in A/B Testing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/claeys2024ecmlpkdd-time/) doi:10.1007/978-3-031-70378-2_22

BibTeX

@inproceedings{claeys2024ecmlpkdd-time,
  title     = {{Time Series Clustering for Enhanced Dynamic Allocation in A/B Testing}},
  author    = {Claeys, Emmanuelle and Maumy-Bertrand, Myriam and Gançarski, Pierre},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {351-367},
  doi       = {10.1007/978-3-031-70378-2_22},
  url       = {https://mlanthology.org/ecmlpkdd/2024/claeys2024ecmlpkdd-time/}
}