RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning

Abstract

We propose RetailNet, an end-to-end reinforcement learning (RL)-based neural network, to achieve efficient selling strategies for perishable products in order to maximize retailers’ long-term profit. We design Pair-wise Multi-Q network for Q value estimation to model each state-action pair and to capture the interdependence between actions. Generalized Advantage Estimation (GAE)and Entropy are incorporated into the loss function for balancing the tradeoff between exploitation and exploration. Experiments show that Re-tailNet efficiently produces the near-optimal solution, providing practitioners valuable guidance on their inventory replenishment, pricing, and products display strategies in the retailing industry.

Cite

Text

Ma et al. "RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning." ICML 2019 Workshops: RL4RealLife, 2019.

Markdown

[Ma et al. "RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning." ICML 2019 Workshops: RL4RealLife, 2019.](https://mlanthology.org/icmlw/2019/ma2019icmlw-retailnet/)

BibTeX

@inproceedings{ma2019icmlw-retailnet,
  title     = {{RetailNet: Enhancing Retails of Perishable Products with Multiple Selling Strategies via Pair-Wise Multi-Q Learning}},
  author    = {Ma, Xiyao and Lu, Fan and Pan, Xiajun Amy and Zhou, Yanlin and Li, Xiaolin Andy},
  booktitle = {ICML 2019 Workshops: RL4RealLife},
  year      = {2019},
  url       = {https://mlanthology.org/icmlw/2019/ma2019icmlw-retailnet/}
}