Hierarchical Reinforcement Learning for Point of Interest Recommendation
Abstract
Recent research has demonstrated the effectiveness of knowledge distillation in Domain Generalization. However, existing approaches often overlook domain-specific knowledge and rely on an offline distillation strategy, limiting the effectiveness of knowledge transfer. To address these limitations, we propose Balanced Online knowLedge Distillation (BOLD). BOLD leverages a multi-domain expert teacher model, with each expert specializing in a specific source domain, enabling the student to distill both domain-invariant and domain-specific knowledge. We incorporate the Pareto optimization principle and uncertainty weighting to balance these two types of knowledge, ensuring simultaneous optimization without compromising either. Additionally, BOLD employs an online knowledge distillation strategy, allowing the teacher and student to learn concurrently. This dynamic interaction enables the teacher to adapt based on student feedback, facilitating more effective knowledge transfer. Extensive experiments on seven benchmarks demonstrate that BOLD outperforms state-of-the-art methods. Furthermore, we provide theoretical insights that highlight the importance of domain-specific knowledge and the advantages of uncertainty weighting.
Cite
Text
Xiao et al. "Hierarchical Reinforcement Learning for Point of Interest Recommendation." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/272Markdown
[Xiao et al. "Hierarchical Reinforcement Learning for Point of Interest Recommendation." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/xiao2024ijcai-hierarchical/) doi:10.24963/ijcai.2024/272BibTeX
@inproceedings{xiao2024ijcai-hierarchical,
title = {{Hierarchical Reinforcement Learning for Point of Interest Recommendation}},
author = {Xiao, Yanan and Jiang, Lu and Liu, Kunpeng and Xu, Yuanbo and Wang, Pengyang and Yin, Minghao},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {2460-2468},
doi = {10.24963/ijcai.2024/272},
url = {https://mlanthology.org/ijcai/2024/xiao2024ijcai-hierarchical/}
}