Reconstruction Weighting Principal Component Analysis with Fusion Contrastive Learning
Abstract
Reinforcement Learning (RL) is a sampling-based method for sequential decision-making, in which a learning agent iteratively converges toward an optimal policy by leveraging feedback from the environment in the form of scalar reward signals. While timing information is often abstracted in discrete-time domains, time-critical learning applications—such as queuing systems, population processes, and manufacturing systems—are naturally modeled as Continuous-Time Markov Decision Processes (CTMDPs). Since the seminal work of Bradtke and Duff, model-free RL for CTMDPs has become well-understood. However, in many practical applications, practitioners possess high-quality information about system rates derived from traditional queuing theory, which learning agents could potentially exploit to accelerate convergence. Despite this, classical RL algorithms for CTMDPs typically re-learn these parameters through sampling. In this work, we propose continuous-time reward machines (CTRMs), a novel framework that embeds reward functions and real-time state-action dynamics into a unified structure. CTRMs enable RL agents to effectively navigate dense-time environments while leveraging reward shaping and counterfactual experiences for accelerated learning. Our empirical results demonstrate CTRMs' ability to improve learning efficiency in time-critical environments.
Cite
Text
Wang et al. "Reconstruction Weighting Principal Component Analysis with Fusion Contrastive Learning." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/563Markdown
[Wang et al. "Reconstruction Weighting Principal Component Analysis with Fusion Contrastive Learning." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/wang2024ijcai-reconstruction/) doi:10.24963/ijcai.2024/563BibTeX
@inproceedings{wang2024ijcai-reconstruction,
title = {{Reconstruction Weighting Principal Component Analysis with Fusion Contrastive Learning}},
author = {Wang, Qianqian and Liu, Meiling and Feng, Wei and Jiang, Mengping and Xu, Haiming and Gao, Quanxue},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {5091-5098},
doi = {10.24963/ijcai.2024/563},
url = {https://mlanthology.org/ijcai/2024/wang2024ijcai-reconstruction/}
}