SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-View Total Correlation
Abstract
The advent of abundant image data has catalyzed the advancement of visual control in reinforcement learning (RL) systems, leveraging multiple view- points to capture the same physical states, which could enhance control performance theoretically. However, integrating multi-view data into representation learning remains challenging. In this paper, we introduce SMuCo, an innovative multi-view reinforcement learning algorithm that constructs robust latent representations by optimizing multi- view sequential total correlation. This technique effectively captures task-relevant information and temporal dynamics while filtering out irrelevant data. Our method supports an unlimited number of views and demonstrates superior performance over leading model-free and model-based RL algorithms. Empirical results from the DeepMind Control Suite and the Sapien Basic Manipulation Task confirm SMuCo’s enhanced efficacy, significantly improving task performance across diverse scenarios and views.
Cite
Text
Cheng et al. "SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-View Total Correlation." Uncertainty in Artificial Intelligence, 2024.Markdown
[Cheng et al. "SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-View Total Correlation." Uncertainty in Artificial Intelligence, 2024.](https://mlanthology.org/uai/2024/cheng2024uai-smuco/)BibTeX
@inproceedings{cheng2024uai-smuco,
title = {{SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-View Total Correlation}},
author = {Cheng, Tong and Dong, Hang and Wang, Lu and Qiao, Bo and Lin, Qingwei and Rajmohan, Saravan and Moscibroda, Thomas},
booktitle = {Uncertainty in Artificial Intelligence},
year = {2024},
pages = {698-717},
volume = {244},
url = {https://mlanthology.org/uai/2024/cheng2024uai-smuco/}
}