ClothPPO: A Proximal Policy Optimization Enhancing Framework for Robotic Cloth Manipulation with Observation-Aligned Action Spaces
Abstract
The increasing demands in analyzing complex associated scenes pose necessities to researching multi-image understanding abilities. Compared with understanding individual images, both the alignments and differences between images are essential aspects of understanding the intricate relationships for multi-image inference tasks. However, existing benchmarks face difficulties in addressing both of these aspects simultaneously, resulting in obstacles to modeling relationships under various granularities and domains of images. In this paper, we introduce M4Bench to enhance the capability of aligning and distinguishing multi-images with multi-domain multi-granularity comparison. We carefully design five comparison tasks related to coarse and fine-grained granularities in single and multiple domains of images and evaluate them on 13 state-of-the-art multi-modal large language models with various sizes. Besides, we analyze the evaluation results and provide several observations and viewpoints for the multi-image understanding research. The data and evaluation code are available at https://github.com/eaglelab-zju/M4Bench.
Cite
Text
Yang et al. "ClothPPO: A Proximal Policy Optimization Enhancing Framework for Robotic Cloth Manipulation with Observation-Aligned Action Spaces." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/762Markdown
[Yang et al. "ClothPPO: A Proximal Policy Optimization Enhancing Framework for Robotic Cloth Manipulation with Observation-Aligned Action Spaces." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/yang2024ijcai-clothppo/) doi:10.24963/ijcai.2024/762BibTeX
@inproceedings{yang2024ijcai-clothppo,
title = {{ClothPPO: A Proximal Policy Optimization Enhancing Framework for Robotic Cloth Manipulation with Observation-Aligned Action Spaces}},
author = {Yang, Libing and Li, Yang and Chen, Long},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {6895-6903},
doi = {10.24963/ijcai.2024/762},
url = {https://mlanthology.org/ijcai/2024/yang2024ijcai-clothppo/}
}