Towards Equilibrium: An Instantaneous Probe-and-Rebalance Multimodal Learning Approach

Yang, Yang; Wu, Xixian; Jiang, Qing-Yuan

doi:10.24963/IJCAI.2025/395

Towards Equilibrium: An Instantaneous Probe-and-Rebalance Multimodal Learning Approach

Yang Yang, Xixian Wu, Qing-Yuan Jiang

IJCAI 2025 pp. 3552-3560

doi:10.24963/IJCAI.2025/395 /ijcai/2025/yang2025ijcai-equilibrium/

Abstract

The multimodal imbalance problem has been extensively studied to prevent the undesirable scenario where multimodal performance falls below that of unimodal models. However, existing methods typically assess the strength of modalities and perform learning simultaneously under the imbalanced status. This deferred strategy fails to rebalance multimodal learning instantaneously, leading to performance degeneration. To address this, we propose a novel multimodal learning approach, termed instantaneous probe-and-rebalance multimodal learning (IPRM), which employs a two-pass forward method to first probe (but not learn) and then perform rebalanced learning under the balanced status. Concretely, we first employ the geodesic multimodal mixup (GMM) to incorporate fusion representation and probe modality strength in the first forward phase. Then the weights are instantaneously recalibrated based on the probed strength, facilitating balanced training via the second forward pass. This process is applied dynamically throughout the entire training process. Extensive experiments reveal that our proposed IPRM outperforms all baselines, achieving state-of-the-art (SOTA) performance on numerous widely used datasets. The code is available at https://github.com/njustkmg/IJCAI25-IPRM.

PDF IJCAI Semantic Scholar

Cite

Text

Yang et al. "Towards Equilibrium: An Instantaneous Probe-and-Rebalance Multimodal Learning Approach." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/395

Markdown

[Yang et al. "Towards Equilibrium: An Instantaneous Probe-and-Rebalance Multimodal Learning Approach." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/yang2025ijcai-equilibrium/) doi:10.24963/IJCAI.2025/395

BibTeX

@inproceedings{yang2025ijcai-equilibrium,
  title     = {{Towards Equilibrium: An Instantaneous Probe-and-Rebalance Multimodal Learning Approach}},
  author    = {Yang, Yang and Wu, Xixian and Jiang, Qing-Yuan},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {3552-3560},
  doi       = {10.24963/IJCAI.2025/395},
  url       = {https://mlanthology.org/ijcai/2025/yang2025ijcai-equilibrium/}
}