Meta-Learning Approach for Joint Multimodal Signals with Multimodal Iterative Adaptation

Abstract

In the pursuit of effectively modeling real-world joint multimodal signals, learning to learn multiple Implicit Neural Representations (INRs) jointly has gained attention to overcome data scarcity and enhance fitting speed. However, predominant methods based on multi- modal encoders often underperform due to their reliance on direct data-to-parameter map- ping functions, bypassing the optimization steps necessary for capturing the complexities of real-world signals. To address this gap, we propose Multimodal Iterative Adaptation (MIA), a novel framework that combines the strengths of multimodal fusion with optimization-based meta-learning. The key idea is to enhance the learning of INRs by facilitating exchange of cross-modal knowledge among learners during the iterative optimization processes, improv- ing generalization and enabling a more nuanced adaptation to complex signals. To achieve this, we introduce State Fusion Transformers (SFTs), an attention-based meta-learner de- signed to operate in the backward pass of the learners, aggregating learning states, capturing cross-modal relationships, and predicting enhanced parameter updates for the learners. Our extensive evaluation in various real-world multimodal signal regression setups shows that MIA outperforms existing baselines in both generalization and memorization performances. Our code is available at https://github.com/yhytoto12/MIA.

Cite

Text

Lee et al. "Meta-Learning Approach for Joint Multimodal Signals with Multimodal Iterative Adaptation." Transactions on Machine Learning Research, 2024.

Markdown

[Lee et al. "Meta-Learning Approach for Joint Multimodal Signals with Multimodal Iterative Adaptation." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/lee2024tmlr-metalearning/)

BibTeX

@article{lee2024tmlr-metalearning,
  title     = {{Meta-Learning Approach for Joint Multimodal Signals with Multimodal Iterative Adaptation}},
  author    = {Lee, Sehun and Lee, Wonkwang and Kim, Gunhee},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/lee2024tmlr-metalearning/}
}