DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications
Abstract
Significant efforts have been directed towards adapting self-supervised multimodal learning for Earth observation applications. However, most current methods produce coarse patch-sized embeddings, limiting their effectiveness and integration with other modalities like LiDAR. To close this gap, we present DUNIA, an approach to learn pixel-sized embeddings through cross-modal alignment between images and full-waveform LiDAR data. As the model is trained in a contrastive manner, the embeddings can be directly leveraged in the context of a variety of environmental monitoring tasks in a zero-shot setting. In our experiments, we demonstrate the effectiveness of the embeddings for seven such tasks: canopy height mapping, fractional canopy cover, land cover mapping, tree species identification, plant area index, crop type classification, and per-pixel waveform-based vertical structure mapping. The results show that the embeddings, along with zero-shot classifiers, often outperform specialized supervised models, even in low-data regimes. In the fine-tuning setting, we show strong performances near or better than the state-of-the-art on five out of six tasks.
Cite
Text
Fayad et al. "DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Fayad et al. "DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/fayad2025icml-dunia/)BibTeX
@inproceedings{fayad2025icml-dunia,
title = {{DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications}},
author = {Fayad, Ibrahim and Zimmer, Max and Schwartz, Martin and Gieseke, Fabian and Ciais, Philippe and Belouze, Gabriel and Brood, Sarah and De Truchis, Aurélien and D’Aspremont, Alexandre},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {16375-16406},
volume = {267},
url = {https://mlanthology.org/icml/2025/fayad2025icml-dunia/}
}