Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans
Abstract
We propose an unsupervised method for parsing large 3D scans of real-world scenes with easily-interpretable shapes. This work aims to provide a practical tool for analyzing 3D scenes in the context of aerial surveying and mapping without the need for user annotations. Our approach is based on a probabilistic reconstruction model that decomposes an input 3D point cloud into a small set of learned prototypical 3D shapes. The resulting reconstruction is visually interpretable and can be used to perform unsupervised instance and low-shot semantic segmentation of complex scenes. We demonstrate the usefulness of our model on a novel dataset of seven large aerial LiDAR scans from diverse real-world scenarios. Our approach outperforms state-of-the-art unsupervised methods in terms of decomposition accuracy while remaining visually interpretable. Our code and dataset are available at https://romainloiseau.fr/learnable-earth-parser/.
Cite
Text
Loiseau et al. "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02633Markdown
[Loiseau et al. "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/loiseau2024cvpr-learnable/) doi:10.1109/CVPR52733.2024.02633BibTeX
@inproceedings{loiseau2024cvpr-learnable,
title = {{Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans}},
author = {Loiseau, Romain and Vincent, Elliot and Aubry, Mathieu and Landrieu, Loic},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {27874-27884},
doi = {10.1109/CVPR52733.2024.02633},
url = {https://mlanthology.org/cvpr/2024/loiseau2024cvpr-learnable/}
}