Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans

Abstract

We propose an unsupervised method for parsing large 3D scans of real-world scenes with easily-interpretable shapes. This work aims to provide a practical tool for analyzing 3D scenes in the context of aerial surveying and mapping without the need for user annotations. Our approach is based on a probabilistic reconstruction model that decomposes an input 3D point cloud into a small set of learned prototypical 3D shapes. The resulting reconstruction is visually interpretable and can be used to perform unsupervised instance and low-shot semantic segmentation of complex scenes. We demonstrate the usefulness of our model on a novel dataset of seven large aerial LiDAR scans from diverse real-world scenarios. Our approach outperforms state-of-the-art unsupervised methods in terms of decomposition accuracy while remaining visually interpretable. Our code and dataset are available at https://romainloiseau.fr/learnable-earth-parser/.

Cite

Text

Loiseau et al. "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02633

Markdown

[Loiseau et al. "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/loiseau2024cvpr-learnable/) doi:10.1109/CVPR52733.2024.02633

BibTeX

@inproceedings{loiseau2024cvpr-learnable,
  title     = {{Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans}},
  author    = {Loiseau, Romain and Vincent, Elliot and Aubry, Mathieu and Landrieu, Loic},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {27874-27884},
  doi       = {10.1109/CVPR52733.2024.02633},
  url       = {https://mlanthology.org/cvpr/2024/loiseau2024cvpr-learnable/}
}