Sensor Equivariance: A Framework for Semantic Segmentation with Diverse Camera Models

Hannes Reichert, Manuel Hetzel, Andreas Hubert, Konrad Doll, Bernhard Sick

CVPRW 2024 pp. 1254-1261

doi:10.1109/CVPRW63382.2024.00132 /cvprw/2024/reichert2024cvprw-sensor/

Abstract

Objects are represented differently in projection-based sensors such as cameras depending on sensor resolution, field of view, and distortion, leading to distorted physical and geometric properties. As a result, sensor data processing depend on these properties. With the large variations of sensors on the market, an equivariant representation and suitable processing are necessary to become independent of the sensor used. In this work, we propose an extension of conventional image data by an additional channel in which the associated projection properties are encoded. Furthermore, we introduce a SensorConv layer as an extension to the conventional convolution layer. SensorConv enable using projection properties in convolutional neural networks. To that end, we propose an architecture for using the SensorConv layer in the Detectron2 [21] framework. We collected a dataset of equirectangular images for our experiments with the CARLA [3] simulator. To analyze multiple sensor models (i.e., sensor intrinsic), we created an augmentation method to emulate a high variability of sensors from the collected equirectangular panoramas. In our experiment, we show that our method can generalize better across different camera sensors.

CVPRW Semantic Scholar

Cite

Text

Reichert et al. "Sensor Equivariance: A Framework for Semantic Segmentation with Diverse Camera Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00132

Markdown

[Reichert et al. "Sensor Equivariance: A Framework for Semantic Segmentation with Diverse Camera Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/reichert2024cvprw-sensor/) doi:10.1109/CVPRW63382.2024.00132

BibTeX

@inproceedings{reichert2024cvprw-sensor,
  title     = {{Sensor Equivariance: A Framework for Semantic Segmentation with Diverse Camera Models}},
  author    = {Reichert, Hannes and Hetzel, Manuel and Hubert, Andreas and Doll, Konrad and Sick, Bernhard},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {1254-1261},
  doi       = {10.1109/CVPRW63382.2024.00132},
  url       = {https://mlanthology.org/cvprw/2024/reichert2024cvprw-sensor/}
}