Multi-Camera 3D Position Estimation Using Conditional Random Field

Abstract

In order to realize effective and safe human-robot collaboration where many humans and robots complement each other in close proximity, digital twin of the space would play a crucial role to monitor the behaviors of many robots and humans simultaneously and precisely in real time. Constructing such a digital twin requires estimating the precise 3D positions of instances in space, but Bluetooth sensors lack accuracy, and LiDARs are costly when covering wide areas. Therefore, we propose the use of multiple cameras to capture overlapping videos of the space and reconstruct the 3D positions of instances using geometrical methods. We propose a multimodal approach that utilizes not only vision features, but also position features, to detect the same objects in multiple cameras and use Conditional Random Field (CRF) to infer the identicality of objects detected in multiple cameras. The 3D positions of an instance taken from multiple 2D cameras are then geographically estimated. In the evaluation, we demonstrate the effects of CRF and multimodal approach, and achieve comparative performance with the state-of-the-art method.

Cite

Text

Matsuda et al. "Multi-Camera 3D Position Estimation Using Conditional Random Field." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00205

Markdown

[Matsuda et al. "Multi-Camera 3D Position Estimation Using Conditional Random Field." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/matsuda2023iccvw-multicamera/) doi:10.1109/ICCVW60793.2023.00205

BibTeX

@inproceedings{matsuda2023iccvw-multicamera,
  title     = {{Multi-Camera 3D Position Estimation Using Conditional Random Field}},
  author    = {Matsuda, Shusuke and Techasarntikul, Nattaon and Shimonishi, Hideyuki},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2023},
  pages     = {1900-1908},
  doi       = {10.1109/ICCVW60793.2023.00205},
  url       = {https://mlanthology.org/iccvw/2023/matsuda2023iccvw-multicamera/}
}