Pose Estimation, Model Refinement, and Enhanced Visualization Using Video

Abstract

In this paper we present methods for exploitation and enhanced visualization of video given a prior coarse untextured polyhedral model of a scene. Since it is necessary to estimate the 3D poses of the moving camera, we develop an algorithm where tracked features are used to predict the pose between frames and the predicted poses are refined by a coarse to fine process of aligning projected 3D model line segments to oriented image gradient energy pyramids. The estimated poses can be used to update the model with information derived from video, and to re-project and visualize the video from different points of view with a larger scene context. Via image registration, we update the placement of objects in the model and the 3D shape of new or erroneously modeled objects, then map video texture to the model. Experimental results are presented for long aerial and ground level videos of a large-scale urban scene.

Cite

Text

Hsu et al. "Pose Estimation, Model Refinement, and Enhanced Visualization Using Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000. doi:10.1109/CVPR.2000.855859

Markdown

[Hsu et al. "Pose Estimation, Model Refinement, and Enhanced Visualization Using Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000.](https://mlanthology.org/cvpr/2000/hsu2000cvpr-pose/) doi:10.1109/CVPR.2000.855859

BibTeX

@inproceedings{hsu2000cvpr-pose,
  title     = {{Pose Estimation, Model Refinement, and Enhanced Visualization Using Video}},
  author    = {Hsu, Stephen C. and Samarasekera, Supun and Kumar, Rakesh and Sawhney, Harpreet S.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2000},
  pages     = {1488-1495},
  doi       = {10.1109/CVPR.2000.855859},
  url       = {https://mlanthology.org/cvpr/2000/hsu2000cvpr-pose/}
}