Rectification of Figures and Photos in Document Images Using Bounding Box Interface

Abstract

This paper proposes an algorithm for the segmentation and rectification of figures and photos in document images. The algorithm requires just a rough user-provided bounding box for the objects in a single-view image. On receiving the user's bounding box, it takes about 1-2 seconds to segment and rectify mega-pixel sized figures. The main feature of the algorithm is a novel segmentation method that exploits the properties of printed figures. Specifically, a set of boundary candidates is generated using the properties, and the optimal boundary in the set is found by using an alternating optimization scheme. This segmentation result is further refined so that it is well localized to the true boundary. In addition to our segmentation method, we also propose a new boundary interpolation method for the rectification of segmented figures. The method improves the quality of output by largely removing perspective distortions compared to conventional boundary interpolation methods. Experimental results on a variety of images show that the method is efficient, robust, and easy to use.

Cite

Text

Koo and Cho. "Rectification of Figures and Photos in Document Images Using Bounding Box Interface." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5540071

Markdown

[Koo and Cho. "Rectification of Figures and Photos in Document Images Using Bounding Box Interface." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/koo2010cvpr-rectification/) doi:10.1109/CVPR.2010.5540071

BibTeX

@inproceedings{koo2010cvpr-rectification,
  title     = {{Rectification of Figures and Photos in Document Images Using Bounding Box Interface}},
  author    = {Koo, Hyung Il and Cho, Nam Ik},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2010},
  pages     = {3121-3128},
  doi       = {10.1109/CVPR.2010.5540071},
  url       = {https://mlanthology.org/cvpr/2010/koo2010cvpr-rectification/}
}