Predicting Geo-Informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks

Abstract

Geographic location is a powerful property for organizing large-scale photo collections, but only a small fraction of online photos are geo-tagged. Most work in automatically estimating geo-tags from image content is based on comparison against models of buildings or landmarks, or on matching to large reference collections of geotagged images. These approaches work well for frequently photographed places like major cities and tourist destinations, but fail for photos taken in sparsely photographed places where few reference photos exist. Here we consider how to recognize general geo-informative attributes of a photo, e.g. the elevation gradient, population density, demographics, etc. of where it was taken, instead of trying to estimate a precise geo-tag. We learn models for these attributes using a large (noisy) set of geo-tagged images from Flickr by training deep convolutional neural networks (CNNs). We evaluate on over a dozen attributes, showing that while automatically recognizing some attributes is very difficult, others can be automatically estimated with about the same accuracy as a human.

Cite

Text

Lee et al. "Predicting Geo-Informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015. doi:10.1109/WACV.2015.79

Markdown

[Lee et al. "Predicting Geo-Informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015.](https://mlanthology.org/wacv/2015/lee2015wacv-predicting/) doi:10.1109/WACV.2015.79

BibTeX

@inproceedings{lee2015wacv-predicting,
  title     = {{Predicting Geo-Informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks}},
  author    = {Lee, Stefan and Zhang, Haipeng and Crandall, David J.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2015},
  pages     = {550-557},
  doi       = {10.1109/WACV.2015.79},
  url       = {https://mlanthology.org/wacv/2015/lee2015wacv-predicting/}
}