research-article

Identification of scene locations from geotagged images

Authors:
Jong-Seung Park

University of Incheon, Songdo-dong, Incheon, Korea

University of Incheon, Songdo-dong, Incheon, Korea
View Profile

,
Ramesh Jain

University of California, Irvine, Donald Bren Hall, CA

University of California, Irvine, Donald Bren Hall, CA
View Profile

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 9 Issue 1Article No.: 5pp 1–23https://doi.org/10.1145/2422956.2422961

Published:19 February 2013Publication History

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

Due to geotagging capabilities of consumer cameras, it has become easy to capture the exact geometric location where a picture is taken. However, the location is not the whereabouts of the scene taken by the photographer but the whereabouts of the photographer himself. To determine the actual location of an object seen in a photo some sophisticated and tiresome steps are required on a special camera rig, which are generally not available in common digital cameras. This article proposes a novel method to determine the geometric location corresponding to a specific image pixel. A new technique of stereo triangulation is introduced to compute the relative depth of a pixel position. Geographical metadata embedded in images are utilized to convert relative depths to absolute coordinates. When a geographic database is available we can also infer the semantically meaningful description of a scene object from where the specified pixel is projected onto the photo. Experimental results demonstrate the effectiveness of the proposed approach in accurately identifying actual locations.

References

Baker, S. and Matthews, I. 2004. Lucas-Kanade 20 years on: A unifying framework. Int. J. Comput. Vis. 56, 3, 221--255. Google ScholarDigital Library
Chang, C. and Chatterjee, C. 1992. Quantization error analysis in stereo vision. In Proceedings of the 26th Asilomar Conference on Signals, Systems and Computers. 1037--1041.Google Scholar
Ebling, M. R. and Cáceres, R. 2010. Gaming and augmented reality come to location-based services. IEEE Pervas. Comput. 9, 5--6. Google ScholarDigital Library
Gluckman, J. and Nayar, S. K. 2001. Rectifying transformations that minimize resampling effects. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 1. 111--117.Google Scholar
Goodchild, M. F. 2000. New horizons for the social sciences: geographic information systems. In Social Sciences for a Digital World: Building Infrastructure and Databases for the Future, 163--172.Google Scholar
Guan, W., You, S., and Neumann, U. 2011. Gps-aided recognition-based user tracking system with augmented reality in extreme large-scale areas. In Proceedings of the 2nd Annual ACM Conference on Multimedia Systems. 1--10. Google ScholarDigital Library
Hadjitheophanous, S., Ttofis, C., Georghiades, A. S., and Theocharides, T. 2010. Towards hardware stereoscopic 3D reconstruction: a real-time FPGA computation of the disparity map. In Proceedings of the Conference on Design, Automation and Test in Europe. 1743--1748. Google ScholarDigital Library
Haklay, M., Singleton, A., and Parker, C. 2008. Web Mapping 2.0: The neogeography of the GeoWeb. Geography Compass 2, 6, 2011--2039.Google ScholarCross Ref
Hartley, R. I. and Sturm, P. 1997. Triangulation. Comput. Vis. Image Understand. 68, 2, 146--157. Google ScholarDigital Library
Hoashi, K., Uemukai, T., Matsumoto, K., and Takishima, Y. 2009. Constructing a landmark identification system for geo-tagged photographs based on web data analysis. In Proceedings of the IEEE International Conference on Multimedia and Expo. 606--609. Google ScholarDigital Library
Hudson-Smith, A., Crooks, A., Gibin, M., Milton, R., and Batty, M. 2009. NeoGeography and web 2.0: Concepts, tools and applications. J. Location Based Serv. 3, 2, 118--145. Google ScholarDigital Library
IPTC. 2010. IPTC photo metadata: Core 1.1/Extension 1.1. Tech. rep., International Press Telecommunications Council. July.Google Scholar
Jain, R. and Sinha, P. 2010. Content without context is meaningless. In Proceedings of the International Conference on Multimedia (MM'10). 1259--1268. Google ScholarDigital Library
Jawed, K., Morris, J., Khan, T., and Gimelfarby, G. 2009. Real time rectification for stereo correspondence. In Proceedings of the International Conference on Computational Science and Engineering. Vol. 2. 277--284. Google ScholarDigital Library
JEITA. 2002. Exchangeable image file format for digital still cameras: Exif version 2.2. Tech. rep. JEITA CP-3451, Japan Electronics and Information Technology Industries Association.Google Scholar
Johnson, L., Levine, A., and Smith, R. 2009. Geo-Everything. In The Horizon Report, 15--18.Google Scholar
Kalantidis, Y., Tolias, G., Avrithis, Y., Phinikettos, M., Spyrou, E., Mylonas, P., and Kollias, S. 2011. Viral: Visual image retrieval and localization. Multimedia Tools Appl. 51, 2, 555--592. Google ScholarDigital Library
Kanatani, K., Sugaya, Y., and Niitsuma, H. 2008. Triangulation from two views revisited: Hartley-Sturm vs. optimal correction. In Proceedings of the 19th British Machine Vision Conference (BMVC '08). 173--182.Google Scholar
Loop, C. and Zhang, Z. 1999. Computing rectifying homographies for stereo vision. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 1. 125--131.Google Scholar
MWG. 2009. Guidelines for handling image metadata 1.0.1. Tech. rep., Metadata Working Group.Google Scholar
NIMA. 1997. Department of defense world geodetic system 1984, its definition and relationships with local geodetic systems. Tech. rep. TR8350.2, National Imagery and Mapping Agency.Google Scholar
Pollefeys, M. and Sinha, S. N. 2004. Iso-Disparity surfaces for general stereo configurations. In Proceedings of the European Conference on Computer Vision. Vol. 3. 509--520.Google Scholar
Ramm, F., Topf, J., and Chilton, S. 2010. OpenStreetMap: Using and Enhancing the Free Map of the World. UIT Cambridge.Google Scholar
Reisch, R. and Parulski, K. A. 2009. Digital camera image storage formats. In Single-Sensor Imaging: Methods and Applications for Digital Cameras, R. Lukac, Ed., CRC Press.Google Scholar
Singh, V. K., Gao, M., and Jain, R. 2010. Social pixels: genesis and evaluation. In Proceedings of the International Conference on Multimedia (MM'10). 481--490. Google ScholarDigital Library
Sinnott, R. W. 1984. Virtues of the haversine. Sky and Telescope 68, 2, 159.Google Scholar
Staudinger, E., Humenberger, M., and Kubinger, W. 2008. FPGA-based rectification and lens undistortion for a real-time embedded stereo vision sensor. In Proceedings of FH Science Day. 18--25.Google Scholar
Tomasi, C. and Kanade, T. 1991. Detection and tracking of point features. Tech. rep. CMU-CS-91-132, Carnegie Mellon University.Google Scholar
Tsai, R. Y. 1987. A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses. IEEE J. Robot. Autom. 3, 4, 323--344.Google ScholarCross Ref
Vajda, P., Ivanov, I., Lee, J.-S., Goldmann, L., and Ebrahimi, T. 2010. Propagation of geotags based on object duplicate detection. In Proceedings of SPIE.Vol. 7798, 27.Google Scholar
van der Mark, W. and Gavrila, D. M. 2006. Real-Time dense stereo for intelligent vehicles. IEEE Trans. Intell. Transport. Syst. 7, 1, 38--50. Google ScholarDigital Library
Viana, W., Hammiche, S., Villanova-Oliver, M., Gensel, J., and Martin, H. 2008. Photo context as a bag of words. In Proceedings of the IEEE International Symposium on Multimedia (ISM '08). 310--315. Google ScholarDigital Library
Wang, J. and Liu, Y. 2007. A closed-form solution of reconstruction from nonparallel stereo geometry used in image guided system for surgery. In Proceedings of the International Conference on Multimedia Content Analysis and Mining. 371--380. Google ScholarDigital Library
Weih, R., Gilbert, M., Cross, J., and Freeman, D. 2009. Accuracy assessment of recreational and mapping grade GPS receivers. J. Arkansas Acad. Scie. 63, 163--168.Google Scholar
Yaegashi, K. and Yanai, K. 2009. Can geotags help image recognition&quest; In Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology (PSIVT '09). 361--373. Google ScholarDigital Library
Yuan, J., Luo, J., and Wu, Y. 2010. Mining compositional features from gps and visual cues for event recognition in photo collections. IEEE Trans. Multimedia 12, 7, 705--716. Google ScholarDigital Library
Zhang, Z. 2000. A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22, 11, 1330--1334. Google ScholarDigital Library
Zitova, B. and Flusser, J. 2003. Image registration methods: A survey. Image Vis. Comput. 21, 11, 977--1000.Google ScholarCross Ref

Index Terms

Identification of scene locations from geotagged images
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
      2. Computer vision tasks
        Scene understanding
2. Information systems
  1. Information retrieval

Recommendations

Catadioptric Stereo Using Planar Mirrors

By using mirror reflections of a scene, stereo images can be captured with a single camera (catadioptric stereo). In addition to simplifying data acquisition single camera stereo provides both geometric and radiometric advantages over traditional two ...
Read More
Robust camera pose and scene structure analysis for service robotics

Successful path planning and object manipulation in service robotics applications rely both on a good estimation of the robot's position and orientation (pose) in the environment, as well as on a reliable understanding of the visualized scene. In this ...
Read More
An analysis of the relation between visual concepts and geo-locations using geotagged images on the web
ICME'09: Proceedings of the 2009 IEEE international conference on Multimedia and Expo

Recently, a large number of geotagged images are available on photo sharing Web sites such as Flickr. In this paper, we propose image region entropy and geo-location entropy for analyzing the relation between visual concepts and geographical locations ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 9, Issue 1
February 2013
158 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/2422956
Issue’s Table of Contents

Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 February 2013
- Accepted: 1 January 2012
- Revised: 1 November 2011
- Received: 1 June 2011
Published in tomm Volume 9, Issue 1

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Visual context
geotag
image metadata
stereo vision
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 474
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Identification of scene locations from geotagged images

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Catadioptric Stereo Using Planar Mirrors

Robust camera pose and scene structure analysis for service robotics

An analysis of the relation between visual concepts and geo-locations using geotagged images on the web

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Identification of scene locations from geotagged images

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Catadioptric Stereo Using Planar Mirrors

Robust camera pose and scene structure analysis for service robotics

An analysis of the relation between visual concepts and geo-locations using geotagged images on the web

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media