skip to main content
research-article

Identification of scene locations from geotagged images

Published:19 February 2013Publication History
Skip Abstract Section

Abstract

Due to geotagging capabilities of consumer cameras, it has become easy to capture the exact geometric location where a picture is taken. However, the location is not the whereabouts of the scene taken by the photographer but the whereabouts of the photographer himself. To determine the actual location of an object seen in a photo some sophisticated and tiresome steps are required on a special camera rig, which are generally not available in common digital cameras. This article proposes a novel method to determine the geometric location corresponding to a specific image pixel. A new technique of stereo triangulation is introduced to compute the relative depth of a pixel position. Geographical metadata embedded in images are utilized to convert relative depths to absolute coordinates. When a geographic database is available we can also infer the semantically meaningful description of a scene object from where the specified pixel is projected onto the photo. Experimental results demonstrate the effectiveness of the proposed approach in accurately identifying actual locations.

References

  1. Baker, S. and Matthews, I. 2004. Lucas-Kanade 20 years on: A unifying framework. Int. J. Comput. Vis. 56, 3, 221--255. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chang, C. and Chatterjee, C. 1992. Quantization error analysis in stereo vision. In Proceedings of the 26th Asilomar Conference on Signals, Systems and Computers. 1037--1041.Google ScholarGoogle Scholar
  3. Ebling, M. R. and Cáceres, R. 2010. Gaming and augmented reality come to location-based services. IEEE Pervas. Comput. 9, 5--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Gluckman, J. and Nayar, S. K. 2001. Rectifying transformations that minimize resampling effects. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 1. 111--117.Google ScholarGoogle Scholar
  5. Goodchild, M. F. 2000. New horizons for the social sciences: geographic information systems. In Social Sciences for a Digital World: Building Infrastructure and Databases for the Future, 163--172.Google ScholarGoogle Scholar
  6. Guan, W., You, S., and Neumann, U. 2011. Gps-aided recognition-based user tracking system with augmented reality in extreme large-scale areas. In Proceedings of the 2nd Annual ACM Conference on Multimedia Systems. 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Hadjitheophanous, S., Ttofis, C., Georghiades, A. S., and Theocharides, T. 2010. Towards hardware stereoscopic 3D reconstruction: a real-time FPGA computation of the disparity map. In Proceedings of the Conference on Design, Automation and Test in Europe. 1743--1748. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Haklay, M., Singleton, A., and Parker, C. 2008. Web Mapping 2.0: The neogeography of the GeoWeb. Geography Compass 2, 6, 2011--2039.Google ScholarGoogle ScholarCross RefCross Ref
  9. Hartley, R. I. and Sturm, P. 1997. Triangulation. Comput. Vis. Image Understand. 68, 2, 146--157. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hoashi, K., Uemukai, T., Matsumoto, K., and Takishima, Y. 2009. Constructing a landmark identification system for geo-tagged photographs based on web data analysis. In Proceedings of the IEEE International Conference on Multimedia and Expo. 606--609. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hudson-Smith, A., Crooks, A., Gibin, M., Milton, R., and Batty, M. 2009. NeoGeography and web 2.0: Concepts, tools and applications. J. Location Based Serv. 3, 2, 118--145. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. IPTC. 2010. IPTC photo metadata: Core 1.1/Extension 1.1. Tech. rep., International Press Telecommunications Council. July.Google ScholarGoogle Scholar
  13. Jain, R. and Sinha, P. 2010. Content without context is meaningless. In Proceedings of the International Conference on Multimedia (MM'10). 1259--1268. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jawed, K., Morris, J., Khan, T., and Gimelfarby, G. 2009. Real time rectification for stereo correspondence. In Proceedings of the International Conference on Computational Science and Engineering. Vol. 2. 277--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. JEITA. 2002. Exchangeable image file format for digital still cameras: Exif version 2.2. Tech. rep. JEITA CP-3451, Japan Electronics and Information Technology Industries Association.Google ScholarGoogle Scholar
  16. Johnson, L., Levine, A., and Smith, R. 2009. Geo-Everything. In The Horizon Report, 15--18.Google ScholarGoogle Scholar
  17. Kalantidis, Y., Tolias, G., Avrithis, Y., Phinikettos, M., Spyrou, E., Mylonas, P., and Kollias, S. 2011. Viral: Visual image retrieval and localization. Multimedia Tools Appl. 51, 2, 555--592. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Kanatani, K., Sugaya, Y., and Niitsuma, H. 2008. Triangulation from two views revisited: Hartley-Sturm vs. optimal correction. In Proceedings of the 19th British Machine Vision Conference (BMVC '08). 173--182.Google ScholarGoogle Scholar
  19. Loop, C. and Zhang, Z. 1999. Computing rectifying homographies for stereo vision. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 1. 125--131.Google ScholarGoogle Scholar
  20. MWG. 2009. Guidelines for handling image metadata 1.0.1. Tech. rep., Metadata Working Group.Google ScholarGoogle Scholar
  21. NIMA. 1997. Department of defense world geodetic system 1984, its definition and relationships with local geodetic systems. Tech. rep. TR8350.2, National Imagery and Mapping Agency.Google ScholarGoogle Scholar
  22. Pollefeys, M. and Sinha, S. N. 2004. Iso-Disparity surfaces for general stereo configurations. In Proceedings of the European Conference on Computer Vision. Vol. 3. 509--520.Google ScholarGoogle Scholar
  23. Ramm, F., Topf, J., and Chilton, S. 2010. OpenStreetMap: Using and Enhancing the Free Map of the World. UIT Cambridge.Google ScholarGoogle Scholar
  24. Reisch, R. and Parulski, K. A. 2009. Digital camera image storage formats. In Single-Sensor Imaging: Methods and Applications for Digital Cameras, R. Lukac, Ed., CRC Press.Google ScholarGoogle Scholar
  25. Singh, V. K., Gao, M., and Jain, R. 2010. Social pixels: genesis and evaluation. In Proceedings of the International Conference on Multimedia (MM'10). 481--490. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Sinnott, R. W. 1984. Virtues of the haversine. Sky and Telescope 68, 2, 159.Google ScholarGoogle Scholar
  27. Staudinger, E., Humenberger, M., and Kubinger, W. 2008. FPGA-based rectification and lens undistortion for a real-time embedded stereo vision sensor. In Proceedings of FH Science Day. 18--25.Google ScholarGoogle Scholar
  28. Tomasi, C. and Kanade, T. 1991. Detection and tracking of point features. Tech. rep. CMU-CS-91-132, Carnegie Mellon University.Google ScholarGoogle Scholar
  29. Tsai, R. Y. 1987. A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses. IEEE J. Robot. Autom. 3, 4, 323--344.Google ScholarGoogle ScholarCross RefCross Ref
  30. Vajda, P., Ivanov, I., Lee, J.-S., Goldmann, L., and Ebrahimi, T. 2010. Propagation of geotags based on object duplicate detection. In Proceedings of SPIE.Vol. 7798, 27.Google ScholarGoogle Scholar
  31. van der Mark, W. and Gavrila, D. M. 2006. Real-Time dense stereo for intelligent vehicles. IEEE Trans. Intell. Transport. Syst. 7, 1, 38--50. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Viana, W., Hammiche, S., Villanova-Oliver, M., Gensel, J., and Martin, H. 2008. Photo context as a bag of words. In Proceedings of the IEEE International Symposium on Multimedia (ISM '08). 310--315. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Wang, J. and Liu, Y. 2007. A closed-form solution of reconstruction from nonparallel stereo geometry used in image guided system for surgery. In Proceedings of the International Conference on Multimedia Content Analysis and Mining. 371--380. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Weih, R., Gilbert, M., Cross, J., and Freeman, D. 2009. Accuracy assessment of recreational and mapping grade GPS receivers. J. Arkansas Acad. Scie. 63, 163--168.Google ScholarGoogle Scholar
  35. Yaegashi, K. and Yanai, K. 2009. Can geotags help image recognition? In Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology (PSIVT '09). 361--373. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Yuan, J., Luo, J., and Wu, Y. 2010. Mining compositional features from gps and visual cues for event recognition in photo collections. IEEE Trans. Multimedia 12, 7, 705--716. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Zhang, Z. 2000. A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 22, 11, 1330--1334. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Zitova, B. and Flusser, J. 2003. Image registration methods: A survey. Image Vis. Comput. 21, 11, 977--1000.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Identification of scene locations from geotagged images

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Multimedia Computing, Communications, and Applications
              ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 9, Issue 1
              February 2013
              158 pages
              ISSN:1551-6857
              EISSN:1551-6865
              DOI:10.1145/2422956
              Issue’s Table of Contents

              Copyright © 2013 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 19 February 2013
              • Accepted: 1 January 2012
              • Revised: 1 November 2011
              • Received: 1 June 2011
              Published in tomm Volume 9, Issue 1

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader