Abstract
Geotagging in consumer cameras has made it easy to record the exact geographic location where a picture is taken. That location, however, is the position of the photographer, not of the scene being photographed. Determining the actual location of an object seen in a photo normally requires sophisticated and tedious steps on a special camera rig, which is generally not available with common digital cameras. This article proposes a novel method to determine the geographic location corresponding to a specific image pixel. A new stereo triangulation technique is introduced to compute the relative depth of a pixel position, and the geographical metadata embedded in the images are used to convert relative depths to absolute coordinates. When a geographic database is available, a semantically meaningful description of the scene object projected onto the specified pixel can also be inferred. Experimental results demonstrate the effectiveness of the proposed approach in accurately identifying actual locations.
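The abstract does not spell out the triangulation technique, so the following is only a minimal sketch of the general idea, not the article's method. It assumes a rectified, parallel stereo pair, a baseline taken as the great-circle distance between the two geotags, and a known compass bearing toward the object. The function names, the spherical Earth model, and all parameters are illustrative assumptions.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius; spherical approximation (assumption)


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def depth_from_disparity(focal_px, baseline_m, disparity_px):
    """Depth in metres for a rectified parallel stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


def destination(lat, lon, bearing_deg, distance_m):
    """Point reached from (lat, lon) after travelling distance_m along bearing_deg."""
    phi1, lam1 = math.radians(lat), math.radians(lon)
    theta = math.radians(bearing_deg)
    delta = distance_m / EARTH_RADIUS_M  # angular distance
    phi2 = math.asin(math.sin(phi1) * math.cos(delta)
                     + math.cos(phi1) * math.sin(delta) * math.cos(theta))
    lam2 = lam1 + math.atan2(math.sin(theta) * math.sin(delta) * math.cos(phi1),
                             math.cos(delta) - math.sin(phi1) * math.sin(phi2))
    return math.degrees(phi2), math.degrees(lam2)


def locate_pixel(cam_a, cam_b, focal_px, disparity_px, bearing_deg):
    """Estimate the lat/lon of the object seen at a pixel (hypothetical helper)."""
    baseline = haversine_m(*cam_a, *cam_b)          # baseline from the two geotags
    depth = depth_from_disparity(focal_px, baseline, disparity_px)
    return destination(cam_a[0], cam_a[1], bearing_deg, depth)
```

In practice, consumer GPS error can be comparable to a short baseline, so a sketch like this would be very sensitive to geotag noise; it illustrates the geometry only.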
Index Terms
- Identification of scene locations from geotagged images