Camera orientation, calibration and inverse perspective with uncertainties: A Bayesian method applied to area estimation from diverse photographs

https://doi.org/10.1016/j.isprsjprs.2019.11.013Get rights and content

Abstract

Large collections of images have become readily available through modern digital catalogs, from sources as diverse as historical photographs, aerial surveys, or user-contributed pictures. Exploiting the quantitative information present in such wide-ranging collections can greatly benefit studies that follow the evolution of landscape features over decades, such as measuring areas of glaciers to study their shrinking under climate change. However, many available images were taken with low-quality lenses and unknown camera parameters. Useful quantitative data may still be extracted, but it becomes important to both account for imperfect optics, and estimate the uncertainty of the derived quantities. In this paper, we present a method to address both these goals, and apply it to the estimation of the area of a landscape feature traced as a polygon on the image of interest. The technique is based on a Bayesian formulation of the camera calibration problem. First, the probability density function (PDF) of the unknown camera parameters is determined for the image, based on matches between 2D (image) and 3D (world) points together with any available prior information. In a second step, the posterior distribution of the feature area of interest is derived from the PDF of camera parameters. In this step, we also model systematic errors arising in the polygon tracing process, as well as uncertainties in the digital elevation model. The resulting area PDF therefore accounts for most sources of uncertainty. We present validation experiments, and show that the model produces accurate and consistent results. We also demonstrate that in some cases, accounting for optical lens distortions is crucial for accurate area determination with consumer-grade lenses. The technique can be applied to many other types of quantitative features to be extracted from photographs when careful error estimation is important.

Introduction

A large amount of quantitative physical landscape information can be extracted from terrestrial, aerial and satellite imagery using various photogrammetric techniques (Streilein, 1994, Gruen and Li, 1995, Haala and Brenner, 1999, Küng et al., 2012, Feurer and Vinatier, 2018). Inverse perspective methods (e.g. monoplotting), as reviewed by Criminisi, 2001, Förstner and Wrobel, 2016, aim at extracting referenced spatial data from a single picture. Such methods have been used to extract data from either aerial, satellite or terrestrial imagery (Jordan et al., 2005, Bozzini et al., 2012, Murtiyoso et al., 2014, Produit et al., 2016). Inverse perspective methods are particularly used in the study of Earth surface processes and landscape evolution to produce or update geological and geomorphological map data (Warner et al., 1993, Jauregui et al., 2002, Micheletti et al., 2015, Scapozza et al., 2016) or, among others, in civil engineering and building stability assessment (Murtiyoso et al., 2014). The methods have also found a particular echo in the community of cryospheric sciences, as they allow to reconstruct and monitor the evolution of glaciers over different time scales, ranging from centennial to annual fluctuations (Wiesmann et al., 2012, Piermattei et al., 2015, Čekada et al., 2016), as well as further the understanding of glacier mass balance processes (Chapuis et al., 2010).

Inverse perspective methods also allow to tap into a wealth of quantitative information present in large and diverse databases of images readily accessible from the Internet, such as historical records, aerial surveys, or user-contributed pictures. However, these images are of uneven quality: many were taken without scientific intent, often with low-quality lenses and unknown camera parameters, and are sometimes only available in low resolution. These limitations can introduce significant uncertainties and biases in the information obtained from camera orientation and calibration. Useful quantitative data may still be extracted, but accounting for potential lens distortions and quantifying the uncertainty of the results become important.

In this paper, we present a method to address both goals, applying it to the estimation of the area of landscape features, with the determination of the areas of mountain glaciers in mind. The present technique is a two-step process based on Bayesian inference.

First, the unknown camera parameters, including lens optical distortions, are estimated using a Bayesian formulation of the camera orientation with calibration problem, in the form of a spatial resection problem: the posterior probability density function (PDF) of camera parameters is obtained from matches between 2D points in the image and 3D points in world coordinates, together with any available prior information. Bayesian approaches to camera calibration were presented by several authors, either based on finding a single value of the parameters which maximize a posterior distribution (e.g. Valkenburg, 1998, Zhang, 2000) or using the whole resulting posterior distribution more extensively (e.g. Sundareswara and Schrater, 2005). In our work, we keep the full statistical information contained in the posterior distribution of camera parameters, by generating samples distributed according to the posterior distribution using Markov chain Monte Carlo (MCMC) sampling.

In a second step, the posterior PDF of the feature area is derived from the posterior of camera parameters by solving an inverse perspective problem. The outline of the feature of interest is manually traced on the photograph as a polygon, which is then back projected from the 2D image onto the 3D world using a digital elevation model (DEM). This back projection step accounts for uncertainties on the camera parameters, and attempts to model possible systematic errors introduced by the polygon tracing step, together with uncertainties introduced by the DEM. In particular, we propose a model for DEM errors for which both the root mean square error (RMSE) and spatial autocorrelation scale are locally varying. The resulting PDF of the back projected 3D feature area therefore contains information about most of the uncertainties of the process.

More generally, we attempt to unify the camera orientation with calibration, uncertainty modeling and inverse perspective problems into a statistically consistent framework which can be extended to similar classes of problems and uncertainty models.

We stress that given the diverse nature and sources of our target images and the fact that many were taken using low-quality or unknown equipment, the focus of this paper is more on uncertainty estimation than on very accurate photogrammetric techniques. The reconstructed camera location, for example, cannot be expected to be more accurate than a few meters, given that we will be working with low resolution landscape images, and that our 3D ground control point coordinates will not come from precision geodetic sources.

We start by describing our Bayesian formulation of the camera orientation with calibration problem in Section 2, and our polygon back projection method in Section 3. Section 4 describes details of our implementation, including and the posterior probability density sampling process. In Section 5, we present test problems for validation, before discussing the results in Section 6 and concluding in Section 7.

Section snippets

Camera orientation and calibration

Extracting metric measurements from digital images requires estimating the parameters of the imaging camera used to take the picture, such as its position and orientation, focal length, and possibly other optical properties. For typical orientation problems, this may be done by matching points with known 3D world coordinates with their corresponding 2D projections in the image under study. Camera orientation and calibration then consists in finding the camera parameters that best reproduce the

Inverse perspective and uncertainties

We now discuss the inverse perspective step, in the form of the back projection, which is needed to reconstruct the area of a landscape feature from its outline in the image. In Section 3.1, we first describe the back projection process through which we obtain the area S from the camera calibration results. To account for uncertainties in both the polygon tracing process as well as DEM elevations, we derive in 3.2 the posterior distribution of S, assuming imperfect knowledge of the polygon and

Implementation and MCMC sampling

The first step towards implementation is evaluating the Bayesian posterior on camera parameters θ. We have by now specified the full prior (Eq. (11) with terms discussed in Section 2.4), and likelihood using the camera model and Eqs. (17), (19). We can therefore evaluate the posterior probability density p(θ|D,I) using (2) for any value of the parameter θ.

Validation

In this section, we setup validation case studies to demonstrate the technique presented in this paper. We use a combination of photographs, both aerial and terrestrial oblique of different origins and quality, to assess key aspects of our method.

First, we consider the problem of camera orientation with calibration, by fitting images for which the camera location is known approximately, and comparing the obtained posterior on the camera position to the available photograph information.

Based on

Discussion

In Section 5.2 we compared the results of the camera calibration procedure to known values of camera parameters. We found that the errors of camera parameters reconstruction for terrestrial oblique pictures are of the same order of magnitude (a few meters) than the basic sources of uncertainty in the method such as the 3D position of the ground control points. This indicates an accurate reconstruction of the camera position by our method, to the intrinsic level of accuracy allowed by the data

Conclusion

In this paper, we presented a novel method for estimating surface area information from landscape features using single aerial and terrestrial photographs. Driven by the goal of characterizing uncertainties on the solutions of inverse perspective problems for archival or non-scientific photographs, we introduced models for errors in input data, as well as for characterizing uncertainties in digital elevation models. We integrated these ingredients into a statistically consistent Bayesian

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors would like to thank the anonymous reviewers for their feedback and comments, which have contributed to significantly improving the manuscript. We also would like to thank the French Institut Gégographique National (IGN) for helpful exchanges, and for providing us with valuable information on the aerial missions. Some map data is copyrighted by OpenStreetMap contributors and available from https://www.openstreetmap.org. Funding: This study is part of the ANR 14-CE03-0006 VIP Mont

References (70)

  • B.H. Carlisle

    Modelling the spatial distribution of DEM error

    Trans. GIS

    (2005)
  • M.T. Čekada et al.

    Monitoring Glacier Changes with the Use of Archive Images: The Example of the Julian Alps (NW Slovenia, NE Italy)

  • A. Chapuis et al.

    Interpretation of amplitude data from a ground-based radar in combination with terrestrial photogrammetry and visual observations for calving monitoring of Kronebreen, Svalbard

    Annals Glaciol.

    (2010)
  • CIPA Standardization Committee, Guideline for Noting Digital Camera Specifications in Catalogs, Revised Version (Oct....
  • A. Criminisi

    Accurate Visual Metrology from Single and Multiple Uncalibrated Images, Distinguished Dissertations

    (2001)
  • European GNSS Agency, EGNOS Open Service (OS) Service Definition Document (Oct....
  • Fisher, P, 1991. First Experiments in Viewshed Uncertainty: The Accuracy of the Viewshed Area, Photogrammetric...
  • P. Fisher

    Improved modeling of elevation error with geostatistics

    GeoInformatica

    (1998)
  • T.C.O. Fonseca et al.

    Objective Bayesian analysis for the Student-t regression model

    Biometrika

    (2008)
  • D. Foreman-Mackey et al.

    Emcee: the MCMC Hammer

    Publ. Astron. Soc. Pac.

    (2013)
  • Förstner, W., Wrobel, B.P., 2016. Photogrammetric Computer Vision, Vol. 11 of Geometry and Computing, Springer...
  • A. Gelman et al.

    Posterior predictive assessment of model fitness via realized discrepancies

    Stat. Sin.

    (1996)
  • A. Gelman et al.

    Bayesian Data Analysis

    (2013)
  • S.K. Ghosh

    Fundamentals of Computational Photogrammetry

    (2005)
  • J. Goodman et al.

    Ensemble samplers with affine invariance

    Commun. Appl. Math. Comput. Sci.

    (2010)
  • A. Gruen

    Adaptive least squares correlation: a powerful image matching technique

    South African J. Photogram., Remote Sens., Cartogr.

    (1985)
  • W. Haneberg

    Effects of digital elevation model errors on spatially distributed seismic slope stability calculations: an example from Seattle, Washington

    Environ. Eng. Geosci.

    (2006)
  • Heikkila, J., Silven, O., 1997. A four-step camera calibration procedure with implicit image correction. In:...
  • C. Heipke

    A global approach for least-squares image matching and surface reconstruction in object space

    Photogram. Eng.

    (1992)
  • Hobbie, D., 2010. The development of photogrammetric instruments and methods at Carl Zeiss in Oberkochen,...
  • Hogg, D.W., Foreman-Mackey, D., 2017. Data analysis recipes: Using Markov Chain Monte Carlo, arXiv:1710.06068...
  • Hogg, D.W., Bovy, J., Lang, D., 2010. Data analysis recipes: Fitting a model to data, arXiv:1008.4686 [astro-ph,...
  • G.J. Hunter et al.

    Modeling the uncertainty of slope and aspect estimates derived from spatial databases

    Geogr. Anal.

    (1997)
  • I.G. National, Remonter le temps, https://remonterletemps.ign.fr/, Jul....
  • E. Jordan et al.

    Estimation by photogrammetry of the glacier recession on the Cotopaxi Volcano (Ecuador) between 1956 and 1997

    Hydrol. Sci. J.

    (2005)
  • Cited by (7)

    • Automated point cloud classification using an image-based instance segmentation for structure from motion

      2021, Automation in Construction
      Citation Excerpt :

      We decided to use deep learning because of its outstanding performance over image-related problems [46]. Meanwhile, images from actual locations were selected as an input for PCIS to raise the number of training data because there are currently more public datasets available in the academic society [47], and also numerous images are taken as a part of daily work basis [19,20] due to the common utilization of digital cameras and other capturing tools [48]. Furthermore, other than the point cloud, the parameters such as the focal length, the rotation, and the translation of the capturing tool can also be achieved.

    View all citing articles on Scopus
    View full text