Camera orientation, calibration and inverse perspective with uncertainties: A Bayesian method applied to area estimation from diverse photographs

doi:10.1016/j.isprsjprs.2019.11.013

ISPRS Journal of Photogrammetry and Remote Sensing

Volume 159, January 2020, Pages 237-255

https://doi.org/10.1016/j.isprsjprs.2019.11.013

Abstract

Large collections of images have become readily available through modern digital catalogs, from sources as diverse as historical photographs, aerial surveys, or user-contributed pictures. Exploiting the quantitative information present in such wide-ranging collections can greatly benefit studies that follow the evolution of landscape features over decades, such as measuring areas of glaciers to study their shrinking under climate change. However, many available images were taken with low-quality lenses and unknown camera parameters. Useful quantitative data may still be extracted, but it becomes important both to account for imperfect optics and to estimate the uncertainty of the derived quantities. In this paper, we present a method to address both of these goals, and apply it to the estimation of the area of a landscape feature traced as a polygon on the image of interest. The technique is based on a Bayesian formulation of the camera calibration problem. First, the probability density function (PDF) of the unknown camera parameters is determined for the image, based on matches between 2D (image) and 3D (world) points together with any available prior information. In a second step, the posterior distribution of the feature area of interest is derived from the PDF of camera parameters. In this step, we also model systematic errors arising in the polygon tracing process, as well as uncertainties in the digital elevation model. The resulting area PDF therefore accounts for most sources of uncertainty. We present validation experiments, and show that the model produces accurate and consistent results. We also demonstrate that in some cases, accounting for optical lens distortions is crucial for accurate area determination with consumer-grade lenses. The technique can be applied to many other types of quantitative features to be extracted from photographs when careful error estimation is important.

Introduction

A large amount of quantitative physical landscape information can be extracted from terrestrial, aerial and satellite imagery using various photogrammetric techniques (Streilein, 1994, Gruen and Li, 1995, Haala and Brenner, 1999, Küng et al., 2012, Feurer and Vinatier, 2018). Inverse perspective methods (e.g. monoplotting), as reviewed by Criminisi (2001) and Förstner and Wrobel (2016), aim at extracting referenced spatial data from a single picture. Such methods have been used to extract data from aerial, satellite and terrestrial imagery (Jordan et al., 2005, Bozzini et al., 2012, Murtiyoso et al., 2014, Produit et al., 2016). Inverse perspective methods are used in particular in the study of Earth surface processes and landscape evolution to produce or update geological and geomorphological map data (Warner et al., 1993, Jauregui et al., 2002, Micheletti et al., 2015, Scapozza et al., 2016) and, among other applications, in civil engineering and building stability assessment (Murtiyoso et al., 2014). The methods have also resonated in the cryospheric sciences community, as they make it possible to reconstruct and monitor the evolution of glaciers over different time scales, ranging from centennial to annual fluctuations (Wiesmann et al., 2012, Piermattei et al., 2015, Čekada et al., 2016), and to further the understanding of glacier mass balance processes (Chapuis et al., 2010).

Inverse perspective methods also make it possible to tap into a wealth of quantitative information present in large and diverse databases of images readily accessible from the Internet, such as historical records, aerial surveys, or user-contributed pictures. However, these images are of uneven quality: many were taken without scientific intent, often with low-quality lenses and unknown camera parameters, and are sometimes only available in low resolution. These limitations can introduce significant uncertainties and biases in the information obtained from camera orientation and calibration. Useful quantitative data may still be extracted, but accounting for potential lens distortions and quantifying the uncertainty of the results become important.

In this paper, we present a method to address both goals, applying it to the estimation of the area of landscape features, with the determination of the areas of mountain glaciers in mind. The present technique is a two-step process based on Bayesian inference.

First, the unknown camera parameters, including lens optical distortions, are estimated using a Bayesian formulation of the camera orientation with calibration problem, in the form of a spatial resection problem: the posterior probability density function (PDF) of camera parameters is obtained from matches between 2D points in the image and 3D points in world coordinates, together with any available prior information. Bayesian approaches to camera calibration have been presented by several authors, either based on finding a single value of the parameters which maximizes a posterior distribution (e.g. Valkenburg, 1998, Zhang, 2000) or using the whole resulting posterior distribution more extensively (e.g. Sundareswara and Schrater, 2005). In our work, we keep the full statistical information contained in the posterior distribution of camera parameters, by generating samples distributed according to the posterior distribution using Markov chain Monte Carlo (MCMC) sampling.
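To make the idea of sampling a camera-parameter posterior concrete, here is a minimal sketch in Python. It is not the authors' implementation: it assumes a distortion-free pinhole camera at the origin with a single unknown parameter (the focal length in pixels), a Gaussian prior, Gaussian pixel noise, and a plain random-walk Metropolis sampler instead of whatever sampler the paper actually uses. All function names and numerical values are hypothetical.

```python
import numpy as np

def project(points_3d, focal):
    """Pinhole projection, camera at the origin looking along +Z (assumed model)."""
    return focal * points_3d[:, :2] / points_3d[:, 2:3]

def log_posterior(focal, points_3d, points_2d, sigma_px=1.0,
                  prior_mean=1000.0, prior_sd=300.0):
    """Unnormalized log posterior: Gaussian prior plus Gaussian pixel-noise likelihood."""
    if focal <= 0:
        return -np.inf
    log_prior = -0.5 * ((focal - prior_mean) / prior_sd) ** 2
    resid = points_2d - project(points_3d, focal)
    log_like = -0.5 * np.sum((resid / sigma_px) ** 2)
    return log_prior + log_like

def metropolis(log_prob, start, step, n_steps, rng):
    """Random-walk Metropolis sampler for a scalar parameter."""
    chain = np.empty(n_steps)
    current, current_lp = start, log_prob(start)
    for i in range(n_steps):
        proposal = current + step * rng.normal()
        proposal_lp = log_prob(proposal)
        # Accept with probability min(1, posterior ratio).
        if np.log(rng.uniform()) < proposal_lp - current_lp:
            current, current_lp = proposal, proposal_lp
        chain[i] = current
    return chain

# Synthetic 2D-3D matches (hypothetical data, generated with a known focal length).
rng = np.random.default_rng(0)
true_focal = 1200.0
world = rng.uniform([-50, -50, 200], [50, 50, 400], size=(12, 3))
image = project(world, true_focal) + rng.normal(scale=1.0, size=(12, 2))

chain = metropolis(lambda f: log_posterior(f, world, image),
                   start=1000.0, step=2.0, n_steps=5000, rng=rng)
samples = chain[1000:]  # discard burn-in
print(f"focal = {samples.mean():.1f} +/- {samples.std():.1f} px")
```

The retained `samples` play the role of the posterior samples described above: any downstream quantity can be evaluated on each sample to obtain its own distribution, rather than relying on a single best-fit value.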

In a second step, the posterior PDF of the feature area is derived from the posterior of camera parameters by solving an inverse perspective problem. The outline of the feature of interest is manually traced on the photograph as a polygon, which is then back projected from the 2D image onto the 3D world using a digital elevation model (DEM). This back projection step accounts for uncertainties in the camera parameters, and attempts to model possible systematic errors introduced by the polygon tracing step, together with uncertainties introduced by the DEM. In particular, we propose a model for DEM errors for which both the root mean square error (RMSE) and spatial autocorrelation scale are locally varying. The resulting PDF of the back projected 3D feature area therefore contains information about most of the uncertainties of the process.
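The propagation from camera-parameter samples to an area distribution can be illustrated with a deliberately simplified sketch. It assumes a nadir-looking pinhole camera above perfectly flat ground (so no DEM and no DEM error model), stand-in Gaussian draws in place of real posterior samples, and independent random jitter of the traced vertices rather than the systematic tracing error model described above. All names and values are hypothetical.

```python
import numpy as np

def shoelace_area(xy):
    """Area of a simple polygon from its vertices (shoelace formula)."""
    x, y = xy[:, 0], xy[:, 1]
    return 0.5 * abs(np.dot(x, np.roll(y, -1)) - np.dot(y, np.roll(x, -1)))

def back_project_flat(pixels, focal, height):
    """Nadir pinhole camera at `height` (m) above flat ground: pixels -> metres."""
    return pixels * (height / focal)

rng = np.random.default_rng(1)

# Traced polygon in pixel coordinates relative to the principal point (hypothetical).
polygon_px = np.array([[0.0, 0.0], [100.0, 0.0], [100.0, 100.0], [0.0, 100.0]])

# Stand-ins for posterior samples of camera parameters (assumed, not from the paper).
focal_samples = rng.normal(1000.0, 20.0, size=2000)   # pixels
height_samples = rng.normal(1000.0, 5.0, size=2000)   # metres
trace_sd_px = 1.0                                     # assumed tracing noise

areas = np.empty(focal_samples.size)
for i, (f, h) in enumerate(zip(focal_samples, height_samples)):
    # One random realization of the traced outline per parameter sample.
    jittered = polygon_px + rng.normal(scale=trace_sd_px, size=polygon_px.shape)
    areas[i] = shoelace_area(back_project_flat(jittered, f, h))

lo, med, hi = np.percentile(areas, [2.5, 50, 97.5])
print(f"area = {med:.0f} m^2 (95% interval {lo:.0f} to {hi:.0f})")
```

The array `areas` is an empirical approximation of the area PDF under these assumptions; in a realistic setting the flat-ground back projection would be replaced by a ray and DEM intersection evaluated for each sample.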

More generally, we attempt to unify the camera orientation with calibration, uncertainty modeling and inverse perspective problems into a statistically consistent framework which can be extended to similar classes of problems and uncertainty models.

We stress that given the diverse nature and sources of our target images and the fact that many were taken using low-quality or unknown equipment, the focus of this paper is more on uncertainty estimation than on very accurate photogrammetric techniques. The reconstructed camera location, for example, cannot be expected to be more accurate than a few meters, given that we will be working with low resolution landscape images, and that our 3D ground control point coordinates will not come from precision geodetic sources.

We start by describing our Bayesian formulation of the camera orientation with calibration problem in Section 2, and our polygon back projection method in Section 3. Section 4 describes details of our implementation, including the posterior probability density sampling process. In Section 5, we present test problems for validation, before discussing the results in Section 6 and concluding in Section 7.

Section snippets

Camera orientation and calibration

Extracting metric measurements from digital images requires estimating the parameters of the imaging camera used to take the picture, such as its position and orientation, focal length, and possibly other optical properties. For typical orientation problems, this may be done by matching points with known 3D world coordinates with their corresponding 2D projections in the image under study. Camera orientation and calibration then consists in finding the camera parameters that best reproduce the

Inverse perspective and uncertainties

We now discuss the inverse perspective step, in the form of the back projection, which is needed to reconstruct the area of a landscape feature from its outline in the image. In Section 3.1, we first describe the back projection process through which we obtain the area S from the camera calibration results. To account for uncertainties in both the polygon tracing process and the DEM elevations, we derive in Section 3.2 the posterior distribution of S, assuming imperfect knowledge of the polygon and

Implementation and MCMC sampling

The first step towards implementation is evaluating the Bayesian posterior on camera parameters $θ$. We have by now specified the full prior (Eq. (11) with terms discussed in Section 2.4), and likelihood using the camera model and Eqs. (17), (19). We can therefore evaluate the posterior probability density $p(θ | D, I)$ using Eq. (2) for any value of the parameter $θ$.
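Eq. (2) is not reproduced in this excerpt. Assuming it is Bayes' theorem in its standard form, with $D$ the data (the 2D–3D point matches) and $I$ the background information, the posterior would read:

$$p(θ | D, I) = \frac{p(D | θ, I)\, p(θ | I)}{p(D | I)}$$

Because the evidence $p(D | I)$ does not depend on $θ$, a sampler only needs the unnormalized log posterior, $\ln p(θ | D, I) = \ln p(D | θ, I) + \ln p(θ | I) + \mathrm{const}$, which is the sum of the log likelihood and the log prior.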

Validation

In this section, we set up validation case studies to demonstrate the technique presented in this paper. We use a combination of aerial and terrestrial oblique photographs, of different origins and quality, to assess key aspects of our method.

First, we consider the problem of camera orientation with calibration, by fitting images for which the camera location is known approximately, and comparing the obtained posterior on the camera position to the available photograph information.

Based on

Discussion

In Section 5.2 we compared the results of the camera calibration procedure to known values of camera parameters. We found that the errors in the reconstructed camera parameters for terrestrial oblique pictures are of the same order of magnitude (a few meters) as the basic sources of uncertainty in the method, such as the 3D position of the ground control points. This indicates an accurate reconstruction of the camera position by our method, to the intrinsic level of accuracy allowed by the data

Conclusion

In this paper, we presented a novel method for estimating surface area information from landscape features using single aerial and terrestrial photographs. Driven by the goal of characterizing uncertainties on the solutions of inverse perspective problems for archival or non-scientific photographs, we introduced models for errors in input data, as well as for characterizing uncertainties in digital elevation models. We integrated these ingredients into a statistically consistent Bayesian

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The authors would like to thank the anonymous reviewers for their feedback and comments, which have contributed to significantly improving the manuscript. We would also like to thank the French Institut Géographique National (IGN) for helpful exchanges, and for providing us with valuable information on the aerial missions. Some map data is copyrighted by OpenStreetMap contributors and available from https://www.openstreetmap.org. Funding: This study is part of the ANR 14-CE03-0006 VIP Mont

References (70)

D. Feurer et al. Joining multi-epoch archival aerial images in a single SfM block allows 3-D change detection with almost exclusively image information. ISPRS J. Photogram. Remote Sens. (2018)

A. Gruen et al. Road extraction from aerial and satellite images by dynamic programming. ISPRS J. Photogram. Remote Sens. (1995)

N. Haala et al. Extraction of buildings and trees in urban environments. ISPRS J. Photogram. Remote Sens. (1999)

K.W. Holmes et al. Error in a USGS 30-meter digital elevation model and its impact on terrain modeling. J. Hydrol. (2000)

M. Jauregui et al. A procedure for map updating using digital mono-plotting. Comput. Geosci. (2002)

C.A. Stockdale et al. Extracting ecological information from oblique angle terrestrial landscape photographs: performance evaluation of the WSL Monoplotting Tool. Appl. Geogr. (2015)

A. Streilein. Towards automation in architectural photogrammetry: CAD-based 3D-feature extraction. ISPRS J. Photogram. Remote Sens. (1994)

J. Wang et al. A new calibration model of camera lens distortion. Pattern Recogn. (2008)

C. Bozzini et al. A new monoplotting tool to extract georeferenced vector data and orthorectified raster data from oblique non-metric photographs. Int. J. Heritage Digital Era (2012)

D.C. Brown. Close-range camera calibration. Photogram. Eng. (1971)

B.H. Carlisle. Modelling the spatial distribution of DEM error. Trans. GIS (2005)

M.T. Čekada et al. Monitoring Glacier Changes with the Use of Archive Images: The Example of the Julian Alps (NW Slovenia, NE Italy)

A. Chapuis et al. Interpretation of amplitude data from a ground-based radar in combination with terrestrial photogrammetry and visual observations for calving monitoring of Kronebreen, Svalbard. Annals Glaciol. (2010)

CIPA Standardization Committee, Guideline for Noting Digital Camera Specifications in Catalogs, Revised Version (Oct....

A. Criminisi. Accurate Visual Metrology from Single and Multiple Uncalibrated Images, Distinguished Dissertations (2001)

European GNSS Agency, EGNOS Open Service (OS) Service Definition Document (Oct....

Fisher, P, 1991. First Experiments in Viewshed Uncertainty: The Accuracy of the Viewshed Area, Photogrammetric...

P. Fisher. Improved modeling of elevation error with geostatistics. GeoInformatica (1998)

T.C.O. Fonseca et al. Objective Bayesian analysis for the Student-t regression model. Biometrika (2008)

D. Foreman-Mackey et al. Emcee: the MCMC Hammer. Publ. Astron. Soc. Pac. (2013)

Förstner, W., Wrobel, B.P., 2016. Photogrammetric Computer Vision, Vol. 11 of Geometry and Computing, Springer...

A. Gelman et al. Posterior predictive assessment of model fitness via realized discrepancies. Stat. Sin. (1996)

A. Gelman et al. Bayesian Data Analysis (2013)

S.K. Ghosh. Fundamentals of Computational Photogrammetry (2005)

J. Goodman et al. Ensemble samplers with affine invariance. Commun. Appl. Math. Comput. Sci. (2010)

A. Gruen. Adaptive least squares correlation: a powerful image matching technique. South African J. Photogram., Remote Sens., Cartogr. (1985)

W. Haneberg. Effects of digital elevation model errors on spatially distributed seismic slope stability calculations: an example from Seattle, Washington. Environ. Eng. Geosci. (2006)

Heikkila, J., Silven, O., 1997. A four-step camera calibration procedure with implicit image correction. In:...

C. Heipke. A global approach for least-squares image matching and surface reconstruction in object space. Photogram. Eng. (1992)

Hobbie, D., 2010. The development of photogrammetric instruments and methods at Carl Zeiss in Oberkochen,...

Hogg, D.W., Foreman-Mackey, D., 2017. Data analysis recipes: Using Markov Chain Monte Carlo, arXiv:1710.06068...

Hogg, D.W., Bovy, J., Lang, D., 2010. Data analysis recipes: Fitting a model to data, arXiv:1008.4686 [astro-ph,...

G.J. Hunter et al. Modeling the uncertainty of slope and aspect estimates derived from spatial databases. Geogr. Anal. (1997)

I.G. National, Remonter le temps, https://remonterletemps.ign.fr/, Jul....

E. Jordan et al. Estimation by photogrammetry of the glacier recession on the Cotopaxi Volcano (Ecuador) between 1956 and 1997. Hydrol. Sci. J. (2005)

