Properties of the bridge sampler with a focus on splitting the MCMC sample

Wong, Jackie S. T.; Forster, Jonathan J.; Smith, Peter W. F.

doi:10.1007/s11222-019-09918-5

Properties of the bridge sampler with a focus on splitting the MCMC sample

Published: 18 January 2020

Volume 30, pages 799–816, (2020)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

Jackie S. T. Wong ORCID: orcid.org/0000-0002-0314-6684¹,
Jonathan J. Forster² &
Peter W. F. Smith³

373 Accesses
4 Citations
9 Altmetric
1 Mention
Explore all metrics

Abstract

Computation of normalizing constants is a fundamental mathematical problem in various disciplines, particularly in Bayesian model selection problems. A sampling-based technique known as bridge sampling (Meng and Wong in Stat Sin 6(4):831–860, 1996) has been found to produce accurate estimates of normalizing constants and is shown to possess good asymptotic properties. For small to moderate sample sizes (as in situations with limited computational resources), we demonstrate that the (optimal) bridge sampler produces biased estimates. Specifically, when one density (we denote as \(p_2\)) is constructed to be close to the target density (we denote as \(p_1\)) using method of moments, our simulation-based results indicate that the correlation-induced bias through the moment-matching procedure is non-negligible. More crucially, the bias amplifies as the dimensionality of the problem increases. Thus, a series of theoretical as well as empirical investigations is carried out to identify the nature and origin of the bias. We then examine the effect of sample size allocation on the accuracy of bridge sampling estimates and discovered that one possibility of reducing both the bias and standard error with a small increase in computational effort is by drawing extra samples from the moment-matched density \(p_2\) (which we assume easy to sample from), provided that the evaluation of \(p_1\) is not too expensive. We proceed to show how the simple adaptive approach we termed “splitting” manages to alleviate the correlation-induced bias at the expense of a higher standard error, irrespective of the dimensionality involved. We also slightly modified the strategy suggested by Wang et al. (Warp bridge sampling: the next generation, Preprint, 2019. arXiv:1609.07690) to address the issue of the increase in standard error due to splitting, which is later generalized to further improve the efficiency. We conclude the paper by offering our insights of the application of a combination of these adaptive methods to improve the accuracy of bridge sampling estimates in Bayesian applications (where posterior samples are typically expensive to generate) based on the preceding investigations, with an application to a practical example.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 5

Violating the normality assumption may be the lesser of two evils

Article Open access 07 May 2021

Entropy-Based Subsampling Methods for Big Data

Article 11 April 2024

A Guide for Sparse PCA: Model Comparison and Applications

Article Open access 29 June 2021

References

Augustine Kong, A., Liu, J., Wong, W.H.: Sequential imputations and Bayesian missing data problems. J. Am. Stat. Assoc. 89(425), 278–288 (1994)
MATH Google Scholar
Bartolucci, F., Scaccia, L., Mira, A.: Efficient Bayes factor estimation from the reversible jump output. Biometrika 93(1), 41–52 (2006)
MathSciNet MATH Google Scholar
Bennett, C.H.: Efficient estimation of free energy differences from Monte Carlo data. J. Comput. Phys. 22(2), 245–268 (1976)
MathSciNet Google Scholar
Carlin, B.P., Chib, S.: Bayesian model choice via Markov chain Monte Carlo methods. J. R. Stat. Soc. Ser. B 57, 473–484 (1995)
MATH Google Scholar
Carlin, B.P., Louis, T.A.: Bayes and Empirical Bayes Methods for Data Analysis, 2nd edn. Chapman & Hall, London (2000)
MATH Google Scholar
Chen, M.H., Shao, Q.M., Ibrahim, J.G.: Monte Carlo Methods in Bayesian Computation. Springer, Berlin (2000)
MATH Google Scholar
Chib, S.: Marginal likelihood from the Gibbs output. J. Am. Stat. Assoc. 90(432), 1313–1321 (1995)
MathSciNet MATH Google Scholar
Frenkel, D.: Free-energy computation and first-order phase transitions. In: Ciccotti, G., Hoover, W.G. (eds.) Molecular-Dynamics Simulation of Statistical Mechanical Systems, pp. 151–188. North-Holland, Amsterdam (1986)
Google Scholar
Frühwirth-Schnatter, S.: Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques. Econom. J. 7(1), 143–167 (2004)
MathSciNet MATH Google Scholar
Gelfand, A.E., Dey, D.K.: Bayesian model choice: asymptotic and exact calculations. J. R. Stat. Soc. Ser. B (Mehodol.) 56(3), 501–514 (1994)
MathSciNet MATH Google Scholar
Gelman, A., Meng, X.L.: Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Stat. Sci. 13(2), 163–185 (1998)
MathSciNet MATH Google Scholar
Gelman, A., Rubin, D.B., Carlin, J.B., Stern, H.S.: Bayesian Data Analysis, 1st edn. Chapman & Hall, London (1995)
MATH Google Scholar
Geweke, J.: Bayesian inference in econometric models using Monte Carlo integration. Econometrica 57(6), 1317–1339 (1989)
MathSciNet MATH Google Scholar
Green, P.J.: Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82, 711–732 (1995)
MathSciNet MATH Google Scholar
Gronau, Q.F., Sarafoglou, A., Matzke, D., Ly, A., Boehm, U., Marsman, M., Leslie, D.S., Forster, J.J., Wagenmakers, E.J., Steingroever, H.: A tutorial on bridge sampling. J. Math. Psychol. 81, 80–97 (2017a)
MathSciNet MATH Google Scholar
Gronau, Q.F., Singmann, H., Wagenmakers, E.J.: Bridgesampling: bridge sampling for marginal likelihoods and Bayes factors (2017b). https://github.com/quentingronau/bridgesampling (R Package Version 0.2-2)
Guy, J.A., Bijak, J., Forster, J.J., Raymer, J., Smith, P.W.F., Wong, J.S.T.: Integrating uncertainty in time series population forecasts: an illustration using a simple projection model. Demogr. Res. 29(43), 1187–1226 (2013)
Google Scholar
Irwin, M., Cox, N., Kong, A.: Sequential imputation for multilocus linkage analysis. Proc. Natl. Acad. Sci. USA 91(24), 11684–11688 (1994)
Google Scholar
Jensen, C.S., Kong, A.: Blocking Gibbs sampling for linkage analysis in large pedigrees with many loops. Am. J. Hum. Genet. 65(3), 885–901 (1999)
Google Scholar
Johnson, N.L., Kotz, S., Balakrishnan, N.: Continuous Univariate Distributions, vol. 2. Wiley Series in Probability and Mathematical Statistics, Berlin (1995)
MATH Google Scholar
Kass, R.E., Raftery, A.E.: Bayes factors. J. Am. Stat. Assoc. 90(430), 773–795 (1995)
MathSciNet MATH Google Scholar
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QE, Morgan Kaufmann, pp. 1137–1143, Los Altos (1995)
Kong, A., McCullagh, P., Meng, X.L., Nicolae, D., Tan, Z.: A theory of statistical models for Monte Carlo integration. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 65(3), 585–604 (2003)
MathSciNet MATH Google Scholar
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
MathSciNet MATH Google Scholar
Lee, S.Y., Song, X.Y., Lee, J.C.K.: Maximum likelihood estimation of nonlinear structural equation models with ignorable missing data. J. Educ. Behav. Stat. 28(2), 111–134 (2003)
Google Scholar
Lopes, H.F., West, M.: Bayesian model assessment in factor analysis. Stat. Sin. 14(1), 41–67 (2004)
MathSciNet MATH Google Scholar
Lunn, D.J., Thomas, A., Best, N., Spiegelhalter, D.: WinBUGS—a Bayesian modelling framework: concepts, structure, and extensibility. Stat. Comput. 10(2000), 325–337 (2000)
Google Scholar
Meng, X.L., Schilling, S.: Fitting full-information item factor models and an empirical investigation of bridge sampling. J. Am. Stat. Assoc. 91(435), 1254–1264 (1996)
MATH Google Scholar
Meng, X.L., Wong, W.H.: Simulating ratios of normalizing constants via a simple identity: a theoretical exploration. Stat. Sin. 6(4), 831–860 (1996)
MathSciNet MATH Google Scholar
Mira, A., Nicholls, G.: Bridge estimation of the probability density at a point. Stat. Sin. 14(2), 603–612 (2004)
MathSciNet MATH Google Scholar
Neal, R.M.: Probabilistic inference using Markov chain Monte Carlo methods. Technical Report CRG-TR-93-1, Department of Computer Science, University of Toronto (1993)
Neal, R.M.: Contribution to the discussion of “Approximate Bayesian inference with the weighted likelihood bootstrap” by Michael A. Newton and Adrian E. Raftery. J. R. Stat. Soc. Ser. B (Mehodol.) 56(1), 41–42 (1994)
Google Scholar
Newton, M.A., Raftery, A.E.: Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B (Mehodol.) 56(1), 3–48 (1994)
MathSciNet MATH Google Scholar
Overstall, A.M., Forster, J.J.: Default Bayesian model determination methods for generalised linear mixed models. Comput. Stat. Data Anal. 54(12), 3269–3288 (2010)
MathSciNet MATH Google Scholar
Sinharay, S., Stern, H.S.: An empirical comparison of methods for computing Bayes factor in generalized linear mixed models. J. Comput. Graph. Stat. 14(2), 415–435 (2005)
MathSciNet Google Scholar
Sturtz, S., Ligges, U., Gelman, A.: R2OpenBUGS: a package for running OpenBUGS from R (2010). http://cran.r-project.org/web/packages/R2OpenBUGS/vignettes/~R2OpenBUGS.pdf. Accessed 25 Sept. 2019
Tan, Z.: Calibrated path sampling and stepwise bridge sampling. J. Stat. Plan. Inference 143(4), 675–690 (2013)
MathSciNet MATH Google Scholar
Wang, L., Jones, D.E., Meng, X.L.: Warp bridge sampling: the next generation. Preprint (2019). arXiv:1609.07690
Wong, J.S.T.: Bayesian estimation and model comparison for mortality forecasting. Ph.D. Thesis, University of Southampton (2017)
Wong, J.S.T., Forster, J.J., Smith, P.W.F.: Bayesian mortality forecasting with overdispersion. Insur. Math. Econ. 83(2018), 206–221 (2018)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors greatly appreciate the comments from the anonymous referees involved, who helped in improving the quality of the paper.

Author information

Authors and Affiliations

Department of Mathematical Sciences, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, UK
Jackie S. T. Wong
Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK
Jonathan J. Forster
Southampton Statistical Sciences Research Institute (S3RI), University of Southampton, Highfield, Southampton, SO17 1BJ, UK
Peter W. F. Smith

Authors

Jackie S. T. Wong
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan J. Forster
View author publications
You can also search for this author in PubMed Google Scholar
Peter W. F. Smith
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jackie S. T. Wong.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 204 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wong, J.S.T., Forster, J.J. & Smith, P.W.F. Properties of the bridge sampler with a focus on splitting the MCMC sample. Stat Comput 30, 799–816 (2020). https://doi.org/10.1007/s11222-019-09918-5

Download citation

Received: 17 May 2019
Accepted: 25 December 2019
Published: 18 January 2020
Issue Date: July 2020
DOI: https://doi.org/10.1007/s11222-019-09918-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Properties of the bridge sampler with a focus on splitting the MCMC sample

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Entropy-Based Subsampling Methods for Big Data

A Guide for Sparse PCA: Model Comparison and Applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Electronic supplementary material

Supplementary material 1 (pdf 204 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Properties of the bridge sampler with a focus on splitting the MCMC sample

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Entropy-Based Subsampling Methods for Big Data

A Guide for Sparse PCA: Model Comparison and Applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Electronic supplementary material

Supplementary material 1 (pdf 204 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation