Abstract
Computation of normalizing constants is a fundamental mathematical problem in various disciplines, particularly in Bayesian model selection problems. A sampling-based technique known as bridge sampling (Meng and Wong in Stat Sin 6(4):831–860, 1996) has been found to produce accurate estimates of normalizing constants and is shown to possess good asymptotic properties. For small to moderate sample sizes (as in situations with limited computational resources), we demonstrate that the (optimal) bridge sampler produces biased estimates. Specifically, when one density (we denote as \(p_2\)) is constructed to be close to the target density (we denote as \(p_1\)) using method of moments, our simulation-based results indicate that the correlation-induced bias through the moment-matching procedure is non-negligible. More crucially, the bias amplifies as the dimensionality of the problem increases. Thus, a series of theoretical as well as empirical investigations is carried out to identify the nature and origin of the bias. We then examine the effect of sample size allocation on the accuracy of bridge sampling estimates and discovered that one possibility of reducing both the bias and standard error with a small increase in computational effort is by drawing extra samples from the moment-matched density \(p_2\) (which we assume easy to sample from), provided that the evaluation of \(p_1\) is not too expensive. We proceed to show how the simple adaptive approach we termed “splitting” manages to alleviate the correlation-induced bias at the expense of a higher standard error, irrespective of the dimensionality involved. We also slightly modified the strategy suggested by Wang et al. (Warp bridge sampling: the next generation, Preprint, 2019. arXiv:1609.07690) to address the issue of the increase in standard error due to splitting, which is later generalized to further improve the efficiency. We conclude the paper by offering our insights of the application of a combination of these adaptive methods to improve the accuracy of bridge sampling estimates in Bayesian applications (where posterior samples are typically expensive to generate) based on the preceding investigations, with an application to a practical example.
Similar content being viewed by others
References
Augustine Kong, A., Liu, J., Wong, W.H.: Sequential imputations and Bayesian missing data problems. J. Am. Stat. Assoc. 89(425), 278–288 (1994)
Bartolucci, F., Scaccia, L., Mira, A.: Efficient Bayes factor estimation from the reversible jump output. Biometrika 93(1), 41–52 (2006)
Bennett, C.H.: Efficient estimation of free energy differences from Monte Carlo data. J. Comput. Phys. 22(2), 245–268 (1976)
Carlin, B.P., Chib, S.: Bayesian model choice via Markov chain Monte Carlo methods. J. R. Stat. Soc. Ser. B 57, 473–484 (1995)
Carlin, B.P., Louis, T.A.: Bayes and Empirical Bayes Methods for Data Analysis, 2nd edn. Chapman & Hall, London (2000)
Chen, M.H., Shao, Q.M., Ibrahim, J.G.: Monte Carlo Methods in Bayesian Computation. Springer, Berlin (2000)
Chib, S.: Marginal likelihood from the Gibbs output. J. Am. Stat. Assoc. 90(432), 1313–1321 (1995)
Frenkel, D.: Free-energy computation and first-order phase transitions. In: Ciccotti, G., Hoover, W.G. (eds.) Molecular-Dynamics Simulation of Statistical Mechanical Systems, pp. 151–188. North-Holland, Amsterdam (1986)
Frühwirth-Schnatter, S.: Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques. Econom. J. 7(1), 143–167 (2004)
Gelfand, A.E., Dey, D.K.: Bayesian model choice: asymptotic and exact calculations. J. R. Stat. Soc. Ser. B (Mehodol.) 56(3), 501–514 (1994)
Gelman, A., Meng, X.L.: Simulating normalizing constants: from importance sampling to bridge sampling to path sampling. Stat. Sci. 13(2), 163–185 (1998)
Gelman, A., Rubin, D.B., Carlin, J.B., Stern, H.S.: Bayesian Data Analysis, 1st edn. Chapman & Hall, London (1995)
Geweke, J.: Bayesian inference in econometric models using Monte Carlo integration. Econometrica 57(6), 1317–1339 (1989)
Green, P.J.: Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika 82, 711–732 (1995)
Gronau, Q.F., Sarafoglou, A., Matzke, D., Ly, A., Boehm, U., Marsman, M., Leslie, D.S., Forster, J.J., Wagenmakers, E.J., Steingroever, H.: A tutorial on bridge sampling. J. Math. Psychol. 81, 80–97 (2017a)
Gronau, Q.F., Singmann, H., Wagenmakers, E.J.: Bridgesampling: bridge sampling for marginal likelihoods and Bayes factors (2017b). https://github.com/quentingronau/bridgesampling (R Package Version 0.2-2)
Guy, J.A., Bijak, J., Forster, J.J., Raymer, J., Smith, P.W.F., Wong, J.S.T.: Integrating uncertainty in time series population forecasts: an illustration using a simple projection model. Demogr. Res. 29(43), 1187–1226 (2013)
Irwin, M., Cox, N., Kong, A.: Sequential imputation for multilocus linkage analysis. Proc. Natl. Acad. Sci. USA 91(24), 11684–11688 (1994)
Jensen, C.S., Kong, A.: Blocking Gibbs sampling for linkage analysis in large pedigrees with many loops. Am. J. Hum. Genet. 65(3), 885–901 (1999)
Johnson, N.L., Kotz, S., Balakrishnan, N.: Continuous Univariate Distributions, vol. 2. Wiley Series in Probability and Mathematical Statistics, Berlin (1995)
Kass, R.E., Raftery, A.E.: Bayes factors. J. Am. Stat. Assoc. 90(430), 773–795 (1995)
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QE, Morgan Kaufmann, pp. 1137–1143, Los Altos (1995)
Kong, A., McCullagh, P., Meng, X.L., Nicolae, D., Tan, Z.: A theory of statistical models for Monte Carlo integration. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 65(3), 585–604 (2003)
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
Lee, S.Y., Song, X.Y., Lee, J.C.K.: Maximum likelihood estimation of nonlinear structural equation models with ignorable missing data. J. Educ. Behav. Stat. 28(2), 111–134 (2003)
Lopes, H.F., West, M.: Bayesian model assessment in factor analysis. Stat. Sin. 14(1), 41–67 (2004)
Lunn, D.J., Thomas, A., Best, N., Spiegelhalter, D.: WinBUGS—a Bayesian modelling framework: concepts, structure, and extensibility. Stat. Comput. 10(2000), 325–337 (2000)
Meng, X.L., Schilling, S.: Fitting full-information item factor models and an empirical investigation of bridge sampling. J. Am. Stat. Assoc. 91(435), 1254–1264 (1996)
Meng, X.L., Wong, W.H.: Simulating ratios of normalizing constants via a simple identity: a theoretical exploration. Stat. Sin. 6(4), 831–860 (1996)
Mira, A., Nicholls, G.: Bridge estimation of the probability density at a point. Stat. Sin. 14(2), 603–612 (2004)
Neal, R.M.: Probabilistic inference using Markov chain Monte Carlo methods. Technical Report CRG-TR-93-1, Department of Computer Science, University of Toronto (1993)
Neal, R.M.: Contribution to the discussion of “Approximate Bayesian inference with the weighted likelihood bootstrap” by Michael A. Newton and Adrian E. Raftery. J. R. Stat. Soc. Ser. B (Mehodol.) 56(1), 41–42 (1994)
Newton, M.A., Raftery, A.E.: Approximate Bayesian inference with the weighted likelihood bootstrap. J. R. Stat. Soc. Ser. B (Mehodol.) 56(1), 3–48 (1994)
Overstall, A.M., Forster, J.J.: Default Bayesian model determination methods for generalised linear mixed models. Comput. Stat. Data Anal. 54(12), 3269–3288 (2010)
Sinharay, S., Stern, H.S.: An empirical comparison of methods for computing Bayes factor in generalized linear mixed models. J. Comput. Graph. Stat. 14(2), 415–435 (2005)
Sturtz, S., Ligges, U., Gelman, A.: R2OpenBUGS: a package for running OpenBUGS from R (2010). http://cran.r-project.org/web/packages/R2OpenBUGS/vignettes/~R2OpenBUGS.pdf. Accessed 25 Sept. 2019
Tan, Z.: Calibrated path sampling and stepwise bridge sampling. J. Stat. Plan. Inference 143(4), 675–690 (2013)
Wang, L., Jones, D.E., Meng, X.L.: Warp bridge sampling: the next generation. Preprint (2019). arXiv:1609.07690
Wong, J.S.T.: Bayesian estimation and model comparison for mortality forecasting. Ph.D. Thesis, University of Southampton (2017)
Wong, J.S.T., Forster, J.J., Smith, P.W.F.: Bayesian mortality forecasting with overdispersion. Insur. Math. Econ. 83(2018), 206–221 (2018)
Acknowledgements
The authors greatly appreciate the comments from the anonymous referees involved, who helped in improving the quality of the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wong, J.S.T., Forster, J.J. & Smith, P.W.F. Properties of the bridge sampler with a focus on splitting the MCMC sample. Stat Comput 30, 799–816 (2020). https://doi.org/10.1007/s11222-019-09918-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11222-019-09918-5