An objective Bayesian analysis of the change point problem

Moreno, Elías; Casella, George; Garcia-Ferrer, Antonio

doi:10.1007/s00477-004-0224-2

An objective Bayesian analysis of the change point problem

Original Paper
Published: 10 March 2005

Volume 19, pages 191–204, (2005)
Cite this article

Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Elías Moreno¹,
George Casella² &
Antonio Garcia-Ferrer³

344 Accesses
20 Citations
Explore all metrics

Abstract

The Bayesian literature on the change point problem deals with the inference of a change in the distribution of a set of time-ordered data based on a sample of fixed size. This is the so-called “retrospective or off-line” analysis of the change point problem. A related but different problem is that of the “sequential” change point detection, mainly analyzed from a frequentist viewpoint. While the former typically focuses on the estimation of the position in which the change point occurs, the latter is a testing problem which has a natural formulation as a Bayesian model selection problem. In this paper we provide such a Bayesian formulation, which generalizes previous formulations such as the well-known CUSUM stopping rule. We show that the conventional improper priors (also called non-informative, objective or default), cannot be used either for sequential detection of the change or for retrospective estimation. Then, we propose objective intrinsic prior distributions for the unknown model parameters. The normal and Poisson cases are studied in detail and examples with simulated and real data are provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Nonparametric Tests and Nested Sequential Sampling Plans for Change-Point Detection

Article 28 January 2017

A Multiple Hypothesis Testing Approach to Detection Changes in Distribution

Article 01 April 2019

Practical Aspects of False Alarm Control for Change Point Detection: Beyond Average Run Length

Article Open access 17 May 2018

References

Abramowitz M, Stegun IA (1970) Handbook of mathematical functions. Dover Publications Inc., New York
Google Scholar
Balke NS (1993) Detecting level shifts in time series. J Business Econ Stat 11:81–92
Google Scholar
Berger JO (2000) Bayesian analysis: a look at today and thoughts of tomorrow. J Amer Stat Assoc 95:1269–1276
Google Scholar
Berger JO, Bernardo JM (1992) On the development of the reference prior method. In: Bernardo JM et al. (eds) Bayesian Statistics 4. Oxford University Press, London, pp 35–60
Google Scholar
Berger JO, Pericchi LR (1996) The intrinsic Bayes factor for model selection and prediction. J Amer Stat Assoc 91:109–122
Google Scholar
Berger JO, Sellke T (1987) Testing a point null hypothesis: the irreconcilibility of p-values and evidence (with discussion). J Amer Stat Assoc 82:112–122
Google Scholar
Berger JO, De Oliveira V, Sansó B (2001) Objective Bayesian analysis of spatially correlated data. J Amer Stat Assoc 96:1361–1374
Google Scholar
Carlin BP, Gelfand AE, Smith AFM (1992). Hierarchical Bayesian analysis of change point problems. Appl Stat 41:389-405
Google Scholar
Carlstein E (1988) Nonparametric change-point estimation. Ann Stat 16:188–197
Google Scholar
Casella G, Berger RL (1987). Reconciling Bayesian and frequentist evidence in the one-sided testing problem (with discussion). J Amer Stat Assoc 82:106–111
Google Scholar
Casella G, Moreno E (2002) Objective Bayesian variable selection. Technical Report, University of Granada
Casella G, Moreno E (2003) Objective Bayesian analysis of contingency tables. Technical Report, University of Granada
Casella G, Moreno E (2005) Intrinsic meta analysis of contingency tables. Stat Med 24:583–604
Google Scholar
Chernoff H, Zacks S (1964) Estimating the current mean of a normal distribution which is subjected to changes in time. Ann Math Stat 35:999–1018
Google Scholar
Choy JH, Broemeling LD (1980) Some Bayesian inferences for a changing linear model. Technometrics 22:71–78
Google Scholar
Clyde M. (2001) Discussion to Chipman H, George E, McCulloch RE (2001). IMS Lecture Notes-Monograph Series 38:67–134
Cobb GW (1978) The problem of the Nile: conditional solution to a change-point problem. Biometrika 65:243–251
Google Scholar
Dümbgen L (1991) The asymptotic behavior of some nonparametric change-point estimators. Ann Stat 19:1471–1495
Google Scholar
Ferreira PE (1975) A Bayesian analysis of a switching regression model: known number of regimes. J Amer Stat Assoc 70:370–374
Google Scholar
Hsu DA (1979) Detecting shifts in parameters in gamma sequences with application to stock price and air traffic flow analysis. J Amer Stat Assoc 74:31–40
Google Scholar
Jarret RG (1979) A note on the intervals between coal-mining disasters. Biometrika 66:191–193
Google Scholar
Jeffreys H (1961) Theory of probability. Oxford University Press, London
Google Scholar
Kim S, Sun D (2000). Intrinsic priors for model selection using an encompassing model. Life Time Data Anal 6:251–269
Google Scholar
Lai TL (1995) Sequential change point detection in quality control and dynamical systems. J R Stat Soc Ser B 57:613–658
Google Scholar
Lorden G (1971) Procedures for reacting to a change in distribution. Ann Math Stat 41:520–527
Google Scholar
Menzefrike U (1981) A bayesian analysis of a change in the precision for a sequence of independent normal random variables at an unknown time point. Appl Stat 30:141–146
Google Scholar
Moreno E, Liseo B (2003) Default prior for testing the number of components of a mixture. J Stat Plan Inference 111:129–142
Google Scholar
Moreno E, Bertolino F, Racugno W (1998) An intrinsic limiting procedure for model selection and hypothesis testing. J Amer Stat Assoc 93:1451–1460
Google Scholar
Moreno E, Bertolino F, Racugno W (1999) Default Bayesian analysis of the Behrens-Fisher problem. J Stat Plan Inference 81:323–333
Google Scholar
Moreno E, Bertolino F, Racugno W (2000) Bayesian model selection approach to analysis of variance under heterocedasticity. J R Stat Soc Ser D 49:1–15
Google Scholar
Moreno E, Bertolino F, Racugno W (2003) Bayesian inference under partial prior information. Scand J Stat 30:565–580
Google Scholar
Morris CN (1987) Discussion of Berger/Sellke and Casella/Berger. J Amer Stat Assoc 82:112–122
Google Scholar
Müller HG (1992). Change-points in nonparametric regression analysis. Ann Stat 20:737–761
Google Scholar
Page ES (1954) Continuous inspection schemes. Biometrika 41:100–114
Google Scholar
Page ES (1955) A test for a change in a parameter occurring at an unknown point. Biometrika 42:523–527
Google Scholar
Pettitt AN (1979) A non-parametric approach to the change-point problem. Appl Stat 28:126–135
Google Scholar
Pollak M, Siegmund D (1991) Sequential detection of a change in a normal mean when the initial value is unknown. Ann Stat 19:394–416
Google Scholar
Raftery AE, Akman VE (1986) Bayesian analysis of a Poisson process with a change-point. Biometrika 73:85–89
Google Scholar
Rudemo M (1982) Empirical choice of histograms and kernel density estimators. Scand J Stat 9:65–78
Google Scholar
Sen AK, Srivastava MS (1973) On multivariate test for detecting change in mean. Sankhy A 35:173–186
Google Scholar
Siegmund D (1988) Confidence sets in change point problems. Int Stat Rev 56:31–48
Google Scholar
Smith AFM (1975) A Bayesian approach to inference about a change-point in a sequence of random variables. Biometrika 62:407–416
Google Scholar
Smith AFM, Cook DG (1980) Straight lines with a change point: a Bayesian analysis of some renal ransplant data. Appl Stat 29:180–189
Google Scholar
Sweeting TJ (2001) Coverage probability bias, objective Bayes and the likelihood principle. Biometrika 88:657–675
Google Scholar
Wasserman L (2000) Asymptotic inference for mixture models using data-dependent priors. J R Stat Soc Ser B 62:159–180
Google Scholar
Worsley KJ (1986) Confidence regions and tests for a change-point in a sequence of exponential random variables. Biometrika 73:91–104
Google Scholar

Download references

Acknowledgements

We are grateful to two anonymous referees for their comments which have improved an earlier version of the paper. This work has been partially supported by Ministerio de Educación y Ciencia under grant SEJ2004-02447 and BEC2002-00081.

Author information

Authors and Affiliations

Universidad de Granada, Granada, Spain
Elías Moreno
University of Florida, Gainesville, USA
George Casella
Universidad Autónoma de Madrid, Madrid, Spain
Antonio Garcia-Ferrer

Authors

Elías Moreno
View author publications
You can also search for this author in PubMed Google Scholar
George Casella
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Garcia-Ferrer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elías Moreno.

Appendices

Appendix 1

Proof of Lemma 1.

For the models $M_0 :P\left( {x_1 |\theta } \right) = {{\theta ^{x_1 } } \mathord{\left/ {\vphantom {{\theta ^{x_1 } } {x_1 !}}} \right. \kern-\nulldelimiterspace} {x_1 !}}\,\exp \left\{ { - \theta } \right\},$ and $M_1 :\left\{ {P\left( {x_1 |\theta _1 } \right)P\left( {x_2 |\theta _2 } \right),\pi ^D \left( {\theta _1 ,\theta _2 } \right) = k\theta _1^{ - 1/2} \theta _2^{ - 1/2} } \right\},$ where θ is an arbitrary but fixed value, and k is an arbitrary positive constant. The minimal training sample is a pair of independent random variables X₁,X₂ such that under model M₁, X_i∽ P(x_i|θ _i), and under M₀, X_i ∽ P(x_i|θ), i=0,1. Then, simple calculations give

$$ B_{01}^N (x_1 ,x_2 ) = \frac{{\theta ^{x_1 + x_2 } \exp \{ - 2\theta \} }} {{k\Gamma (x_1 + 1/2)\Gamma (x_2 + 1/2)}}. $$

Furthermore,

$$ \begin{aligned} E_{x_1 ,x_2 |\theta _1 ,\theta _2 }^{M_1 } B_{01}^N (x_1 ,x_2 ) & = \frac{{\exp \{ - (\theta _1 + \theta _2 + 2\theta )\} }} {k}\sum\limits_{x_1 = 0}^\infty \,\frac{{(\theta \theta _1 )^{x_1 } }} {{\Gamma (x_1 + 1/2)x_1 !}} \\ & \quad \times \sum\limits_{x_2 = 0}^\infty \,\frac{{(\theta \theta _2 )^{x_2 } }} {{\Gamma (x_2 + 1/2)x_2 !}}. \\ \end{aligned} $$

Using the equality $\sum\nolimits_{x = 0}^\infty {{{\left( {\theta \;\theta _1 } \right)^{x_2 } } \mathord{\left/ {\vphantom {{\left( {\theta \;\theta _1 } \right)^{x_2 } } {\Gamma \left( {x + 1/2} \right)x!}}} \right. \kern-\nulldelimiterspace} {\Gamma \left( {x + 1/2} \right)x!}} = {{F_0^1 \left( {1/2,\theta \;\theta _1 } \right)} \mathord{\left/ {\vphantom {{F_0^1 \left( {1/2,\theta \;\theta _1 } \right)} {\Gamma \left( {1/2} \right)}}} \right. \kern-\nulldelimiterspace} {\Gamma \left( {1/2} \right)}}} $ and then substitution in Eq. 12, Lemma 1 follows.

Appendix 2

Proof of Lemma 2.

Consider the model

$$ M_0^ * :N(x|\theta ,\tau ^2 ), $$

and

$$ M_1 :\left\{ {N\left( {x|\mu _1 ,\sigma _1^2 } \right)N\left( {y|\mu _2 ,\sigma _2^2 } \right),\pi _1^N ({\varvec{\mu }},{\varvec{\sigma }}) = \frac{{c_1 }} {{\sigma _1 \sigma _2 }}} \right\}. $$

The minimal training sample is a random vector (X₁,X₂,Y₁,Y₂) with independent components such that under model M₁, X_i~ N(x_i|μ ₁,σ ²₁ ), Y_i~ N(y_i|μ ₂,σ ²₂ ), and under M ^*₀ , X_i, Y_i~ N(x|θ ,τ ²), i=1,2. We recall that a minimal training sample is a random vector of minimal size for which the marginal density is greater than zero and finite (except for a null set with respect to the Lebergue measure). Then,

$$ B_{01}^N (x,y) = \frac{1} {{m_1 ({\mathbf{x}},{\mathbf{y}})}}\prod\limits_{i = 1}^2 \,N(x_i |\theta ,\tau ^2 )N(y_i |\theta ,\tau ^2 ), $$

where

$$ m_1 (x,y) = c_1 \frac{1} {{2^2 |x_1 - x_2 ||y_1 - y_2 |}}. $$

Therefore,

$$ \begin{aligned} \pi ^I ({\varvec{\mu }},{\varvec{\sigma }}|\theta ,\tau ) & = \frac{1} {{4\sigma _1^3 \sigma _2^3 \tau ^4 }} \\ & \quad \times \int {|x_1 - x_2 |\exp \left\{ { - d_x^2 \left( {\tau ^{ - 2} + \sigma _1^{ - 2} } \right) - \frac{{(m_x - \theta )^2 }} {{\tau ^2 }} - \frac{{(m_x - \mu _1 )^2 }} {{\sigma _1^2 }}} \right\}{\text{d}}x_1 {\text{d}}x_2 } \\ & \quad \times \int {|y_1 - y_2 |\exp \left\{ { - d_y^2 \left( {\tau ^{ - 2} + \sigma _2^{ - 2} } \right) - \frac{{(m_y - \theta )^2 }} {{\tau ^2 }} - \frac{{(m_y - \mu _2 )^2 }} {{\sigma _2^2 }}} \right\}{\text{d}}y_1 {\text{d}}y_2 } , \\ \end{aligned} $$

where

$$ d_x^2 = \frac{{(x_1 - x_2 )^2 }} {4},\quad m_x = \frac{{x_1 + x_2 }} {2}, $$

$$ d_y^2 = \frac{{(y_1 - y_2 )^2 }} {4},\quad m_x = \frac{{y_1 + y_2 }} {2}. $$

Changing to the new variables

$$ u_1 = x_1 - x_2 ,\quad v_1 = x_1 + x_2 , $$

$$ u_2 = y_1 - y_2 ,\quad v_2 = y_1 + y_2 , $$

the result in Lemma 1 follows.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Moreno, E., Casella, G. & Garcia-Ferrer, A. An objective Bayesian analysis of the change point problem. Stoch Environ Res Ris Assess 19, 191–204 (2005). https://doi.org/10.1007/s00477-004-0224-2

Download citation

Published: 10 March 2005
Issue Date: August 2005
DOI: https://doi.org/10.1007/s00477-004-0224-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An objective Bayesian analysis of the change point problem

Abstract

Access this article

Similar content being viewed by others

Nonparametric Tests and Nested Sequential Sampling Plans for Change-Point Detection

A Multiple Hypothesis Testing Approach to Detection Changes in Distribution

Practical Aspects of False Alarm Control for Change Point Detection: Beyond Average Run Length

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1

Appendix 2

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An objective Bayesian analysis of the change point problem

Abstract

Access this article

Similar content being viewed by others

Nonparametric Tests and Nested Sequential Sampling Plans for Change-Point Detection

A Multiple Hypothesis Testing Approach to Detection Changes in Distribution

Practical Aspects of False Alarm Control for Change Point Detection: Beyond Average Run Length

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1

Appendix 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation