Abstract
In segmentation problems, inference on change-point position and model selection are two difficult issues due to the discrete nature of change-points. In a Bayesian context, we derive exact, non-asymptotic, explicit and tractable formulae for the posterior distribution of variables such as the number of change-points or their positions. We also derive a new selection criterion that accounts for the reliability of the results. All these results are based on an efficient strategy to explore the whole segmentation space, which can be very large. We illustrate our methodology on both simulated data and a comparative genomic hybridisation profile.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
BAI, J. and PERRON, P. (2003): Computation and analysis of multiple structural change models. J. Appl. Econ. 18 1–22.
BARAUD, Y., GIRAUD, C. and HUET, S. (2009): Gaussian model selection with unknown variance. AS. 37 (2) 630–672.
BIERNACKI, C., CELEUX, G. and GOVAERT, G. (2000): Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans. Pattern Anal. Machine Intel. 22 (7) 719–725.
BIRGÉ, L. and MASSART, P. (2007): Minimal penalties for gaussian model selection. Probability Th. and Related Fields. 138 33–73.
BRAUN, R.-K., BRAUN, J.-V. and MÜLLER, H.-G.(2000): Multiple changepoint fitting via quasilikelihood, with application to dna sequence segmentation. Biometrika. 87 301–314.
FEDER, P. I. (1975): The loglikelihood ratio in segmented regression. ”AS”. 3 (1) 84–97.
GUÉDON, Y. (2008), Explorating the segmentation space for the assessment of multiple change-points models. Technical report, Preprint INRIA.
HUŠKOVÁ, M. and KIRCH, C. (2008): Bootstrapping confidence intervals for the change-point of time series. Journal of Time Series Analysis. 29 (6) 947–972.
LAVIELLE, M. (2005): Using penalized contrasts for the change-point problem. Signal Processing. 85 (8) 1501–1510.
LEBARBIER, E. (2005): Detecting multiple change-points in the mean of gaussian process by model selection. Signal Processing. 85 717–736.
MUGGEO, V. M. (2003): Estimating regression models with unknown break-points. Stat. Med. 22 (19) 3055–3071.
PICARD, F., ROBIN, S., LAVIELLE, M., VAISSE, C. and DAUDIN, J.-J. (2005): A statistical approach for array CGH data analysis. BMC Bioinformatics. 6 (27) 1. www.biomedcentral.com/1471-2105/6/27.
PINKEL, D. et al. (1998): High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nature Genetics. (20) 207–211.
TOMS, J. D. and LESPERANCE, M. L. (2003): Piecewise regression: A tool for identifying ecological thresholds. Ecology. 84 (8) 2034–2041.
ZHANG, N. R. and SIEGMUND, D. O. (2007): A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data. Biometrics. 63 (1) 22–32.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rigaill, G., Lebarbier, E., Robin, S. (2010). Exact Posterior Distributions over the Segmentation Space and Model Selection for Multiple Change-Point Detection Problems. In: Lechevallier, Y., Saporta, G. (eds) Proceedings of COMPSTAT'2010. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2604-3_57
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2604-3_57
Published:
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2603-6
Online ISBN: 978-3-7908-2604-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)