Skip to main content
Log in

Classifying wave forecasts with model-based Geostatistics and the Aitchison distribution

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

This paper proposes a non-parametric method of classification of maps (i.e., variable fields such as wave energy maps for the Western Mediterranean Sea) into a set of D typical regimes (calm, E-, SW- or N/NW-wind dominated storms, the 4 synoptic situations more often occurring in this region). Each map in the training set is described by its values at P measurement points and one of these regime classes. A map is thus identified as a labelled point in a P-dimensional feature space, and the problem is to find a discrimination rule that may be used for attaching a classification probability to future unlabelled maps. The discriminant model proposed assumes that some log-contrasts of these classification probabilities form a Gaussian random field on the feature space. Then, available data (labelled maps of the training set) are linked to these latent probabilities through a multinomial model. This model is quite common in model-based Geostatistics and the Gaussian process classification literature. Inference is here approximated numerically using likelihood based techniques. The multinomial likelihood of labelled features is combined in a Bayesian updating with the Gaussian random field, playing the role of prior distribution. The posterior corresponds to an Aitchison distribution. Its maximum posterior estimates are obtained in two steps, exploiting several properties of this family. The first step is to obtain the mode of this distribution for labelled features, by solving a mildly non-linear system of equations. The second step is to propagate these estimates to unlabelled features, with simple kriging of log-contrasts. These inference steps can be extended via Markov-chain Monte Carlo (MCMC) sampling to a hierarchical Bayesian problem. This MCMC sampling can be improved by further exploiting the Aitchison distribution properties, though this is only outlined here. Results for the application case study suggest that E- and N/NW-dominated storms can be successfully discriminated from calm situations, but not so easily distinguished from each other.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Abramovitz M, Stegun I (1965) Handbook of mathematical functions. New York: Dover

    Google Scholar 

  • Aitchison J (1981) Distributions on the simplex for the analysis of neutrality. In: Taillie C, Ganapati PP, Baldessari BA (eds) Statistical distributions in scientific work—models, structures, and characterizations. D. Reidel Publishing Co., Dordrecht, pp 147–156

  • Aitchison J (1986) The statistical analysis of compositional data. Monographs on statistics and applied probability. Chapman & Hall Ltd., London (reprinted in 2003 with additional material by The Blackburn Press)

  • Aitchison J (2002) Simplicial inference. In: Viana MAG, Richards DSP (eds) Algebraic methods in statistics and probability. American Mathematical Society, Providence, pp 1–22

  • Billheimer D, Guttorp P, Fagan WF (2001) Statistical interpretation of species composition. J Am Stat Assoc 96(456):1205–1214

    Article  Google Scholar 

  • Bogaert P (2002) Spatial prediction of categorical variables: the Bayesian maximum entropy approach. Stoch Environ Res Risk Assess 16:425–448

    Article  Google Scholar 

  • Christakos G (2000) Modern spatio-temporal geostatistics. Studies on mathematical geology. Oxford University Press, New York

    Google Scholar 

  • Diggle PJ, Tawn JA, Moyeed RA (1998) Model-based geostatistics (with discussion). J R Stat Soc Ser C (App Stat) 47(3):299–350

    Article  Google Scholar 

  • Diggle P, Ribeiro PJ (2007) Model-based geostatistics. Series in statistical. Springer

  • Eaton ML (1983) Multivariate statistics. A vector space approach. Wiley

  • Egozcue JJ, Pawlowsky-Glahn V, Mateu-Figueras G, Barceló-Vidal C (2003) Isometric logratio transformations for compositional data analysis. Math Geol 35(3):279–300

    Article  Google Scholar 

  • Emery X (2007) Simulation of geological domains using the plurigaussian model: new developments and computer programs. Comput Geosci 33(9):1189–1201

    Article  Google Scholar 

  • Higgs MD, Hoeting JA (2010) A clipped latent variable model for spatially correlated ordered categorical data. Comput Stat Data Anal 54(8):1999–2011

    Article  Google Scholar 

  • Journel AG (1983) Nonparametric estimation of spatial distributions. Math Geol 15(3): 445–468

    Article  Google Scholar 

  • Lantuéjoul C (1999) Geostatistical simulation: models and algorithms. Springer

  • Leonard T, Hsu JSJ (1999) Bayesian Methods: an analysis for statisticians and interdisciplinary researchers. Series in statistical and probabilistic methods. Cambridge University Press

  • Neal RM (1999) Regression and classification using Gaussian process priors. In: Bernardo JM, Berger JO, Dawid AP, Smith AFM (eds) Bayesian statistics 6. Oxford University Press, New York, pp 475–501

  • Opper M, Winther O (2000) Gaussian processes for classification: mean field algorithms. Neural Comput 12(11):2655–2684

    Article  CAS  Google Scholar 

  • Pawlowsky-Glahn V, Egozcue JJ (2001) Geometric approach to statistical analysis on the simplex. Stoch Environ Res Risk Assess 15(5):384–398

    Article  Google Scholar 

  • Pawlowsky-Glahn V (2003) Statistical modelling in coordinates. In: Martín-Fernández JA, Thió-Henestrosa S (eds) Compositional data analysis workshop—CoDaWork’03, proceedings. Universitat de Girona, ISBN 84-8458-111-X. http://ima.udg.es/Activitats/CoDaWork03/

  • Simonoff JS (2003) Analyzing categorical data. Springer texts in statistics. Springer

  • Tolosana-Delgado R (2006) Geostatistics for constrained variables: positive data, compositions and probabilities. Application to environmental hazard monitoring. PhD thesis, Universitat de Girona, Spain

  • Tolosana-Delgado R, Pawlowsky-Glahn V, Egozcue JJ (2007) Indicator kriging without order relation violations. Math Geosci 40:327–347

    Article  Google Scholar 

  • Tolosana-Delgado R, Pawlowsky-Glahn V, Egozcue JJ (2008) Simplicial indicator kriging. J China Univ Geosci 19(1):65–71. ISSN 1002-0705

    Article  Google Scholar 

  • Williams CKI, Barber D (1998) Bayesian classification with Gaussian processes. IEEE Trans Pattern Anal Mach Intell 20(12):1342–1351

    Article  Google Scholar 

Download references

Acknowledgments

We acknowledge financial support by the 7th FP project FIELD_AC, a project of the European Space Agency (FP7-SPACE-2009-1), as well as from the Spanish Ministry of Science and Innovation through a “Juan de la Cierva” grant JCI-2008-1835 , supported by the European Social Fund (ESF-FSE); and projects MTM2009-13272 and Ingenio Mathematica (i-MATH) CSD2006-00032. Funding is also ackonwledged to Agencia de Gestió d’Ajuts Universitaris i de Recerca of the Generalitat de Catalunya under project 2009SGR424. Finally, the paper notably benefited from suggestions and comments from an anonymous reviewer, which we want to thank.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Raimon Tolosana-Delgado.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tolosana-Delgado, R., Egozcue, J.J., Sánchez-Arcilla, A. et al. Classifying wave forecasts with model-based Geostatistics and the Aitchison distribution. Stoch Environ Res Risk Assess 25, 1091–1100 (2011). https://doi.org/10.1007/s00477-011-0487-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-011-0487-3

Keywords

Navigation