On degeneracy and invariances of random fields paths with applications in Gaussian process modelling

https://doi.org/10.1016/j.jspi.2015.10.002Get rights and content

Highlights

  • Covariance-driven pathwise properties of random fields are investigated.

  • A number of pathwise properties can be cast as degeneracies under linear operators.

  • Degeneracies and invariances of Gaussian field paths are stable under conditioning.

  • Improved function inference using ad hoc kernels is illustrated on two examples.

Abstract

We study pathwise invariances and degeneracies of random fields with motivating applications in Gaussian process modelling. The key idea is that a number of structural properties one may wish to impose a priori on functions boil down to degeneracy properties under well-chosen linear operators. We first show in a second order set-up that almost sure degeneracy of random field paths under some class of linear operators defined in terms of signed measures can be controlled through the two first moments. A special focus is then put on the Gaussian case, where these results are revisited and extended to further linear operators thanks to state-of-the-art representations. Several degeneracy properties are tackled, including random fields with symmetric paths, centred paths, harmonic paths, or sparse paths. The proposed approach delivers a number of promising results and perspectives in Gaussian process modelling. In a first numerical experiment, it is shown that dedicated kernels can be used to infer an axis of symmetry. Our second numerical experiment deals with conditional simulations of a solution to the heat equation, and it is found that adapted kernels notably enable improved predictions of non-linear functionals of the field such as its maximum.

Introduction

Whether for function approximation, classification, or density estimation, probabilistic models relying on random fields have been increasingly used in recent works from various research communities. Finding their applied roots in geostatistics and spatial statistics with optimal linear prediction and Kriging (Matheron, 1963, Stein, 1999), random field models for prediction have become a main stream topic in machine learning (under the Gaussian Process Regression terminology, see, e.g., Rasmussen and Williams, 2006), with a spectrum ranging from metamodeling and adaptive design approaches in science and engineering Welch et al. (1992), Jones (2001), O’Hagan (2006) to theoretical Bayesian statistics in function spaces (see Van der Vaart and Van Zanten, 2008a, Van der Vaart and Van Zanten, 2008b, Van der Vaart and van Zanten, 2011 and references therein).

Often, a Gaussian random field model is assumed for some function f of interest, and so all prior assumptions on f are accounted for by the corresponding mean function m and covariance kernel k. The choice of m and k should thus reflect as much as possible any prior belief the modeller wishes to incorporate in the model. Such prior belief on f may of course include classical regularity properties in the first place (continuity, differentiability, Hölder regularity, etc.), but also more specific properties such as symmetries (Haasdonk and Burkhardt, 2007, Ginsbourger et al., 2012), sparse functional ANOVA decompositions (Duvenaud et al., 2011, Durrande et al., 2012, Ginsbourger et al., 2014), or degeneracy under multivariate differential operators in the case of vector-valued random fields. To take a concrete example, covariance structures characterizing divergence-free and curl-free random vector fields have been recently presented and illustrated in Scheuerer and Schlather (2012). Besides that, the idea of expressing structure with kernels has been explored in Duvenaud (2014), where a number of practical aspects regarding positive-semidefiniteness-preserving operations are addressed.

Here we shall discuss how the two first moments influence mathematical properties of associated realizations (or paths), both in a general second order set-up and in the Gaussian case. A number of well-known random field properties driven by the covariance kernel are in the mean square sense, e.g.  L2 continuity and differentiability (Cressie, 1993). However, such results generally are not informative about the pathwise behaviour of underlying random fields. On the other hand, much can be said about path regularity properties of random field paths (see, e.g., classical results in Cramér and Leadbetter, 1967, Adler, 1990), based in particular on the behaviour of the covariance kernel in the neighbourhood of the diagonal in the second order case. In the stationary case, it is then sufficient to look at the covariance function in the neighbourhood of the origin (with similar results for the variogram in the intrinsic stationary–but not necessarily second order–case). More recently, Scheuerer (2010) has taken a new look at path regularity of second-order random fields, and drew conclusions about a.s. continuous differentiability in non-Gaussian settings. Also, we refer to Scheuerer (2009) for an enlightening exposition of state-of-the-art results concerning regularity properties of random field sample paths in various frameworks.

Our focus in the present work is on pathwise mathematical properties of second order random fields and statistical applications thereof in the context of Gaussian process modelling. Motivated by several practical situations, we pay a particular attention to random fields Z=(Zx)xD that are supported by the null space of some linear operator T, i.e. for which T(Z)=0(a.s.). As we first develop in general second-order settings, an impressive diversity of path properties including invariances under group actions or sparse ANOVA decompositions of multivariate paths can be encapsulated in the framework of Eq. (1). Furthermore, in the particular case of Gaussian random fields, a more general class of path properties (notably some degeneracy properties involving differential operators) can be covered through the link between operators on the paths and operators on the reproducing kernel Hilbert space (Berlinet and Thomas-Agnan, 2004) associated with the random field, and also through an additional representation of Z in terms of Gaussian measures on Banach spaces.

While Section  2 is dedicated to the exposition of the main results, proofs are presented in the Appendix to ease the reading. Applications in the context of random field modelling, and especially for Gaussian process modelling, are then investigated throughout Section  3. In particular, we tackle zero-integral random processes, random fields with paths invariant under group actions, random fields with additive paths, random fields with harmonic paths, and discuss further potential applications.

In Section  4, we present two original numerical experiments where the notions of degeneracy and invariance appear very useful in Gaussian process modelling under two types of structural prior information. In the first case, the objective function possesses an unknown axis of symmetry, which is inferred by maximum likelihood, relying on a family of argumentwise invariant covariance kernels. In the second case, we obtain an improved interpolation of a solution to the heat equation thanks to a bi-harmonic kernel. The proposed model enables performing harmonic conditional simulations, which has very beneficial consequences in terms of estimation of the maximum. Section  5 is dedicated to conclusions and perspectives. The main results are finally proven in the Appendix.

Section snippets

Main results

Let (D,D) be a measurable space, (Ω,A,P) be a complete probability space, and Z=(Zx)xD be a measurable real-valued stochastic process over (Ω,A,P). Let us further assume that the paths of Z belong with probability 1 to some function space FM(D,R), where M(D,R) is the set of (D,B(R))-measurable functions, and consider a linear operator T:FF. Here both Z and T(Z) are assumed second order, in the sense that their marginals possess a variance, and we aim at giving necessary and sufficient

On processes with paths integrating to zero

Lemma 1 of Appendix ensures that the almost sure nullity of ν(Z) can be characterized through the nullity of ν(m) and νν(k), provided that the function uDk(u,u)+m(u)2R is ν-integrable. In practice, assuming that this integrability condition is fulfilled–as it is notably the case for compact D and continuous m and k–it suffices to check that ν(m)=0 and ν(k(,x))=0 for arbitrary xD. Taking for instance the settings of Durrande et al. (2013) where ν is a finite (positive) measure and

Estimation of a symmetry axis

We now consider the following test function over D=[0,1]2: f:(x1,x2)Dcos(2(x1+x2)+0.4)+sin(3(x1x2))2+x1x2. This function is symmetric with respect to the axis defined by the equation x2=x10.12. Given noisy evaluation results of f at 20 points, and assuming a priori that f is symmetric with respect to some axis Δ (parametrized by its angle with the ordinate axis λ[0,π) and its distance to the origin δ0), we aim at both reconstructing f and recovering the equation of this axis by using a

Conclusions and perspectives

This article focuses on the control of pathwise invariances of square-integrable random field through their covariance structure. It is illustrated on various examples how a number of features one may wish to impose on paths such as multivariate sparsity, symmetries, or being solution to a vast class of homogeneous ordinary or partial differential equations may be cast as degeneracy or invariance properties under linear operators.

One of the main results of this work, given in Proposition 1,

Acknowledgements

The authors would like to thank Dominic Schuhmacher, Fabrice Gamboa, as well as two anonymous referees for useful comments on previous versions of this paper. They are also very grateful to the editor for constructive remarks having led to significant improvements.

References (37)

  • Deville, Y., Ginsbourger, D., Roustant, O., 2015. kergp: Gaussian Process Laboratory, R package version...
  • N. Durrande et al.

    Additive covariance kernels for high-dimensional Gaussian process modeling

    Ann. Fac. Sci. Toulouse

    (2012)
  • D. Duvenaud

    Automatic model construction with Gaussian processes

    (2014)
  • D.K. Duvenaud et al.

    Additive Gaussian processes

  • Franco, J., Dupuy, D., Roustant, O., Damblin, G., Iooss, B., 2013. DiceDesign: Designs of Computer Experiments, R...
  • T.E. Fricker et al.

    Multivariate Gaussian process emulators with nonseparable covariance structures

    Technometrics

    (2013)
  • I. Gihman et al.

    The Theory of Stochastic Processes I

    (1974)
  • D. Ginsbourger et al.

    Argumentwise invariant kernels for the approximation of invariant functions

    Ann. Fac. Sci. Toulouse

    (2012)
  • Cited by (23)

    • Inference and uncertainty propagation of GB structure-property models: H diffusivity in [100] tilt GBs in Ni

      2021, Acta Materialia
      Citation Excerpt :

      As we will show below, this approach succeeds in enforcing the desired symmetries, but at the cost of introducing topological artifacts. We propose an alternative approach that circumvents this drawback by employing an unsymmetrized distance function in conjunction with the sum-over-group-orbits strategy [39–41]. Distance metrics for GBs have been an active area of research for some time and there are a variety of candidates in the literature (see e.g. [42–46] and [47] for a recent review of extant distance metrics).

    View all citing articles on Scopus
    View full text