On degeneracy and invariances of random fields paths with applications in Gaussian process modelling

doi:10.1016/j.jspi.2015.10.002

Journal of Statistical Planning and Inference

Volume 170, March 2016, Pages 117-128

https://doi.org/10.1016/j.jspi.2015.10.002 Get rights and content

Highlights

•
Covariance-driven pathwise properties of random fields are investigated.
•
A number of pathwise properties can be cast as degeneracies under linear operators.
•
Degeneracies and invariances of Gaussian field paths are stable under conditioning.
•
Improved function inference using ad hoc kernels is illustrated on two examples.

Abstract

We study pathwise invariances and degeneracies of random fields with motivating applications in Gaussian process modelling. The key idea is that a number of structural properties one may wish to impose a priori on functions boil down to degeneracy properties under well-chosen linear operators. We first show in a second order set-up that almost sure degeneracy of random field paths under some class of linear operators defined in terms of signed measures can be controlled through the two first moments. A special focus is then put on the Gaussian case, where these results are revisited and extended to further linear operators thanks to state-of-the-art representations. Several degeneracy properties are tackled, including random fields with symmetric paths, centred paths, harmonic paths, or sparse paths. The proposed approach delivers a number of promising results and perspectives in Gaussian process modelling. In a first numerical experiment, it is shown that dedicated kernels can be used to infer an axis of symmetry. Our second numerical experiment deals with conditional simulations of a solution to the heat equation, and it is found that adapted kernels notably enable improved predictions of non-linear functionals of the field such as its maximum.

Introduction

Whether for function approximation, classification, or density estimation, probabilistic models relying on random fields have been increasingly used in recent works from various research communities. Finding their applied roots in geostatistics and spatial statistics with optimal linear prediction and Kriging (Matheron, 1963, Stein, 1999), random field models for prediction have become a main stream topic in machine learning (under the Gaussian Process Regression terminology, see, e.g., Rasmussen and Williams, 2006), with a spectrum ranging from metamodeling and adaptive design approaches in science and engineering Welch et al. (1992), Jones (2001), O’Hagan (2006) to theoretical Bayesian statistics in function spaces (see Van der Vaart and Van Zanten, 2008a, Van der Vaart and Van Zanten, 2008b, Van der Vaart and van Zanten, 2011 and references therein).

Often, a Gaussian random field model is assumed for some function $f$ of interest, and so all prior assumptions on $f$ are accounted for by the corresponding mean function $m$ and covariance kernel $k$ . The choice of $m$ and $k$ should thus reflect as much as possible any prior belief the modeller wishes to incorporate in the model. Such prior belief on $f$ may of course include classical regularity properties in the first place (continuity, differentiability, Hölder regularity, etc.), but also more specific properties such as symmetries (Haasdonk and Burkhardt, 2007, Ginsbourger et al., 2012), sparse functional ANOVA decompositions (Duvenaud et al., 2011, Durrande et al., 2012, Ginsbourger et al., 2014), or degeneracy under multivariate differential operators in the case of vector-valued random fields. To take a concrete example, covariance structures characterizing divergence-free and curl-free random vector fields have been recently presented and illustrated in Scheuerer and Schlather (2012). Besides that, the idea of expressing structure with kernels has been explored in Duvenaud (2014), where a number of practical aspects regarding positive-semidefiniteness-preserving operations are addressed.

Here we shall discuss how the two first moments influence mathematical properties of associated realizations (or paths), both in a general second order set-up and in the Gaussian case. A number of well-known random field properties driven by the covariance kernel are in the mean square sense, e.g. $L^{2}$ continuity and differentiability (Cressie, 1993). However, such results generally are not informative about the pathwise behaviour of underlying random fields. On the other hand, much can be said about path regularity properties of random field paths (see, e.g., classical results in Cramér and Leadbetter, 1967, Adler, 1990), based in particular on the behaviour of the covariance kernel in the neighbourhood of the diagonal in the second order case. In the stationary case, it is then sufficient to look at the covariance function in the neighbourhood of the origin (with similar results for the variogram in the intrinsic stationary–but not necessarily second order–case). More recently, Scheuerer (2010) has taken a new look at path regularity of second-order random fields, and drew conclusions about a.s. continuous differentiability in non-Gaussian settings. Also, we refer to Scheuerer (2009) for an enlightening exposition of state-of-the-art results concerning regularity properties of random field sample paths in various frameworks.

Our focus in the present work is on pathwise mathematical properties of second order random fields and statistical applications thereof in the context of Gaussian process modelling. Motivated by several practical situations, we pay a particular attention to random fields $Z = {(Z_{x})}_{x \in D}$ that are supported by the null space of some linear operator $T$ , i.e. for which $T (Z) = 0 (a.s.) .$ As we first develop in general second-order settings, an impressive diversity of path properties including invariances under group actions or sparse ANOVA decompositions of multivariate paths can be encapsulated in the framework of Eq. (1). Furthermore, in the particular case of Gaussian random fields, a more general class of path properties (notably some degeneracy properties involving differential operators) can be covered through the link between operators on the paths and operators on the reproducing kernel Hilbert space (Berlinet and Thomas-Agnan, 2004) associated with the random field, and also through an additional representation of $Z$ in terms of Gaussian measures on Banach spaces.

While Section 2 is dedicated to the exposition of the main results, proofs are presented in the Appendix to ease the reading. Applications in the context of random field modelling, and especially for Gaussian process modelling, are then investigated throughout Section 3. In particular, we tackle zero-integral random processes, random fields with paths invariant under group actions, random fields with additive paths, random fields with harmonic paths, and discuss further potential applications.

In Section 4, we present two original numerical experiments where the notions of degeneracy and invariance appear very useful in Gaussian process modelling under two types of structural prior information. In the first case, the objective function possesses an unknown axis of symmetry, which is inferred by maximum likelihood, relying on a family of argumentwise invariant covariance kernels. In the second case, we obtain an improved interpolation of a solution to the heat equation thanks to a bi-harmonic kernel. The proposed model enables performing harmonic conditional simulations, which has very beneficial consequences in terms of estimation of the maximum. Section 5 is dedicated to conclusions and perspectives. The main results are finally proven in the Appendix.

Section snippets

Main results

Let $(D, D)$ be a measurable space, $(Ω, A, P)$ be a complete probability space, and $Z = {(Z_{x})}_{x \in D}$ be a measurable real-valued stochastic process over $(Ω, A, P)$ . Let us further assume that the paths of $Z$ belong with probability 1 to some function space $F \subset M (D, R)$ , where $M (D, R)$ is the set of $(D, B (R))$ -measurable functions, and consider a linear operator $T : F ⟶ F$ . Here both $Z$ and $T (Z)$ are assumed second order, in the sense that their marginals possess a variance, and we aim at giving necessary and sufficient

On processes with paths integrating to zero

Lemma 1 of Appendix ensures that the almost sure nullity of $ℓ_{ν} (Z)$ can be characterized through the nullity of $ℓ_{ν} (m)$ and $ℓ_{ν} \otimes ℓ_{ν} (k)$ , provided that the function $u \in D ⟶ \sqrt{k (u, u) + m {(u)}^{2}} \in R$ is $ν$ -integrable. In practice, assuming that this integrability condition is fulfilled–as it is notably the case for compact $D$ and continuous $m$ and $k$ –it suffices to check that $ℓ_{ν} (m) = 0$ and $ℓ_{ν} (k (\cdot, x^{'})) = 0$ for arbitrary $x^{'} \in D$ . Taking for instance the settings of Durrande et al. (2013) where $ν$ is a finite (positive) measure and

Estimation of a symmetry axis

We now consider the following test function over $D = {[0, 1]}^{2}$ : $f : (x_{1}, x_{2}) \in D \to cos (\sqrt{2} (x_{1} + x_{2}) + 0.4) + sin {(3 (x_{1} - x_{2}))}^{2} + x_{1} - x_{2} .$ This function is symmetric with respect to the axis defined by the equation $x_{2} = - x_{1} - 0.1 \sqrt{2}$ . Given noisy evaluation results of $f$ at 20 points, and assuming a priori that $f$ is symmetric with respect to some axis $Δ$ (parametrized by its angle with the ordinate axis $λ \in [0, π)$ and its distance to the origin $δ \geq 0$ ), we aim at both reconstructing $f$ and recovering the equation of this axis by using a

Conclusions and perspectives

This article focuses on the control of pathwise invariances of square-integrable random field through their covariance structure. It is illustrated on various examples how a number of features one may wish to impose on paths such as multivariate sparsity, symmetries, or being solution to a vast class of homogeneous ordinary or partial differential equations may be cast as degeneracy or invariance properties under linear operators.

One of the main results of this work, given in Proposition 1,

Acknowledgements

The authors would like to thank Dominic Schuhmacher, Fabrice Gamboa, as well as two anonymous referees for useful comments on previous versions of this paper. They are also very grateful to the editor for constructive remarks having led to significant improvements.

References (37)

N. Durrande et al.
ANOVA kernels and RKHS of zero mean functions for model-based sensitivity analysis
J. Multivariate Anal.
(2013)
A. O’Hagan
Bayesian analysis of computer code outputs: A tutorial
Reliab. Eng. Syst. Saf.
(2006)
M. Scheuerer
Regularity of the sample paths of a general second order random field
Stochastic Process. Appl.
(2010)
V. Tarieladze et al.
Disintegration of Gaussian measures and average-case optimal algorithms
J. Complexity
(2007)
R.J. Adler
R. Adler et al.
Random Fields and Geometry
(2007)
A. Berlinet et al.
Reproducing Kernel Hilbert Spaces in Probability and Statistics
(2004)
H. Cramér et al.
Stationary and Related Stochastic Processes: Sample Function Properties and their Applications
(1967)
N. Cressie
P. Deheuvels
Karhunen-Loève expansions of mean-centered Wiener processes

Deville, Y., Ginsbourger, D., Roustant, O., 2015. kergp: Gaussian Process Laboratory, R package version...

N. Durrande et al.

Additive covariance kernels for high-dimensional Gaussian process modeling

Ann. Fac. Sci. Toulouse

(2012)

D. Duvenaud

Automatic model construction with Gaussian processes

(2014)

D.K. Duvenaud et al.

Additive Gaussian processes

Franco, J., Dupuy, D., Roustant, O., Damblin, G., Iooss, B., 2013. DiceDesign: Designs of Computer Experiments, R...

T.E. Fricker et al.

Multivariate Gaussian process emulators with nonseparable covariance structures

Technometrics

(2013)

I. Gihman et al.

The Theory of Stochastic Processes I

(1974)

D. Ginsbourger et al.

Argumentwise invariant kernels for the approximation of invariant functions

Ann. Fac. Sci. Toulouse

(2012)

Cited by (23)

Covariance models and Gaussian process regression for the wave equation. Application to related inverse problems
2023, Journal of Computational Physics
In this article, we consider the general task of performing Gaussian process regression (GPR) on pointwise observations of solutions of the 3 dimensional homogeneous free space wave equation. In a recent article, we obtained promising covariance expressions tailored to this equation: we now explore the potential applications of these formulas. We first study the particular cases of stationarity and radial symmetry, for which significant simplifications arise. We next show that the true-angle multilateration method for point source localization, as used in GPS systems, is naturally recovered by our GPR formulas in the limit of the small source radius. Additionally, we show that this GPR framework provides a new answer to the ill-posed inverse problem of reconstructing initial conditions for the wave equation from a limited number of sensors, and simultaneously enables the inference of physical parameters from these data. We finish by illustrating this “physics informed” GPR on a number of practical examples.
Inference and uncertainty propagation of GB structure-property models: H diffusivity in [100] tilt GBs in Ni
2021, Acta Materialia
Citation Excerpt :
As we will show below, this approach succeeds in enforcing the desired symmetries, but at the cost of introducing topological artifacts. We propose an alternative approach that circumvents this drawback by employing an unsymmetrized distance function in conjunction with the sum-over-group-orbits strategy [39–41]. Distance metrics for GBs have been an active area of research for some time and there are a variety of candidates in the literature (see e.g. [42–46] and [47] for a recent review of extant distance metrics).
In this work we present a non-parametric Bayesian approach for developing structure-property models for grain boundaries (GBs) with built-in uncertainty quantification (UQ). Using this method we infer a structure-property model for H diffusivity in [100] tilt GBs in Ni at 700 K based on molecular dynamics (MD) data. Once a GB structure-property model is developed, it can be used as an input to mesoscale simulations of the effective properties of polycrystals, microstructure evolution, etc. A significant advantage of the Bayesian approach presented here is that it facilitates propagation of uncertainties from the underlying structure-property model to the output predictions from mesoscale modeling. Leveraging this capability, we perform mesoscale simulations of the effective diffusivity of polycrystals to investigate the interaction between structure-property model uncertainties and GB network structure. We observe a fundamental interaction between crystallographic correlations and spatial correlations in GB networks that causes certain types of microstructures (those with large populations of $J_{2}$ - and $J_{3}$ -type triple junctions) to exhibit intrinsically larger uncertainty in their effective properties. Data and code are provided in supplementary materials.
Constrained Bayesian Optimization Under Partial Observations: Balanced Improvements and Provable Convergence
2023, arXiv
Covariance models and Gaussian process regression for the wave equation. Application to related inverse problems
2023, arXiv
Characterization of the second order random fields subject to linear distributional PDE constraints
2023, Bernoulli
An Introduction to the Calibration of Computer Models
2023, arXiv

View all citing articles on Scopus

View full text

On degeneracy and invariances of random fields paths with applications in Gaussian process modelling

Highlights

Abstract

Introduction

Section snippets

Main results

On processes with paths integrating to zero

Estimation of a symmetry axis

Conclusions and perspectives

Acknowledgements

J. Multivariate Anal.

Reliab. Eng. Syst. Saf.

Stochastic Process. Appl.

J. Complexity

Random Fields and Geometry

Reproducing Kernel Hilbert Spaces in Probability and Statistics

Stationary and Related Stochastic Processes: Sample Function Properties and their Applications

Karhunen-Loève expansions of mean-centered Wiener processes

Additive covariance kernels for high-dimensional Gaussian process modeling

Ann. Fac. Sci. Toulouse

Automatic model construction with Gaussian processes

Additive Gaussian processes

Multivariate Gaussian process emulators with nonseparable covariance structures

Technometrics

The Theory of Stochastic Processes I

Argumentwise invariant kernels for the approximation of invariant functions

Ann. Fac. Sci. Toulouse