A limit result for the prior predictive applied to checking for prior-data conflict

doi:10.1016/j.spl.2011.02.025

Statistics & Probability Letters

Volume 81, Issue 8, August 2011, Pages 1034-1038

https://doi.org/10.1016/j.spl.2011.02.025 Get rights and content

Abstract

We consider checking for prior-data conflict in a Bayesian analysis via a tail probability based on the prior predictive distribution. We establish the appropriateness of this measure in the sense that the limiting value of the tail probability measures the extent to which the true value of the parameter is a surprising value from the prior.

Introduction

The relevance of the results of a statistical analysis depends upon the inputs chosen by the analyst. If the inputs are deemed not to be appropriate, then one has reason to doubt any conclusions drawn. For a Bayesian statistical analysis, these inputs comprise the sampling model ${P_{θ} : θ \in Θ}$ for the data considered here as a collection of probability measures, one of which is supposed to have generated the observed data, the prior $Π$ on the model parameter and perhaps a loss function. One way to assess the relevance of these inputs is to see whether or not these make sense in light of the data collected. From this point of view, we have a model failure when the data observed is surprising for every probability distribution in the model. In this paper, we are concerned with assessing whether or not there is a prior-data conflict.

Intuitively, prior-data conflict arises when the likelihood is relatively high, where the prior is relatively low. While this seems easy to assess via a graph when dealing with a one-dimensional parameter, more formal methods seem necessary in general. Various approaches have been proposed for assessing prior-data conflict; see for example, Young and Pettit (1996), Evans and Moshonov (2006), and Marshall and Spiegelhalter (2007). Some Bayesian model checking methods, such as those discussed in Box (1980) and Gelman et al. (1996) could also be considered as assessments of the prior although this is confounded with checking the model. Separating out the assessment of the prior from the model gives greater information concerning which of these choices might be in conflict with the data. Box (1980) proposed the tail probability $M (m (X) \leq m (x))$ , where $x$ is the observed data, and $m (x) = \int_{Θ} f_{θ} (x) Π (d θ)$ is the density of the data associated with the prior predictive measure $M$ on the sample space $X$ , where $P_{θ}$ has density $f_{θ}$ . We show in Example 1 that this is not appropriate for checking the prior.

In this paper, we prove a consistency result for the check for prior-data conflict discussed in Evans and Moshonov, 2006, Evans and Moshonov, 2007. Suppose that $T : X \to T$ is a minimal sufficient statistic for ${P_{θ} : θ \in Θ}$ with density $f_{θ T}$ on $T$ . The tail probability $M_{T} (m_{T} (T) \leq m_{T} (T (x))),$ was proposed for checking for prior-data conflict where $M_{T}$ is the prior predictive distribution of $T$ . The following example motivates why (1) is suitable for checking for prior-data conflict.

Example 1 Location Normal

Suppose that $x = (x_{1}, \dots, x_{n})$ is a sample from a $N (μ, 1)$ distribution where $μ \in R^{1}$ is unknown. Then a minimal sufficient statistic is given by $T_{n} (x) = \bar{x}$ and $T_{n} (x)$ converges almost surely to the true value $μ_{true}$ as $n \to \infty$ . Suppose we put a $N (μ_{0}, σ_{0}^{2})$ prior on $μ$ . Then $M_{T_{n}}$ is the $N (μ_{0}, σ_{0}^{2} + 1 / n)$ distribution and this converges in distribution to the $N (μ_{0}, σ_{0}^{2})$ distribution. Also $m_{T_{n}} (t)$ converges almost surely to the prior density ${(2 π)}^{- 1 / 2} σ_{0}^{- 1} exp {- {(t - μ_{0})}^{2} / 2 σ_{0}^{2}}$ uniformly for $t$ in a compact set. A simple computation then shows that (1) converges almost surely to $2 (1 - Φ (| μ_{true} - μ_{0} | / σ_{0}))$ which assesses how far out in the tails of the prior $μ_{true}$ lies.

Now consider the Box (1980) tail probability for this problem. We have that $X \sim N_{n} (0, I_{n} + τ 1_{n} 1_{n}^{'})$ where $I_{n}$ is the $n \times n$ identity matrix, $1_{n}$ is a vector of $n$ ones, and $M (m (X) \leq m (x)) = 1 - G_{n} (x^{'} (I_{n} - \frac{τ}{1 + n τ} 1_{n} 1_{n}^{'}) x)$ where $G_{n}$ is the chi-squared $(n)$ distribution function. The quadratic form can be decomposed as $x^{'} (I_{n} - \frac{τ}{1 + n τ} 1_{n} 1_{n}^{'}) x = \sum_{i = 1}^{n} {(x_{i} - {\bar{x}}_{n})}^{2} + \frac{n}{1 + n τ} {\bar{x}}_{n}^{2} = V_{n} + W_{n},$ where, conditionally given $θ, V_{n} \sim χ^{2} (n - 1), W_{n} = O_{p} (1)$ and $V_{n}$ and $W_{n}$ are independent. Now $(χ^{2} (n) - n) / \sqrt{2 n} \overset{d}{\to} N (0, 1)$ and $G_{n} (n + x \sqrt{2 n}) - Φ (x) = O (n^{- 1 / 2})$ uniformly in $x$ , by Theorem XVI.4.1 in Feller (1971). Hence $M (m (X) \leq m (x)) = 1 - Φ ((V_{n} - (n - 1)) / \sqrt{2 n}) + O_{p} (n^{- 1 / 2})$ where we have used the uniform continuity of $Φ$ . Since $(V_{n} - (n - 1)) / \sqrt{2 n} = (V_{n} - (n - 1)) / \sqrt{2 (n - 1)} \times \sqrt{1 - 1 / n} \overset{d}{\to} N (0, 1)$ we have that $M (m (X) \leq m (x)) \overset{d}{\to} Uniform (0, 1)$ . This limit is independent of the prior and whether or not $μ_{true}$ is in the tails of the prior. Therefore, this tail probability is not useful for checking for prior-data conflict.

While the potential ill effects of a prior-data conflict have long been recognized, it is not clear what one should do when we conclude that a conflict exists. One can note, however, that we have learned something of relevance and it seems only fair that an analyst report this. Also, we note that the situation is similar with model checking as it is not clear what we should do when we have a failure and this does excuse us from these checks. The typical response to model failure is that we must modify the model in some way, perhaps by enlarging the family of distributions. Similarly, when a prior-data conflict exists, our response can be to use a new prior that is less informative in the sense that we can expect fewer prior-data conflicts. We discuss this in Section 4.

A criticism of (1) is that, in the case of continuous models, (1) is not invariant under smooth transformations. For suppose that $W : T \to W$ is 1-1 and smooth and let $J_{W} (t)$ be the reciprocal of the Jacobian determinant of $W$ evaluated at $t$ . Then $W$ is also minimal sufficient and (1) applied to $W$ gives the tail probability $M_{T} (m_{T} (T) J_{W} (T) \leq m_{T} (T (x)) J_{W} (T (x)))$ which is generally different than (1). This issue is avoided if we use the approach discussed in Evans and Jang (2010a) to get the invariant tail probability $M_{T} (m_{T}^{*} (T) \leq m_{T}^{*} (T (x)))$ where $m_{T}^{*} (t) = m_{T} (t) E (J_{T}^{- 1} (X) ∣ T (X) = t), J_{T} (x) = {| det (d T (x) \circ d T^{'} (x)) |}^{- 1 / 2}$ and $d T$ is the differential of $T$ . The factor $E (J_{T}^{- 1} (X) ∣ T (X) = t)$ corrects for volume distortions due to the transformation $T$ . Note that whenever $T$ is linear, then $E (J_{T}^{- 1} (X) ∣ T (X) = t)$ is constant and the invariant tail probability is the same as (1). This is the case for all but one of our examples. We state a relevant convergence result for this tail probability in Section 2.

In Section 2, we provide theorems, with proofs in the Appendix, for the convergence of (1) to the tail probability $Π (π (θ) \leq π (θ_{true}))$ where $θ_{true}$ is the true value of the parameter, i.e., (1) is a consistent assessment of whether or not the true value of the parameter is in the tails of the prior. In Section 3, we provide some applications. In Section 4, we discuss what one can do when a prior-data conflict is encountered.

Section snippets

Consistency of the check

We consider the behavior of (1) as the amount of data grows. We have the following generalization of Example 1.

Theorem 1

Suppose $Θ \subset R^{k}$ is open and (i) $T_{n} \to θ$ a.s. $P_{θ}$ for every $θ$ , (ii) $m_{T_{n}} (t) \to π (t)$ uniformly on compact subsets of $Θ$ , (iii) $π$ is continuous and the prior distribution of $π (θ)$ has no atoms, then $M_{T_{n}} (m_{T_{n}} (T_{n}) \leq m_{T_{n}} (T_{n} (x_{n}))) \to Π (π (θ) \leq π (θ_{true}))$ a.s. $P_{θ_{true}}$ .

Note that our discussion here is restricted to situations where the minimal sufficient statistic is a consistent estimator of the true value which is

Examples

For these examples the details associated with establishing Theorem 1(ii) are similar to the proof of Theorem 2 and can be found in Evans and Jang (2010b).

Example 2 Scale-Gamma

Let $x = (x_{1}, \dots, x_{n})$ be a sample from a Gamma $(α_{0}, θ)$ distribution where the scale parameter $θ > 0$ is unknown. Then $T_{n} (x) = {(n α_{0})}^{- 1} \sum_{i = 1}^{n} x_{i} \sim Gamma (n α_{0}, θ / (n α_{0}))$ is minimal sufficient and $T_{n} (x) \overset{a.s.}{\to} θ_{true}$ . When $π$ satisfies Theorem 1(iii), then Theorem 1(ii) holds and Theorem 1 applies.

The following example uses Example 2 in a problem of considerable

Resolving a prior-data conflict

There are several possible courses of action when we find that a given prior is in conflict with the data. First we note that, as we increase the amount of data it is typical that the effect of the prior disappears. So even though a prior-data conflict may exist, it may be that we can ignore it as the prior has little effect on the analysis. Diagnostics for assessing this are discussed in Evans and Moshonov (2006) and these involve comparing posterior inferences under the prior with those under

Acknowledgements

The authors thank the Editor and referees for some helpful comments.

References (13)

J. Berger et al.
On the development of the reference prior method
J.O. Berger et al.
The formal definition of reference priors
Ann. Statist.
(2009)
G.E.P. Box
Sampling and Bayes’ inference in scientific modelling and robustness
J. R. Stat. Soc. Ser. A
(1980)
Evans, M., Jang, G.H., 2009. Weak informativity and the information in one prior relative to another. Tech. Rep. No....
M. Evans et al.
Invariant $P$ -values for model checking
Ann. Statist.
(2010)
Evans, M., Jang, G.H., 2010b. A limit result for the prior predictive. Tech. Rep. No. 1004. Department of Statistics,...

There are more references available in the full text version of this article.

Cited by (32)

Maximum entropy derived and generalized under idempotent probability to address Bayes-frequentist uncertainty and model revision uncertainty: An information-theoretic semantics for possibility theory
2023, Fuzzy Sets and Systems
Typical statistical methods of data analysis only handle determinate uncertainty, the type of uncertainty that can be modeled under the Bayesian or confidence theories of inference. An example of indeterminate uncertainty is uncertainty about whether the Bayesian theory or the frequentist theory is better suited to the problem at hand. Another example is uncertainty about how to modify a Bayesian model upon learning that its prior is inadequate. Both problems of indeterminate uncertainty have solutions under the proposed framework. The framework is based on an information-theoretic definition of an incoherence function to be minimized. It generalizes the principle of choosing an estimate that minimizes the reverse relative entropy between it and a previous posterior distribution such as a confidence distribution. The simplest form of the incoherence function, called the incoherence distribution, is a min-plus probability distribution, which is equivalent to a possibility distribution rather than a measure-theoretic probability distribution. A simple case of minimizing the incoherence leads to a generalization of minimizing relative entropy and thus of maximizing entropy. The framework of minimum incoherence is applied to problems of Bayesian-confidence uncertainty and to parallel problems of indeterminate uncertainty about model revision.
Measuring statistical evidence using relative belief
2016, Computational and Structural Biotechnology Journal
Citation Excerpt :
Such a check is carried out by computing a tail probability based on the prior predictive distribution of a minimal sufficient statistic (see Evans and Moshonov [20,21]). In Evans and Jang [16] it is proved that this tail probability is consistent in the sense that, as the amount of data grows, it converges to a probability that measures how far into the tails of the prior the true value of θ lies. Here “lying in the tails” is interpreted as indicating that a prior-data conflict exists since the data is not coming from a distribution where the prior assigns most of the belief.
A fundamental concern of a theory of statistical inference is how one should measure statistical evidence. Certainly the words “statistical evidence,” or perhaps just “evidence,” are much used in statistical contexts. It is fair to say, however, that the precise characterization of this concept is somewhat elusive. Our goal here is to provide a definition of how to measure statistical evidence for any particular statistical problem. Since evidence is what causes beliefs to change, it is proposed to measure evidence by the amount beliefs change from a priori to a posteriori. As such, our definition involves prior beliefs and this raises issues of subjectivity versus objectivity in statistical analyses. This is dealt with through a principle requiring the falsifiability of any ingredients to a statistical analysis. These concerns lead to checking for prior-data conflict and measuring the a priori bias in a prior.
On some problems of Bayesian region construction with guaranteed coverages
2024, Statistical Papers
How to Measure Statistical Evidence and Its Strength: Bayes Factors or Relative Belief Ratios?
2023, arXiv
Avoiding prior–data conflict in regression models via mixture priors
2022, Canadian Journal of Statistics
Bayesian statistics and modelling
2021, Nature Reviews Methods Primers

View all citing articles on Scopus

View full text

A limit result for the prior predictive applied to checking for prior-data conflict

Abstract

Introduction

Section snippets

Consistency of the check

Examples

Resolving a prior-data conflict

Acknowledgements

On the development of the reference prior method

The formal definition of reference priors

Ann. Statist.

Sampling and Bayes’ inference in scientific modelling and robustness

J. R. Stat. Soc. Ser. A

Invariant P-values for model checking

Ann. Statist.

Invariant $P$ -values for model checking