Abstract
Since the choice of a particular loss function strongly influences the resulting inference, it seems necessary to rely on “intrinsic” losses when no information is available about the utility function of the decision-maker, rather than to call for classical losses like the squared error loss. Since this setting is quite similar to the derivation of noninformative priors in Bayesian analysis, we first recall the conditions of this derivation and deduce from these conditions some requirements on the intrinsic losses. It then appears that these loss functions should only depend on the sampling distribution and that they should be independent of the parameterization of the distribution. The resulting estimators are therefore transformation equivariant. We study the properties of two natural intrinsic losses, namely entropy and Hellinger losses, and show that they can be expressed in closed form for exponential families. Moreover, the entropy loss also provides analytic expressions of Bayes estimators under conjugate priors; the derivation of Bayes estimators associated with the Hellinger loss is more cumbersome, as shown in Poisson and Gamma cases, while leading to similar estimators.
Similar content being viewed by others
References
Abramowitz, M. and Stegun, I.: 1964, Handbook of Mathematical Functions, Dover.
Berger, J.O.: 1985, Statistical Decision Theory and Bayesian Analysis (2nd edition). Springer-Verlag, New York.
Berger, J.O. and Bernardo, J.M.: 1989, ‘Estimating a product of means Bayesian analysis with reference priors’, J. Amer. Statist. Assoc. 84, 200–207.
Berger, J.O. and Wolpert, R.: 1988, The Likelihood Principle (2nd edition). IMS Lecture Notes, Monograph Series 6, Hay ward, California.
Bernardo, J.M. 1979, ‘Reference posterior distributions for Bayesian inference (with discussion)’, J. Royal Statist. Soc. (Ser. B) 41, 113–147.
Bernardo, J.M. and Smith, A.F.M.: 1994, Bayesian Theory, J. Wiley, New York.
Birgé, L.: 1980, Approximation dans les espaces métriques et théorie de l'estimation, Thèse d'Etat, Univ. de Paris VII
Birgé, L.: 1983, ‘Robust testing for independent non identically distributed variables and Markov chains’, Lecture Notes in Statistics 16, Springer-Verlag, New York.
Brown, L.D.: 1986, Foundations of Exponential Families, IMS Lecture Notes, Monograph Series 9, Hay ward, California.
Brown, L.D.: 1980, ‘Examples of Berger's phenomenon in the estimation of independent normal means’, Ann. Statist. 9, 1289–1300.
DeGroot, M.: 1970, Optimal Statistical Decisions, McGraw-Hill, New York.
Fishburn, P.: 1988, Non-linear Preferences and Utility Theory, Harvester Wheatsheaf, Brighton, Sussex.
Gauβ, C.F.: 1810, Méthode des moindres carrés. Mémoire sur la combination des observations, Trad. J. Bertrand. Mallet-Bachelier, Paris (1985).
Gibbs, J.W.: 1876, Collected Works and Commentary, vol. II, A. Haas (Ed.) Yale University Press, New Haven (1936).
Gouriéroux, C. and Monfort, A.: 1994, ‘Testing non-nested hypotheses’, Handbook of Econometrics, Vol. IV, R.F. Engle and D.L. McFadden (Eds.), Elsevier, Amsterdam.
Gutiérrez-Peña, E.: 1992, ‘Expected logarithmic divergence for exponential families’, in Bayesian Statistics 4, J.O. Bernardo, A.P. Dawid and A.F.M. Smith (eds.). Oxford University Press, London, pp. 669–674.
Huber, P.: 1964, ‘Robust estimation of a location parameter’, Ann. Math. Statist. 35, 73–101.
Jaynes, E.t.: 1989, Papers on Probability, Statistics and Statistical Physics, R.D. Rosenkrantz (Ed.), Kluwer Academic Publishers, Dordrecht.
Jeffreys, H.: 1961, Theory of Probability (3rd edition), Oxford University Press, London.
Keating, J.P., Mason, R.L. and Sen, P.K.: 1993, Pitman Measure of Closeness: Comparison of Statistical Estimators, SIAM, Philadelphia.
Le Cam, L.: 1982, ‘On the risk of Bayes estimates’, in Statistical Decision Theory and Related Topics III, Vol. 2, J.L. Berger and S.S. Gupta (Eds.) Academic Press, New York.
Le Cam, L.: 1986, Asymptotic Methods in Statistical Decision Theory, Springer-Verlag, New York.
Le Cam, L. and Yang, G.L.: 1990, Asymptotics in Statistics, Springer-Verlag, New York.
Lehmann, E. Lehmann, E.L.: 1993, Theory of Point Estimation J. Wiley, New York.
Lehmann, E.L. and Casella, G.: 1995, Theory of Point Estimation (revised edition). Wadsworth, Pacific Grove, California.
Lindley, D.: 1985, Making Decisions, Wiley, New York.
McCulloch, R. and Rossi, P.E.: 1992, ‘Bayes factors for nonlinear hypotheses and likelihood distributions’, Biometrika 79, 663–676.
Olver, F.W.J.: 1974, Asymptotics and Special Functions, Academic-Press, New York.
Pitman, E.: 1937, ‘The closest estimates of statistical parameters’, Proc. Cambridge Phil. Soc. 33, 212–222.
Rao, C.R.: 1981, ‘Some comments on the minimum mean square error as a criterion of estimation’, in Statistics and Related Topics, M. Csörgo, D. Dawson, J.N.K. Rao, and A. Saleh (Eds.), pp. 123–143.
Robert, C.P.: 1994, The Bayesian Choice, Springer-Verlag, New York.
Robert, C.P., Hwang, J.T.G. and Strawderman, W.E.: 1993, ‘Is Pitman nearness a reasonable criterion? (with discussion)’, J. American Statistical Assoc. 88(1), 57–76.
Rubin, H.: 1987, ‘A weak system of axioms for “rational” behavior and the nonseparability of utility from prior’, Statist. Decisions 5, 47–58.
Rukhin, A.: 1978, ‘Universal Bayes estimators’, Ann. Statist. 6, 345–351.
Shannon, C.: 1948, ‘A mathematical theory of communication’, Bell System Tech. J. 27, 379–423 and 623–656.
Stigler, S.: 1986, The History of Statistics Belknap, Harvard.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Robert, C.P. Intrinsic losses. Theor Decis 40, 191–214 (1996). https://doi.org/10.1007/BF00133173
Issue Date:
DOI: https://doi.org/10.1007/BF00133173