A comparative study of several smoothing methods in density estimation

https://doi.org/10.1016/0167-9473(92)00066-ZGet rights and content

Abstract

The theory of bandwidth choice in density estimation is developing very fast. Several methods (with plenty of varieties and subvarieties) have been recently proposed as an alternative to least squares cross-validation, the standard for years. This paper includes (a) A critical up-to-date review of the main methods currently available. The discussion provide some new insights on the important problem of estimating the minimization criteria and on the choice of pilot bandwidths in bootstrap-based methods. (b) An extensive simulation study of ten selected bandwidths. (c) A final discussion with some recommendations for practitioners. The conclusions are not easily summarized in a few words, because different cases have to be considered and important nuances must be pointed out. However, we could mention that the classical cross-validation bandwidths show, generally speaking, a relatively poor behavior (this is especially clear for the pseudo-likelihood method). On the other hand, although no selector appears to be uniformly better, the plug-in (in a similar version to that proposed by Sheather and Jones, J. Royal Statist. Soc. Ser. B 5 1991) and the (smoothed) bootstrap-based selectors show a fairly satisfactory performance which suggests that they could be the new standard methods for the problem of smoothing in density estimation. Interesting results are also obtained for a new type of bandwidths based on the number of inflection points.

References (54)

  • M. Broniatowski et al.

    On the relationship between stability of extreme order statistics and convergence of the maximum likelihood kernel density estimate

    Ann. Statist.

    (1989)
  • R. Cao-Abad

    Applicaciones y nuevos resultados del método bootstrap en la estimación no paramétrica de curvas

  • S.T. Chiu

    Bandwidth selection for kernel density estimation

    Ann. Statist.

    (1991)
  • Y.S. Chow et al.

    Consistent cross-validated density estimation

    Ann. Statist.

    (1983)
  • A. Cuevas et al.

    Data-driven smoothing based on convexity properties

  • L. Devroye

    A Course in Density Estimation

    (1987)
  • L. Devroye

    The double kernel method in density estimation

    Ann. Inst. Henri Poincaré

    (1989)
  • L. Devroye et al.

    Nonparametric Density Estimation: The L1-View

    (1985)
  • J.J. Faraway et al.

    Bootstrap choice of bandwidth for density estimation

    J. Amer. Statist. Assoc.

    (1990)
  • J.D.F. Habbema et al.

    A stepwise discrimation analysis program using density estimation

  • P. Hall

    Large-sample optimality of least-squares cross-validation in density estimation

    Ann. Statist.

    (1983)
  • P. Hall

    On Kullback—Leibler loss and density estimation

    Ann. Statist.

    (1987)
  • P. Hall et al.

    Empirical functionals and efficient smoothing parameter selection

    J.R. Statist. Soc. B

    (1992)
  • P. Hall et al.

    On the amount of noise inherent in bandwidth selection for a kernel density estimator

    Ann. Statist.

    (1987)
  • P. Hall et al.

    Extent to which least-squares cross-validation minimise integrated square error in nonparametric density estimation

    Probab. Th. Rel. Fields

    (1987)
  • P. Hall et al.

    Lower bounds for bandwidth selection in density estimation

    Probab. Th. Rel. Fields

    (1991)
  • P. Hall et al.

    Local minima in cross-validation functions

    J.R. Statist. Soc. B

    (1991)
  • Cited by (0)

    View full text