Skip to main content
Log in

Generalized linear mixed models: a review and some extensions

Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

Breslow and Clayton (J Am Stat Assoc 88:9–25,1993) was, and still is, a highly influential paper mobilizing the use of generalized linear mixed models in epidemiology and a wide variety of fields. An important aspect is the feasibility in implementation through the ready availability of related software in SAS (SAS Institute, PROC GLIMMIX, SAS Institute Inc., URL http://www.sas.com, 2007), S-plus (Insightful Corporation, S-PLUS 8, Insightful Corporation, Seattle, WA, URL http://www.insightful.com, 2007), and R (R Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, URL http://www.R-project.org, 2006) for example, facilitating its broad usage. This paper reviews background to generalized linear mixed models and the inferential techniques which have been developed for them. To provide the reader with a flavor of the utility and wide applicability of this fundamental methodology we consider a few extensions including additive models, models for zero-heavy data, and models accommodating latent clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Abramowitz, M, Stegun, IA (eds) (1984) Handbook of mathematical functions with formulas, graphs, and mathematical tables. A Wiley-Interscience Publication, John Wiley, New York

    MATH  Google Scholar 

  • Ainsworth L, Dean CB (2007) Detection of local and global outliers in mapping studies. Environmetrics. doi:10.1002/env.851

  • Barndorff-Nielsen O, Cox DR (1979) Edgeworth and saddle-point approximations with statistical applications (with discussion). J R Stat Soc Ser B: Methodol 41: 279–299

    MATH  MathSciNet  Google Scholar 

  • Bates D, Sarkar D (2007) lme4: Linear mixed-effects models using S4 classes. R package version 0.99875–1

  • Breslow N (1989) Score tests in overdispersed GLM’s. In: Decarli A, Francis BJ, Gilchrist R, Seeber GUH, (eds) Statistical modelling. Springer-Verlag Inc., pp 64–74

  • Breslow NE, Clayton DG (1993) Approximate inference in generalized linear mixed models. J Am Stat Assoc 88: 9–25

    Article  MATH  Google Scholar 

  • Breslow NE, Lin X (1995) Bias correction in generalised linear mixed models with a single component of dispersion. Biometrika 82: 81–91

    Article  MATH  MathSciNet  Google Scholar 

  • Casella G, George EI (1992) Explaining the Gibbs sampler. Am Stat 46: 167–174

    Article  MathSciNet  Google Scholar 

  • Chen J, Zhang D, Davidian M (2002) A Monte Carlo EM algorithm for generalized linear mixed models with flexible random effects distribution. Biostatistics (Oxford) 3: 347–360

    Article  MATH  Google Scholar 

  • Cheng KF, Wu JW (1994) Testing goodness of fit for a parametric family of link functions. J Am Stat Assoc 89: 657–664

    Article  MATH  MathSciNet  Google Scholar 

  • Cook RD, Weisberg S (1989) Regression diagnostics with dynamic graphics (C/R: P293–311). Technometrics 31: 277–291

    Article  MathSciNet  Google Scholar 

  • Davidian M, Carroll RJ (1987) Variance function estimation. J Am Stat Assoc 82: 1079–1091

    Article  MATH  MathSciNet  Google Scholar 

  • Dean CB, Balshaw R (1997) Efficiency lost by analyzing counts rather than event times in Poisson and overdispersed Poisson regression models. J Am Stat Assoc 92: 1387–1398

    Article  MATH  MathSciNet  Google Scholar 

  • Dean C, Lawless JF, Willmot GE (1989) A mixed Poisson-inverse–Gaussian regression model. Can J Stat 17: 171–181

    Article  MATH  MathSciNet  Google Scholar 

  • Dubin JA, Han L, Fried TR (2007) Triggered sampling could help improve longitudinal studies of persons with elevated mortality risk. J Clin Epidemiol 60: 288–93

    Article  Google Scholar 

  • Green PJ, Silverman BW (1994) Nonparametric regression and generalized linear models: a roughness penalty approach. Chapman & Hall Ltd.

  • Harville DA (1977) Maximum likelihood approaches to variance component estimation and to related problems. J Am Stat Assoc 72: 320–338

    Article  MATH  MathSciNet  Google Scholar 

  • Hastie T, Tibshirani R (1999) Generalized additive models. Chapman & Hall Ltd.

  • Henderson R, Shimakura S (2003) A serially correlated gamma frailty model for longitudinal count data. Biometrika 90: 355–366

    Article  MATH  MathSciNet  Google Scholar 

  • Insightful Corporation (2007) S-PLUS 8. Insightful Corporation, Seattle, WA. URL http://www.insightful.com, accessed on 25 October 2007

  • Jiang J (1998) Consistent estimators in generalized linear mixed models. J Am Stat Assoc 93: 720–729

    Article  MATH  Google Scholar 

  • Jørgensen B (1982) Statistical properties of the generalized inverse Gaussian distribution. Lecture Notes in Statistics, vol 9. Springer-Verlag, New York

  • Kleinman K, Lazarus R, Platt R (2004) A generalized linear mixed models approach for detecting incident clusters of disease in small areas, with an application to biological terrorism. Am J Epidemiol 159: 217–24

    Article  Google Scholar 

  • Laird NM (1991) Topics in likelihood-based methods for longitudinal data analysis. Statistica Sinica 1: 33–50

    MATH  Google Scholar 

  • Laird NM, Louis TA (1982) Approximate posterior distributions for incomplete data problems. J R Stat Soc Ser B: Methodol 44: 190–200

    MATH  MathSciNet  Google Scholar 

  • Lambert D (1992) Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 34: 1–14

    Article  MATH  Google Scholar 

  • Lawless JF (1987) Negative binomial and mixed P. Can J Stat 15: 209–225

    Article  MATH  MathSciNet  Google Scholar 

  • Lawless JF, Zhan M (1998) Analysis of interval-grouped recurrent-event data using piecewise constant rate functions. Can J Stat 26: 549–565

    Article  MATH  Google Scholar 

  • Liang K-Y, Zeger SL (1986) Longitudinal data analysis using generalized linear models. Biometrika 73: 13–22

    Article  MATH  MathSciNet  Google Scholar 

  • Lin X, Breslow NE (1996) Bias correction in generalized linear mixed models with multiple components of dispersion. J Am Stat Assoc 91: 1007–1016

    Article  MATH  MathSciNet  Google Scholar 

  • Lin X, Zhang D (1999) Inference in generalized additive mixed models by using smoothing splines. J R Stat Soc Ser B: Stat Methodol 61: 381–400

    Article  MATH  MathSciNet  Google Scholar 

  • Lin X, Harlow SD, Raz J, Harlow SD (1997) Linear mixed models with heterogeneous within-cluster variances. Biometrics 53: 910–923

    Article  MATH  MathSciNet  Google Scholar 

  • Lindstrom MJ, Bates DM (1988) Newton-Raphson and EM algorithms for linear mixed-effects models for repeated-measures data. J Am Stat Assoc 83: 1014–1022, Corr: 94V89, p 1572

    Article  MATH  MathSciNet  Google Scholar 

  • Martin TG, Wintle BA, Rhodes JR, Kuhnert PM, Field SA, Low-Choy SJ, Tyre AJ, Possingham HP (2005) Zero tolerance ecology: improving ecological inference by modelling the source of zero observations. Ecol Lett 8: 1235–1246

    Article  Google Scholar 

  • McCullagh P, Nelder JA (1989) Generalized linear models. Chapman & Hall Ltd.

  • McCulloch CE (1997) Maximum likelihood algorithms for generalized linear mixed models. J Am Stat Assoc 92: 162–170

    Article  MATH  MathSciNet  Google Scholar 

  • Nelder JA, Pregibon D (1987) An extended quasi-likelihood function. Biometrika 74: 221–232

    Article  MATH  MathSciNet  Google Scholar 

  • Nielsen JD, Dean CB (2007) Clustered mixed nonhomogeneous Poisson process spline models for the analysis of recurrent event panel data. Biometrics. doi:10.1111/j.1541-0420.2007.00940.x

    Google Scholar 

  • Nodtvedt A, Dohoo I, Sanchez J, Conboy G, DesCjteaux L, Keefe G, Leslie K, Campbell J (2002) The use of negative binomial modelling in a longitudinal study of gastrointestinal parasite burdens in Canadian dairy cows. Can J Vet Res 66: 249–257

    Google Scholar 

  • Pierce DA, Schafer DW (1986) Residuals in generalized linear models. J Am Stat Assoc 81: 977–986

    Article  MATH  MathSciNet  Google Scholar 

  • Pinheiro JC, Bates DM (2000) Mixed-effects models in S and S-PLUS. Springer-Verlag Inc.

  • R Development Core Team (2006) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org, accessed on 25 October 2007

  • Raudenbush SW, Yang M-L, Yosef M (2000) Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. J Comput Graph Stat 9: 141–157

    Article  MathSciNet  Google Scholar 

  • Rich-Edwards JW, Kleinman KP, Strong EF, Oken E, Gillman MW (2005) Preterm delivery in Boston before and after September 11th, 2001. Epidemiology 16: 323–327

    Article  Google Scholar 

  • Rosen O, Jiang W, Tanner MA (2000) Mixtures of marginal models. Biometrika 87: 391–404

    Article  MATH  MathSciNet  Google Scholar 

  • Sartori N, Severini TA (2004) Conditional likelihood inference in generalized linear mixed models. Statistica Sinica 14: 349–360

    MATH  MathSciNet  Google Scholar 

  • SAS Institute (2007) PROC GLIMMIX. SAS Institute Inc. URL http://www.sas.com, accessed on 25 October 2007

  • Shun Z, McCullagh P (1995) Laplace approximation of high dimensional integrals. J R Stat Soc Ser B: Methodol 57: 749–760

    MATH  MathSciNet  Google Scholar 

  • Sichel HS (1974) On a distribution representing sentence-length in written prose. J R Stat Soc Ser A 137: 25–34

    Article  Google Scholar 

  • Simons JS, Neal DJ, Gaher RM (2006) Risk for marijuana-related problems among college students: an application of zero-inflated negative binomial regression. Am J Drug Alcohol Abuse 32: 41–53

    Article  Google Scholar 

  • Song PX-K, Fan Y, Kalbfleisch JD (2005) Maximization by parts in likelihood inference. J Am Stat Assoc 100: 1145–1158

    Article  MATH  MathSciNet  Google Scholar 

  • Stangle DE, Smith DR, Beaudin SA, Strawderman MS, Levitsky DA, Strupp BJ (2007) Succimer chelation improves learning, attention, and arousal regulation in lead-exposed rats but produces lasting cognitive impairment in the absence of lead exposure. Environ Health Perspect 115: 201–209

    Article  Google Scholar 

  • Tchetgen EJ, Coull BA (2006) A diagnostic test for the mixing distribution in a generalised linear mixed model. Biometrika 93: 1003–1010

    Article  MathSciNet  Google Scholar 

  • Tierney L, Kass RE, Kadane JB (1989) Approximate marginal densities of nonlinear functions. Biometrika 76: 425–433, Corr: V78, p233–234

    Article  MATH  MathSciNet  Google Scholar 

  • Tjur T (1982) A connection between Rasch’s item analysis model and a multiplicative Poisson model. Scand J Stat 9: 23–30

    MathSciNet  MATH  Google Scholar 

  • Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, New York. URL http://www.stats.ox.ac.uk/pub/MASS4, accessed on 25 October 2007

  • Vonesh EF, Wang H, Nie L, Majumdar D (2002) Conditional second-order generalized estimating equations for generalized linear and nonlinear mixed-effects models. J Am Stat Assoc 97: 271–283

    Article  MATH  MathSciNet  Google Scholar 

  • Waagepetersen R (2006) A simulation-based goodness-of-fit test for random effects in generalized linear mixed models. Scand J Stat 33: 721–731

    Article  MATH  MathSciNet  Google Scholar 

  • Wedderburn RWM (1974) Quasi-likelihood functions, generalized linear models, and the Gauss–Newton method. Biometrika 61: 439–447

    MATH  MathSciNet  Google Scholar 

  • White H (1982) Maximum likelihood estimation of misspecified models. Econometrica 50: 1–26

    Article  MATH  MathSciNet  Google Scholar 

  • Wood S (2006) mgcv: GAMs with GCV smoothness estimation and GAMMs by REML/PQL. R package version 1.3–24

  • Zeger SL, Karim MR (1991) Generalized linear models with random effects: a Gibbs sampling approach. J Am Stat Assoc 86: 79–86

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to C. B. Dean.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dean, C.B., Nielsen, J.D. Generalized linear mixed models: a review and some extensions. Lifetime Data Anal 13, 497–512 (2007). https://doi.org/10.1007/s10985-007-9065-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-007-9065-x

Keywords

Navigation