Elsevier

Journal of Corporate Finance

Volume 19, February 2013, Pages 1-19
Journal of Corporate Finance

Estimating dynamic panel models in corporate finance

https://doi.org/10.1016/j.jcorpfin.2012.09.004Get rights and content

Abstract

Dynamic panel models play a natural role in several important areas of corporate finance, but the combination of fixed effects and lagged dependent variables introduces serious econometric bias. Several methods of counteracting these biases are available and these methodologies have been tested on small datasets with independent, normally-distributed explanatory variables. However, no one has evaluated the methods' performance with corporate finance data, in which the dependent variable may be clustered or censored and independent variables may be missing, correlated with one another, or endogenous. We find that the data's properties substantially affect the estimators' performances. We provide evidence about the impact of various data set characteristics on the estimators, so that researchers can determine the best approach for their datasets.

Highlights

► We examine how to best estimate dynamic panel models in corporate finance. ► We show how different estimators dramatically affect coefficient estimates. ► We discuss how real-world corporate data issues affect estimation.

Introduction

Dynamic panel models play an increasingly prominent role in corporate finance research. Empirically understanding payout policy, capital structure, or investment decisions arguably requires the use of firm fixed effects to control for unobserved, time-invariant differences across firms.1 Yet uncorrected coefficient estimates for a dynamic panel model can be severely biased. Following on the observations of Nerlove (1967), Nickell (1981) established that OLS estimates of the lagged dependent variable's coefficient in a dynamic panel model are biased due to the correlation between the fixed effects and the lagged dependent variable (see also Baltagi, 2008). The bias is inversely related to panel length (“T”), but potentially severe biases remain even with T = 30 (Judson and Owen (1999)). Compustat firms have a mean (median) of 15 (11) years of annual data, well short of the number of observations required to make the bias negligible. Even when the researcher's primary concern lies elsewhere, a biased coefficient on the lagged dependent variable renders the other coefficient estimates suspect. As such, the short panel bias is a significant concern, and questions requiring dynamic panel models constitute some of the most contentious and unresolved areas of financial research.

The potential importance of choosing an appropriate estimation method for a dynamic panel model can be illustrated by recent efforts to estimate dynamic panel models of corporate leverage. Welch (2004) concludes that firms do not adjust toward target leverage; Fama and French (2002) estimate that firms adjust between 7 and 18% each year; Lemmon et al., (2008) estimate about 25% annually; Huang and Ritter (2009) estimate 17–23%; Flannery and Rangan (2006) estimate an adjustment speed above 30%. The econometric uncertainties associated with dynamic panel data have made it difficult to achieve consensus on the importance of adjustment behavior and of the factors affecting target leverage ratios. Similar problems exist in other areas of corporate finance research such as corporate governance (Wintoki et al., (2012)), cash management (Dittmar and Duchin (2010)), and investment financing (Bond et al., 2003), as well as growth and banking research.

Of course, econometric techniques have evolved to correct these biases, including instrumental variables (IV), generalized method of moments (GMM) estimators, long differencing (LD), and bias correction formulae. These methods have been tested on small datasets, most of which have at most one, normally distributed independent variable. Yet corporate finance studies include multiple independent variables, of which many exhibit endogeneity and serial correlation. Therefore, an estimator's performance in simple Monte Carlo simulations may not apply to more complex empirical analysis. We examine the statistical properties of seven alternative methods for estimating dynamic panel models: OLS, standard fixed effects (FE) estimation, Arellano and Bond's (1991) difference GMM, Blundell and Bond (1998) system GMM, two variations of long differencing (Hahn et al., 2007, Huang and Ritter, 2009), and corrected least-squares (Kiviet, 1995, Bruno, 2005). By simulating data that resembles “real” corporate finance data, we evaluate the performance of these estimators under conditions that are likely to apply to corporate finance research topics. Our goal is to provide sufficient analysis that corporate empiricists can identify the estimation technique most appropriate to their data.

The paper is organized as follows. Section 2 illustrates the importance of dynamic panel estimation biases in the context of firms' capital structure choices. Section 3 explains the econometric issues and describes existing methods for addressing them. Like Petersen (2009), we utilize Monte Carlo simulations to assess the performance of various estimators in different situations. Section 4 describes how we simulate datasets. In addition to a straightforward set of independent and identically distributed (iid) simulated datasets, we also simulate datasets using a variance–covariance structure generated from actual Compustat variables. Section 5 presents some initial results. First, we confirm that most of the proposed estimation methods yield reasonably accurate coefficient estimates when data and regression residuals are generated from iid error distributions. When the simulated explanatory variables mimic Compustat data, the estimated coefficient on the lagged dependent variable remains reliable, but estimation errors for some of the other explanatory variables' coefficients increase dramatically. In Section 6, we evaluate each estimator in the presence of common corporate data features, such as missing observations, unbalanced panel lengths, and dependent variable censoring. One estimation method (Kiviet's (1995) corrected least squares dependent variable, or LSDVC) emerges as the most accurate methodology across all these dataset conditions and Blundell and Bond's (1998) system GMM estimator is often the second best choice.

Unfortunately, both have limitations. Not only does LSDVC's computer memory requirement make it difficult to apply in large datasets, it assumes exogenous regressors. Blundell Bond GMM (like Arellano Bond GMM) assumes an absence of second order serial correlation. In Section 7, we explore how violating these assumptions affects the performance of all seven estimators. Both endogenous regressors and second-order serial correlation seriously compromise many of the estimation methodologies, consistent with the theoretical literature. Perhaps surprisingly, these complications can be large enough that there are occasions when the much maligned fixed effects estimator performs best. Section 8 concludes by offering guidance about the best way to approach dynamic panel estimation in a corporate finance context.

Section snippets

An example of short panel bias

This section uses a partial adjustment model of capital structure to demonstrate the severity of the short panel bias and to illustrate the need for appropriate econometric procedures. Fischer et al. (1989) argue that adjustment costs prevent firms from adjusting completely to their optimal leverage each period. An appropriate regression specification therefore must include a lagged dependent variable to control for the prior period's capital structure. At the same time, the available data do

The econometrics of dynamic panel models

We can generalize Eq. (2) to write a dynamic panel model asYit=γYit1+βXit+Fi+εit.

Estimating Eq. (3) via OLS yields biased and inconsistent results because OLS omits the fixed effect, Fi. The least squares dummy variable (or “fixed effect”, FE) estimate controls for the unobserved (time-invariant) heterogeneity, but it also yields biased coefficient estimates. Since Yit is a function of the fixed effect, the lagged dependent variable is correlated with the error term (Baltagi (2008)). The

Constructing simulated data

The true model underlying our generated data takes the general form:yit=γyit1+j=1Jβjxjit+ηi+εitwhere γ and βj are the primary coefficients to be estimated, xjit are explanatory variables, ηi is a firm fixed effect, and εit is the residual. Unlike most prior Monte Carlo studies, we include multiple (j > 1) explanatory variables and one of our main concerns is the impact of the explanatory variables' characteristics on estimators for Eq. (5). The explanatory variables evolve according toxijt=ρjxij

Initial results

This section compares the seven estimation methods across the iid and Compustat-based simulated datasets.

Further investigations of corporate finance dataset properties

Corporate finance datasets commonly include at least some of the following features:

  • Unbalanced panels

  • Missing observations

  • Censored or clustered data

Evaluating the effects of these data features on alternative estimators should help researchers determine which estimators to use, given what they know about their dataset properties. Given the larger RMSEs associated with Compustat-consistent data, we pay no further attention to iid datasets. We also limit these tests to panels of moderate length (T =

Correlation in the errors

Based on the simulations thus far, LSDVC and Blundell Bond appear to be the most reliable estimators. However, each methodology has a known shortcoming. LSDVC assumes the regressors are exogenous (and independent of the error term) and BB instrumental variables are invalidated by the presence of second order serial correlation. We next introduce both issues into our simulations and compare the performance of all seven estimators.

Conclusion: empirical implications for researchers

This paper examines the performance of seven econometric methodologies for estimating dynamic panel models in “realistic” corporate finance datasets. We consider two methods whose results are thought to be generally biased (OLS and fixed effects (FE)), plus five “advanced” estimation methods (Arellano Bond (AB), Blundell Bond (BB), longest differencing (LD), four period differencing (LD4), and least squares dummy variable correction (LSDVC)). Although each of the advanced methodologies has been

References (43)

  • R.A. Judson et al.

    Estimating dynamic panel data models: a guide for macroeconomists

    Econ. Lett.

    (1999)
  • J.F. Kiviet

    On bias, inconsistency, and efficiency of various estimators in dynamic panel data models

    J. Econometrics

    (1995)
  • L.E. Papke et al.

    Panel data methods for fractional response variables with an application to test pass rates

    J. Econometrics

    (2008)
  • M.B. Wintoki et al.

    Endogeneity and the dynamics of internal corporate governance

    J. Finan. Econ.

    (2012)
  • M. Arellano et al.

    Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations

    Rev. Econ. Stud.

    (1991)
  • B.H. Baltagi

    Econometric Analysis of Panel Data

    (2008)
  • S. Bond

    Dynamic panel data models: a guide to micro data methods and practice, practice

    Port. Econ. J.

    (2002)
  • S. Bond et al.

    Dynamic investment models and the firm's financial policy

    Rev. Econ. Stud.

    (1994)
  • S. Bond et al.

    Financial factors and investment in Belgium, France, Germany, and the United Kingdom: a comparison using company panel data

    Rev. Econ. Stat.

    (2003)
  • J. Bound et al.

    Problems with instrumental variables estimation when the correlation between the instruments and the endogenous explanatory variable is weak

    J. Am. Stat. Assoc.

    (1995)
  • X. Chang et al.

    Target behavior and financing: how conclusive is the evidence?

    J. Financ.

    (2009)
  • Cited by (509)

    View all citing articles on Scopus
    View full text