Skip to main content
Log in

Specifying the Relationship Between Crime and Prisons

  • Original Paper
  • Published:
Journal of Quantitative Criminology Aims and scope Submit manuscript

Abstract

There is no scholarly consensus as to the proper functional form of the crime equation, particularly with regard to one critical, explanatory variable—prison population. The critical questions are whether crime and prison rates must be differenced, whether they are cointegrated, and whether they are simultaneously determined—whether crime and prison cause one another. To determine the proper specification, different researchers have applied unit-root, cointegration, and Granger tests to very similar data sets and obtained very different results. This has led to very different specifications and predictably different implications for public policy. These differences are more likely to be due to the methods used, rather than to real differences among the data sets. When the best available methods are used to identify the proper specification for a panel of U.S. states, results are fairly clear. Crime rates and prison populations are close to unit-root; crime and prison are not cointegrated; crime clearly affects subsequent prison populations. Thus the best specification of the crime equation must rely on differenced data and instrumental variables. Alternative specifications run a substantial risk of spurious results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. Several studies reported results of several specifications. Table 1 reports results of that specification that produced the highest adjusted R 2 or lowest Bayesian information criterion. For the studies that did not use a log-log specification (e.g., Smith 2004), Table 1 shows the elasticity at average values of violent crime and prison. The figure reported for Marvell and Moody (2001) is the elasticity of homicide rate with regard to prison population; the figure reported for Liedka et al. (2006) is the elasticity of index crime rate with regard to prison population.

  2. Three parts are missing. Reliable crime data were not available from New York until 1965; Alaska did not report prison statistics to the National Prisoner Statistics program until 1971; in 2001, the U.S. Bureau of Prisons took over the District of Columbia prison system, merging its prisoners with other federal prisoners. Prison data were missing from four other states (Arkansas and Rhode Island, 1968–1970; Delaware and North Carolina, 1968); but these gaps were filled with the geometric mean of the prison population. Rates were then calculated in the usual way. Thus 2,280 cases are available for prison, and 2,290 cases for violent and property crime.

  3. An argument often used for restricting the crime series to 1971 and later is that the FBI’s definition of larceny changed to include thefts for which the dollar value was $50 or less. This increased the larceny rate considerably in some states. On the other hand, the point at which this change took place is known, and the increase can be dealt with effectively by including a dummy variable for the year of the change in all calculations. A 30% increase in the length of the series is well worth the additional trouble. In addition, a longer time-series is useful for two technical reasons. First, the number of states, N, is approximately the same as the number of years, T. Thus this data set better matches the assumptions of several statistical tests, which rely on asymptotic results based on the assumption that N and T both approach infinity at about the same rate. Second, particularly when N is small, estimates of serial correlation are biased toward zero (Kiviet 1995; Nickell 1981). The bias is caused by the fact that each y t−1 is dependent on a weighted average of previous values of the error term, which in a small sample will not generally sum to zero (Hurwicz 1950; Maeshiro 2000). Monte Carlo simulations (Judson and Owen 1999) show that the biases are relatively small for T > 30 (the law of large numbers takes care of the problem); the biases are also small when the coefficients of the lagged dependent variables are less than 0.2 (the weights on which y t−1 depends are themselves close to zero). Thus a longer time-series will both improve precision and reduce bias. The primary argument against a long time series is that the data-generating process (for example, the extent of serial correlation or the existence of cointegration) is more likely to shift over a longer time period. The simplest way to account for such changes over time is to model them explicitly.

  4. If the underlying statistic is close to normal-distributed, then (N − 1)(S/σ)2 will be χ2-distributed with N −1 degrees of freedom (Snedecor and Cochrane 1980), where N is the number of cross-sectional units (here, states), S is the standard deviation of test statistics across states, and σ is the expected standard deviation if the null hypothesis were correct (and thus all states were identical).

  5. For each variable y, the estimate shown is r(y t , y t−1). Consistent with prevailing practice, all variables are logged here, and in all ensuing analyses, to stabilize the variance over time and across states. Here and throughout the analysis, very similar results are obtained for unlogged variables.

  6. Even statistically significant differences are not policy relevant for our purposes. For example, the range of coefficients on the lagged dependent variable for property is .93 to .97—hardly a substantively important difference. Note, incidentally, that each of the 11-year averages is less than the average serial correlation over the entire time series, as one would expect if short time series provide biased results.

  7. A third approach is more direct, and draws on the emerging literature on long memory processes and fractional integration. Some series do not appear to be stationary in levels, but do not appear to be unit root, either. In these cases, it may be appropriate to “fractionally difference” the series, such that α (X t X t−1)δ = u t , where δ can be any real number between 0 and 1. If δ = 0, no differencing is required; if 0 < δ < .5, the series exhibits “long memory” (the effect of the u t diminishes very slowly over time), but the series itself is stationary; if δ > .5, the series is no longer stationary and is liable to produce spurious results in a regression, even if we account for autocorrelation in the usual way. Because the fractional differencing equation leads to an infinite series in the left-hand side of the equation, this approach is inconvenient and remains on the frontier of econometrics (see Baillie 1996 for a review). Nevertheless, it is possible to estimate the value of δ and its standard error (Geweke and Porter-Hudak 1988; Hassler 1993); if δ > .5, and particularly if δ≈ 1.0, we can be sure that some action (perhaps fractional differencing, perhaps simple first differencing) is needed to avoid spurious results. Unfortunately, standard errors for δ are very large when the time-series is short; if the Geweke and Porter-Hudak method is used when t = 46, it is not difficult to show that the standard error is approximately 0.39—obviously too large to produce usable estimates. For what it’s worth, the mean values of δ across states were close to one for prison (mean δ = 0.962), violent (mean δ = 0.849), and property (mean δ = 1.066), suggesting that the basic conclusion—these variables are close to unit-root—is justified, on average at least.

  8. Other unit-root tests are available, and there is some evidence that they are more powerful under some circumstances (Campbell and Perron 1991; Schwert 1989). Nevertheless, in this case at least, the statistic chosen has no appreciable effect on the results. Phillips–Perron tests and FGLS tests produced very similar results for each combination of state and variable, and led to identical conclusions. Because it is the standard test, appears to be robust in a wide variety of circumstances, and can easily be adjusted to account for cross-sectional dependence, only CADF test results are reported in the text.

  9. Cross-sectional dependence is substantial for these data. All of the crime studies examined in Table 1 accounted for it in the same way, by controlling for year fixed effects in the specification. It is not difficult to show that this is more or less equal to including the national average of the dependent variable in the specification (it is exactly the same if cases are weighted by state population). Year fixed effects were always (collectively) statistically significant, and including them typically increases R 2 by more than adding any other variable. Perhaps more to the point, if cross-sectional dependence is not accounted for in the unit-root tests, test results across states have less variability than expected, if the unit-root hypothesis were correct and the only deviations were due to sampling errors.

  10. This is the usual explanation of the KPSS test. A more complete explanation is that it decomposes the time series into a deterministic time trend, a stationary error term, and a random walk component (the cause of any nonstationarity). KPSS then tests the variance of the random walk component against a null hypothesis of zero. Effectively, then, KPSS assumes that ρ = 0.

  11. A variety of other tests have been suggested, including a χ2 test of the state-by-state significance levels (Maddala and Wu 1989) and a Z test of the log of significance levels (Choi 2001). These tests are not reported here because state-by-state significance levels were based on Normal approximations to the KPSS and CADF distributions, and are thus not exact. When these tests are applied to approximate significance levels, panel-wide significance levels are very similar to those shown below, and they lead to identical conclusions.

  12. This procedure was originally recommended by Kwiatkowski et al. (1992). It has been modified slightly to account for recent findings as to the lack of power of the KPSS test in the face of fractional integration (that is, non-zero δ) and large (but non-unit) values of ρ (Hassler and Wolters 1995; Muller 2005).

  13. It may also arise if the series is fractionally integrated with .5 < δ < 1.

  14. If the series is in fact fractionally integrated, then there is a substantial risk of spurious regression results, even if we account for autocorrelation in the residuals. The reason is that the autoregressive function in a long memory model decays much more slowly than in the usual autoregressive, “short memory” model. Thus estimated generalized least squares or Newey–West estimators of the covariance matrix will underadjust, and substantial autocorrelation will remain in the residuals. To some extent this can be dealt with by adjusting for a longer lag, but the usual procedures for lag selection (e.g., Andrews 1991) will systematically understate the number of lags required.

  15. That is, if we were to rely on a single specification for all states, it must rely on differenced variables. An argument can be made for using two specifications, differencing the majority of states that appear to be unit-root and using FGLS or computing Newey–West standard errors for the rest. Since the two procedures produce unbiased estimates of the same coefficients, this would allow for more efficient estimation in the undifferenced states without changing the results. Given that all states appear to be at least near-integrated in all three variables, however, the efficiency gains would be minor, and are probably not worth the substantial logistical difficulties.

  16. It may also be that violent and property crime rates are cointegrated with one another. Recall that prison was chosen as a normalizing variable on substantive, not statistical grounds. If our only interest were in finding some evidence of a cointegrating relationship of some kind, it would make more sense to normalize on violent or property, instead. Because the stationarity characteristics of the residuals of the cointegrating equation are liable to mirror those of the dependent variable, choosing a dependent variable that is closer to stationarity maximizes the chances of finding evidence of cointegration. When the procedures used in the text were duplicated with violent as the normalizing variable, there was clear evidence of a (usually positive) cointegrating relationship between violent and property in some states, and residuals were stationary in five states. Thus there appears to be a long-run equilibrium relationship between violent and property crime rates, in a few states, at least.

  17. Some argue for including control variables in the Granger equations to further reduce the risk of omitted variable bias. In our context, Useem et al. (2001, pp. 7–8), have disputed Marvell and Moody’s (1994) Granger test results on these grounds. If the nature and functional form of the control variables were known for both the crime and prison equations, this would no doubt be good advice. However, if we are unsure as to which control variables should be included and how, adding any controls lends a false sense of conclusiveness to what is essentially a quick-and-dirty diagnostic device. For this data set, there is a more fundamental problem: The best, single predictor of state crime rates, unemployment rates, were not collected at the state level by the U.S. Department of Labor until the early 1970s. Rather than risk substantial omitted variable bias, it makes more sense to conduct the Granger test in the usual way and caveat the findings appropriately.

  18. This follows from the finding that, in short time-series, the coefficient on the lagged dependent variable is biased toward zero. See footnote 3, above, for references.

  19. It may not. Both the FGLS and Newey–West procedures require estimates of the serial correlation in the residuals, and if this estimate is in error it may lead to misleading standard errors. The usual advice is to ignore serial correlation unless |ρ| > .3 or so (Kennedy 2004, p. 145).

  20. When DeFina and Arvanites’ (2002) analysis is replicated using Levitt’s (1996) data and specification, which relies on differences rather than levels, deviations among states in the effect of prison on violent and property are not statistically significant. Results are available upon request.

  21. In the case of the Liedka et al. (2006) study, the problem may have been exacerbated by the use of total crime rates (that is, violent plus property crime rates) as the dependent variable. For the 1960–2004 sample, the correlation between ΔC t and ΔC t−1 is .362, slightly larger than the correlations for violent and property crimes taken separately. This correlation is positive in 49 of 51 states, and greater than .30 in 36 states. Results are almost identical for the period covered by the Liedka et al. study.

  22. In a county-level data set, Spelman (2005) used local criminal justice system resources, local jail capacity, twice-lagged arrest rates, and conservative voting patterns as instruments for jail and prison populations. These instruments were effective predictors of prison (F = 72.8, ΔR 2 = .192) but not of crime (for violent crime, F = 0.37, ΔR 2 = .0016; for property crime, F = 0.93, ΔR 2 = .0035). Whether these instruments behave similarly in a state panel is an open question, but it would be premature to state that no effective instruments are available.

  23. A third possibility was raised by Western (2006): Even if we cannot come up with an unbiased estimate of the effect of crime rates on prison populations, we may be able to use available information to bound that relationship. Western suggests an upper bound of +.15, and uses it to estimate a lower bound (that is, the most negative possible estimate) for the effect of prison on crime. Although this is an interesting proposal, it would be difficult to estimate a valid bound on either relationship without using an instrument of some kind.

References

  • Andrews DWK (1991) Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59:817–858

    Article  Google Scholar 

  • Baillie RT (1996) Long memory processes and fractional integration in econometrics. J Economet 73:5–59

    Article  Google Scholar 

  • Besci Z (1999) Economics and crime in the states. Econ Rev Federal Reserve Bank of Atlanta 84:38–56

    Google Scholar 

  • Black D, Nagin D (1998) Do right-to-carry laws deter violent crime? J Legal Stud 27:209–219

    Article  Google Scholar 

  • Blumstein A, Cohen J (1973) A theory of the stability of punishment. J Crim Law Criminol Police Sci 64:198–207

    Article  Google Scholar 

  • Blumstein A, Cohen J, Hsieh P (1983) The duration of adult criminal careers. Final report. National Institute of Justice, Washington, DC

    Google Scholar 

  • Bound J, Jaeger DA, Baker RM (1995) Problems with instrumental variables estimation when the correlation between the instruments and the endogenous explanatory variable is weak. J Am Stat Assoc 90:443–450

    Article  Google Scholar 

  • Campbell JY, Perron P (1991) Pitfalls and opportunities: what economists should know about unit roots. NBER Technical Working Paper 100. National Bureau of Economic Research, Cambridge, MA

  • Choi I (2001) Unit root tests for panel data. J Int Money Bank 20:249–272

    Article  Google Scholar 

  • Choi C-Y, Hu L, Ogaki M (2006) Robust estimation for structural spurious regressions and a Hausman-type cointegration test. Working paper. Ohio State University, Department of Economics, Columbus, OH

  • Cochrane D, Orcutt GH (1949) Application of least squares regression to relationships containing autocorrelated error terms. J Am Stat Assoc 44:32–61

    Article  Google Scholar 

  • DeBoef S, Granato J (1997) Near-integrated data and the analysis of political relationships. Am J Polit Sci 41:619–640

    Article  Google Scholar 

  • DeFina RH, Arvanites TM (2002) The weak effect of imprisonment on crime: 1971–1998. Soc Sci Quart 83:635–653

    Article  Google Scholar 

  • DeJong C (1997) Survival analysis and specific deterrence: integrating theoretical and empirical models of recidivism. Criminology 35:561–576

    Article  Google Scholar 

  • Dezhbakhsh H, Rubin P, Shepherd J (2003) Does capital punishment have a deterrent effect? New evidence from postmoratorium panel data. Am Econ Rev 344:203

    Google Scholar 

  • Dickey DA, Fuller WA (1979) Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc 74:427–431

    Article  Google Scholar 

  • Donohue JJ, Levitt SD (2001) The impact of legalized abortion on crime. Quart J Econ 116:379–420

    Article  Google Scholar 

  • Donohue J, Wolfers J (2005) Uses and abuses of statistical evidence in the death penalty debate. Stanford Law Rev 58:787

    Google Scholar 

  • Engle RF, Granger CWJ (1987) Co-integration and error correction: representation, estimation, and testing. Econometrica 55:251–276

    Article  Google Scholar 

  • Geweke J, Porter-Hudak S (1988) The estimation and application of long memory time series models. J Time Ser Anal 4:221–238

    Google Scholar 

  • Geweke J, Meese R, Dent W (1983) Comparing alternative tests of causality in temporal systems: analytic results and experimental evidence. J Economet 21:161–194

    Article  Google Scholar 

  • Granger CWJ (1969) Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37:424–438

    Article  Google Scholar 

  • Granger CWJ, Newbold P (1974) Spurious regressions in econometrics. J Economet 2:111–120

    Article  Google Scholar 

  • Greenberg DF, West V (2001) State prison populations and their growth, 1971–1991. Criminology 39:615–653

    Article  Google Scholar 

  • Greene WH (2002) Econometric analysis, 5th edn. Prentice-Hall, New York

    Google Scholar 

  • Guilkey DK, Salemi MK (1982) Small-sample properties of three tests for Granger-causal ordering in a bivariate stochastic system. Rev Econ Stat 64:668–680

    Article  Google Scholar 

  • Hadri K, Larsson R (2005) Testing for stationarity in heterogeneous panel data where the time dimension is finite. Economet J 8:55–69

    Article  Google Scholar 

  • Harrison PM, Beck AJ (2006) Prisoners in 2005. Bureau of Justice Statistics Bulletin. U.S. Department of Justice, Bureau of Justice Statistics, Washington, DC

  • Hassler U (1993) Regression of spectral estimators with fractionally integrated time series. J Time Ser Anal 14:369–380

    Google Scholar 

  • Hassler U, Wolters J (1995) Long memory in inflation rates: international evidence. J Bus Econ Stat 13:37–45

    Article  Google Scholar 

  • Hurwicz L (1950) Least squares bias in time series. In: Koopmans TC (ed) Statistical inference in dynamic economic models. Cowles Commission Monograph no. 10. Wiley, New York, pp 365–383

  • Im KS, Pesaran MH, Shin Y (2003) Testing for unit roots in heterogeneous panels. J Economet 115:53–74

    Article  Google Scholar 

  • Jacobs D, Carmichael JT (2001) The politics of punishment across time and space: a pooled time-series analysis of imprisonment rates. Soc Forces 80:61–91

    Article  Google Scholar 

  • Johansen S, Juselius K (1990) Maximum likelihood estimation and inference on cointegration—with applications to the demand for money. Oxford Bull Econ Stat 52:169–210

    Article  Google Scholar 

  • Judson RA, Owen AL (1999) Estimating dynamic panel data models: a guide for macroeconomists. Econ Lett 65:9–15

    Article  Google Scholar 

  • Kennedy P (2004) A guide to econometrics, 5th edn. MIT Press, Cambridge, MA

    Google Scholar 

  • Kiviet JF (1995) On bias, inconsistency, and efficiency of various estimators in dynamic panel data models. J Economet 68:53–78

    Article  Google Scholar 

  • Kovandzic TV, Vieraitis LM (2006) The effect of county-level prison population growth on crime rates. Criminol Public Pol 5:213–244

    Article  Google Scholar 

  • Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root. J Economet 54:91–115

    Article  Google Scholar 

  • Levitt SD (1996) The effect of prison population size on crime rates: evidence from prison overcrowding litigation. Quart J Econ 111:319–351

    Article  Google Scholar 

  • Liedka RV, Piehl AM, Useem B (2006) The crime-control effect of incarceration: does scale matter? Criminol Public Pol 5:245–276

    Article  Google Scholar 

  • Lott J (1998) More guns, less crime: understanding crime and gun-control laws. University of Chicago Press, Chicago

    Google Scholar 

  • MacKinnon JG (1996) Numerical distribution functions for unit root and cointegration tests. J Appl Economet 11:601–618

    Article  Google Scholar 

  • Maddala GS, Wu S (1989) A comparative study of unit root tests with panel data and a new simple test. Oxford Bull Econ Stat 61:631–652

    Article  Google Scholar 

  • Maeshiro A (2000) An illustration of the bias of OLS for Y t  = λY t−1 + U t . J Econ Educ 31:76–80

    Article  Google Scholar 

  • Marvell TB, Moody CE Jr (1994) Prison population growth and crime reduction. J Quant Criminol 10:109–140

    Article  Google Scholar 

  • Marvell TB, Moody CE (2001) The lethal effects of three-strikes laws. J Legal Stud 30:89–106

    Article  Google Scholar 

  • Muller UK (2005) Size and power of tests for stationarity in highly autocorrelated time series. J Economet 128:195–213

    Article  Google Scholar 

  • Newey WK, West KD (1987) A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix. Econometrica 55:703–708

    Article  Google Scholar 

  • Nickell S (1981) Biases in dynamic models with fixed effects. Econometrica 49:1417–1426

    Article  Google Scholar 

  • Osterwald-Lenum M (1992) A note with quantiles of the asymptotic distribution of maximum likelihood cointegration rank test statistics. Oxford Bull Econ Stat 54:461–472

    Google Scholar 

  • Pesaran MH (2007) A simple panel unit root test in the presence of cross section dependence. J Appl Economet 22:265–312

    Article  Google Scholar 

  • Pierce DA (1977) Relationships—and the lack thereof—between economic time series, with specific reference to money and interest rates. J Am Stat Assoc 72:11–26

    Article  Google Scholar 

  • Raphael S, Winter-Ebmer R (2001) Identifying the effect of unemployment on crime. J Law Econ 44:259–283

    Article  Google Scholar 

  • Rose DR, Clear TR (1998) Incarceration, social capital, and crime: examining the unintended consequences of incarceration. Criminology 36:441–479

    Article  Google Scholar 

  • Said SA, Dickey DA (1984) Testing for unit roots in autoregressive moving average models of unknown order. Biometrika 71:599–607

    Article  Google Scholar 

  • Sargent TJ (1976) A classic macroeconomic model for the United States. J Polit Econ 84:207–238

    Article  Google Scholar 

  • Schwert GW (1989) Tests for unit roots: a Monte Carlo investigation. J Bus Econ Stat 7:147–160

    Article  Google Scholar 

  • Sims CA (1972) Money, income, and causality. Am Econ Rev 62:540–552

    Google Scholar 

  • Smith KB (2004) The politics of punishment: evaluating political explanations of incarceration rates. J Polit 66:925–938

    Article  Google Scholar 

  • Snedecor GW, Cochrane WG (1980) Statistical methods, 7th edn. Iowa State University Press, Ames, IA

    Google Scholar 

  • Spelman W (1994) Criminal incapacitation. Plenum Press, New York

    Google Scholar 

  • Spelman W (2000) The limited importance of prison expansion. In: Blumstein A, Wallman J (eds) The crime drop in America. Cambridge University Press, Cambridge, UK

  • Spelman W (2005) Jobs or jails? The crime drop in Texas. J Pol Anal Manage 24:133–165

    Article  Google Scholar 

  • Stock JH, Watson MW (1988) Testing for common trends. J Am Stat Assoc 83:1097–1107

    Article  Google Scholar 

  • Thornton DL, Batten DS (1985) Lag-length selection and tests of Granger causality between money and income. J Money Credit Bank 17:164–178

    Article  Google Scholar 

  • Tremblay P (1986) The stability of punishment: a follow-up of Blumstein’s hypothesis. J Quant Criminol 2:157–180

    Article  Google Scholar 

  • Useem B, Piehl AM, Liedka RV (2001) The crime-control effect of incarceration: reconsidering the evidence. Final report. U.S. Department of Justice, National Institute of Justice, Washington, DC

  • Western B (2006) Punishment and inequality in America. Russell Sage Foundation, New York

    Google Scholar 

  • Witt R, Witte A (2000) Crime, prison, and female labor supply. J Quant Criminol 16:69–85

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to William Spelman.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Spelman, W. Specifying the Relationship Between Crime and Prisons. J Quant Criminol 24, 149–178 (2008). https://doi.org/10.1007/s10940-008-9042-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10940-008-9042-x

Keywords

Navigation