Skip to main content

Look Before You Analyze: Visualizing Data in Criminal Justice

  • Chapter
  • First Online:
Handbook of Quantitative Criminology

Abstract

Criminal justice data are hardly ever linear, normal, and/or independent, but most statistical techniques rely on the assumptions of linearity, normality and/or independence. Moreover, since the data are often entered by hand, they are prone to error. Plotting the data permits the analyst to determine the extent to which the assumptions are valid and to catch obvious errors in data entry. Moreover, while standard statistical techniques are useful in testing hypotheses, visualization allows the data to tell its story and thus is useful in generating hypotheses. This chapter provides some examples of how visualizing criminal justice data contributes to a fuller understanding of the processes that produced the data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Regarding data entry, one of my favorite quotes is from an economist in the 1920s (Stamp 1929: 258): “The Government are very keen on amassing statistics. They collect them, add them, raise them to the nth power, take the cube root and prepare wonderful diagrams. But what you must never forget is that every one of these figures comes in the first place from the chowty dar [village watchman]. who just puts down what he damn pleases.”

  2. 2.

    Freedman (1985: 308) notes that “investigators often talk about ‘modeling the data.’ This is almost perverse: surely the object is to model the phenomenon, and the data are interesting only because they contain information about that phenomenon. Whatever it is that most social scientists are doing when they construct regression models, discovering natural laws does not seem to be uppermost in their minds.”

  3. 3.

    As the president of the American Statistical Association recently (Morton, 2009) said: “[D]on’t trust complicated models as far as you can throw them – protect yourself and examine those data every which way. Are your assumptions correct? What are those pesky outliers up to? Let the data speak for themselves, rather than being masked and manipulated by complex methods.”

  4. 4.

    Those as fascinated as I am with the great variety of data visualization techniques should visit the website http://addictedtor.free.fr/graphiques/thumbs.php?sort$=$votes, which also includes the source code for the plots (in the computer language R). I thank Andrew Gelman for pointing out this treasure. Antony Unwin noted (personal communication) that Martin Theus developed Mondrian, an interactive graphics package that can be used for most data visualization purposes.

  5. 5.

    Visualization is the key ingredient in map-based analyses as well. It is not covered in this chapter; Ratcliffe (Chap. 2) describes how it can be used to find space-time patterns in crime data.

  6. 6.

    A similar strategy was used in cleaning FBI-collected arrest data (Maltz and Weiss 2007).

  7. 7.

    If the field contains five characters, as it might for the crime of larceny in large cities, then the missing value might be 99999.

  8. 8.

    The Crime Index is the sum of the crimes of murder, forcible rape, robbery, aggravated assault, burglary, larceny, and vehicle theft.

  9. 9.

    It turns out that, for many of the years included in the study, the only police department in Cook County Illinois reporting arrest statistics to the FBI was the Chicago Police Department.

  10. 10.

    Sometimes it takes another pair of eyes to see what may be obvious. I thank Howard Wainer for serving in that capacity.

  11. 11.

    These files have been prepared by Fox (2008) for over a decade. They aggregate all of the annual SHR reports from 1976 to the most current year, and can be downloaded from the National Archive of Criminal Justice Data (http://www.icpsr.umich.edu/nacjd). The most recent version contains data from 1976 through 2005.

  12. 12.

    See Maltz (1998) for how the smoothing was accomplished.

  13. 13.

    The peak values are indicated on the figures. Note that the scales of the male-offender and female-offender figures are not the same.

  14. 14.

    Again I thank Howard Wainer for pointing this out.

  15. 15.

    A researcher might posit a log-linear or a polynomial (i.e., quadratic or cubic) relationship, but even these are linear in some sense. Assuming that something is related to (for instance) the log of family income, plus age squared, plus education level cubed is also a linear assumption: the model implicitly assumes that the factors are additive, because statistical software can handle that kind of relationship more easily.

  16. 16.

    Gelman (2004) shows how “(a) exploratory and graphical methods can be especially effective when used in conjunction with models, and (b) model-based inference can be especially effective when checked graphically.”

References

  • Arbuthnot J (1710) An argument for divine providence taken from the constant regularity in the births of both sexes. Phil Trans R Soc 27:186–190

    Google Scholar 

  • Chernoff H (1973) The use of faces to represent points in k-dimensional space graphically. J Am Stat Assoc 68:342, 361–368

    Article  Google Scholar 

  • Cleveland WS (1993) Visualizing data. Hobart Press, Summit, NJ

    Google Scholar 

  • Cleveland WS (1994) The elements of graphing data. Hobart Press, Summit, NJ

    Google Scholar 

  • Cohen J (1994) The earth is round ({ p} < . 05). Am Psychol 49: 12, 997–1003

    Google Scholar 

  • Fox JA (2008) Uniform crime reports [United States]: supplementary homicide reports with multiple imputation, cumulative files 1976–2005 [computer file]. ICPSR22161-v1. Inter-university Consortium for Political and Social Research [producer and distributor], Ann Arbor, MI, May 12, 2008

    Google Scholar 

  • Francis B, Pritchard J (1998) Bertin, Lexis and the graphical representation of event histories. Bull Comite Francais de Cartographie 156:80–87

    Google Scholar 

  • Francis B, Pritchard J (2000) Visualisation of criminal careers. In: Erbacher RF, Chen PC, Roberts JC, Wittenbrink CM (eds) Visual data exploration and analysis VII. Proceedings of the Society of Photo-Optical Instrumentation Engineers (SPIE), vol 3960, 96–105

    Google Scholar 

  • Freedman DA (1985) Statistics and the scientific method. In: Mason WM, Fienberg SE (eds) Cohort analysis in the social sciences: beyond the identification problem. Springer-Verlag, New York

    Google Scholar 

  • Geertz C (2005). Thick description: toward an interpretive theory of culture, Chap 1. In Geertz C (1973) The interpretation of cultures: selected essays. Basic Books, New York

    Google Scholar 

  • Gelman A (2004) Exploratory data analysis for complex models (with discussion). J Comput Graph Stat 13(4): 755–779

    Article  Google Scholar 

  • Gelman A, Hill J (2007) Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, New York

    Google Scholar 

  • Gelman A, Pasarica C, Dodhia R (2002) Let’s practice what we preach: turning tables into graphs. Am Stat 56: 121–130

    Article  Google Scholar 

  • Harmon RJ, Gliner JA, Morgan GA, Leech NL (2001) Problems with null hypothesis significance testing. J Am Acad Child Adolesc Psychiatry 40(2):250–252

    Article  Google Scholar 

  • Ianni FAJ, Reuss-Ianni E (1972) A family business: kinship and social control in organized crime. Russell Sage Foundation, New York

    Google Scholar 

  • Lieberson S (1985) Making it count: the improvement of social research and theory. University of California Press, Berkeley, CA

    Google Scholar 

  • Loftus GR (1993) A picture is worth a thousand p values: on the irrelevance of hypothesis testing in the microcomputer age. Behav Res Meth Ins C 25(2): 250–256

    Article  Google Scholar 

  • Maltz MD (1994) Deviating from the mean: the declining significance of significance. J Res Crime Delinq 31(4): 434–463

    Article  Google Scholar 

  • Maltz MD (1996) Criminality in space and time: life course analysis and the micro-ecology of crime. In: Eck J, Weisburd D (eds) Crime and place. Criminal Justice Press, Monsey, NY

    Google Scholar 

  • Maltz MD (1998) Visualizing homicide: a research note. J Quant Criminol 14(4):397–410

    Article  Google Scholar 

  • Maltz MD (1999) Bridging gaps in police crime data, a discussion paper from the BJS Fellows Program. Bureau of Justice Statistics Report No. NCJ-1176365. Office of Justice Programs, U.S. Department of Justice, Washington, DC, September, 1999, 72 pp http://www.ojp.usdoj.gov/bjs/pub/pdf/bgpcd.pdf

  • Maltz MD (2006) Some p-baked thoughts ({ p} > 0. 5) on experiments and statistical significance. J Exp Criminol 1(2):211–226

    Google Scholar 

  • Maltz MD (2008) Waves, particles, and crime. In: Bernasco W, Bruinsma G, Weisburd D (eds) Units of analysis in the study of the geography of crime. Springer, New York

    Google Scholar 

  • Maltz MD, Gordon AC, Friedman W (1991) Mapping crime in its community setting: event geography analysis. Springer, New York. Internet version published in 2000; http://www.uic.edu/depts/lib/forr/

  • Maltz MD, Mullany J (2000) Visualizing lives: new pathways for analyzing life course histories. J Quant Criminol 16(2):255–281

    Article  Google Scholar 

  • Maltz MD, Weiss HE (2006) Creating a UCR utility, final report to the National Institute of Justice. Criminal Justice Research Center and Department of Sociology, The Ohio State University, February 23, 2006

    Google Scholar 

  • Maltz MD, Weiss HE (2007) Analysis of FBI-collected arrest data, final report to the Bureau of Justice Statistics. Criminal Justice Research Center and Department of Sociology, The Ohio State University, December 31, 2007

    Google Scholar 

  • Maltz MD, Zawitz MW (1998) Displaying violent crime trends using estimates from the National Crime Victimization Survey. Bureau of Justice Statistics Technical Report. No. NCJ 167881. Office of Justice Programs, U.S. Department of Justice, Washington, DC http://www.ojp.usdoj.gov/bjs/pub/pdf/dvctue.pdf

  • Memon N, Larsen HL (2006) Investigative data mining toolkit: a software prototype for visualizing, analyzing and destabilizing terrorist networks. In: Visualising network information. Meeting Proceedings RTO-MP-IST-063, Paper 14. RTO, Neuilly-sur-Seine, France, pp 14-1–14-24. ftp://ftp.rta.nato.int/PubFullText/RTO/MP/RTO-MP-IST-063/MP-IST-063--14.pdf

  • Morselli C (2005) Contacts, opportunity, and criminal enterprise. University of Toronto Press, Toronto, Ontario

    Google Scholar 

  • Morselli C (2008) Inside criminal networks. Springer, New York

    Google Scholar 

  • Morton S (2009) ASA president delivers NC State commencement speech. AMSTAT News July 2009, p 13

    Google Scholar 

  • Payne D, Maltz MD, Peterson R, Krivo L (2007) Domestic violence in in Ohio 2004, a report to the State of Ohio’s Office of Criminal Justice Services, Criminal Justice Research Center, The Ohio State University, March 5, 2007 http://www.ocjs.ohio.gov/research/Final%20OIBRS%20Report%203–26–07.pdf

  • Rall T (2008) How deterrence works. January 7, 2008. { ©} 2008 Ted Rall, Distributed by Universal Press Syndicate, Kansas City, MO

    Google Scholar 

  • Reiss AJ Jr (1986) Co-offending influences on criminal careers. In: Blumstein A, Cohen J, Roth JA, Visher CA (eds) Criminal Careers and career criminals, vol 2. National Academy of Sciences, Washington, DC, pp 121–160

    Google Scholar 

  • Reiss AJ Jr (1988) Co-offending and criminal careers. In: Tonry M, Morris N (eds) Crime and justice: a review of research, vol 10. University of Chicago Press, Chicago

    Google Scholar 

  • Reiss AJ Jr, Farrington DP (1991) Advancing knowledge about co-offending: results from a prospective survey of London males. J Crim Law Criminol 82:360–395

    Article  Google Scholar 

  • Stamp J (1929) Some economic factors in modern life. P. S. King, London, England

    Google Scholar 

  • Theus M, Urbanek S (2008) Interactive graphics for data analysis: principles and examples. CRC Press, Boca Raton, FL

    Google Scholar 

  • Tufte ER (1983) The visual display of quantitative information. Graphics Press, Cheshire, CT

    Google Scholar 

  • Tukey JW (1977) Exploratory data analysis. Addison-Wesley, Reading, MA

    Google Scholar 

  • Unwin A, Theus M, Hofmann H (2006) Graphics of large datasets: visualizing a million. Springer, New York

    Google Scholar 

  • Vaupel JW, Zhenglian W, Andreev KF, Yashin AI (1998) Population data at a glance: shaded contour maps of demographic surfaces over age and time. Odense University Press, Odense, Denmark

    Google Scholar 

  • Wainer H (1983) On multivariate display. In: Rizvi MH, Rustagi JS, Siegmund D (eds) Recent advances in statistics. Academic Press, New York

    Google Scholar 

  • Wainer H (1997) Visual revelations: graphical tales of fate and deception from Napoleon Bonaparte to Ross Perot. Copernicus (Springer-Verlag), New York

    Google Scholar 

  • Wainer H (2005) Graphic discovery: a trout in the milk and other visual adventures. Princeton University Press, Princeton, NJ

    Google Scholar 

  • Wainer H (2009) Picturing the uncertain world: how to understand, communicate and control uncertainty through graphical display. Princeton University Press, Princeton, NJ

    Google Scholar 

  • Wainer H, Spence I (2005) The commercial and political atlas and statistical breviary, by William Playfair (1801), 3rd edn. Cambridge University Press, New York

    Google Scholar 

  • Wilkinson L (1999) The grammar of graphics. Springer, New York

    Google Scholar 

  • Wilkinson L, the Task Force on Statistical Inference (1999) Statistical methods in psychology journals: guidelines and explanations. Am Psychol 54(8):594–604

    Google Scholar 

  • Xu J, Chen H (2005) Criminal network analysis and visualization. Commun ACM 48(6):100–107

    Article  Google Scholar 

  • Zimring FE (1981) Kids, groups and crime: some implications of a well-known secret. J Crim Law Criminol 72: 867–885

    Article  Google Scholar 

Download references

Acknowledgements

Over the years my research has been supported by the National Institute of Justice, the Bureau of Justice Statistics, the Ohio Office of Criminal Justice Services, the California Department of Corrections and Rehabilitation, and the American Statistical Association, which I acknowledge with gratitude. I am indebted to Andrew Gelman and Howard Wainer for their comments on earlier versions.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Maltz, M.D. (2010). Look Before You Analyze: Visualizing Data in Criminal Justice. In: Piquero, A., Weisburd, D. (eds) Handbook of Quantitative Criminology. Springer, New York, NY. https://doi.org/10.1007/978-0-387-77650-7_3

Download citation

Publish with us

Policies and ethics