Skip to main content

Advertisement

Log in

Crime detection and criminal identification in India using data mining techniques

  • Open Forum
  • Published:
AI & SOCIETY Aims and scope Submit manuscript

Abstract

In the current paper, we propose an approach for the design and implementation of crime detection and criminal identification for Indian cities using data mining techniques. Our approach is divided into six modules, namely—data extraction (DE), data preprocessing (DP), clustering, Google map representation, classification and WEKA® implementation. First module, DE extracts the unstructured crime dataset from various crime Web sources, during the period of 2000–2012. Second module, DP cleans, integrates and reduces the extracted crime data into structured 5,038 crime instances. We represent these instances using 35 predefined crime attributes. Safeguard measures are taken for the crime database accessibility. Rest four modules are useful for crime detection, criminal identification and prediction, and crime verification, respectively. Crime detection is analyzed using k-means clustering, which iteratively generates two crime clusters that are based on similar crime attributes. Google map improves visualization to k-means. Criminal identification and prediction is analyzed using KNN classification. Crime verification of our results is done using WEKA®. WEKA® verifies an accuracy of 93.62 and 93.99 % in the formation of two crime clusters using selected crime attributes. Our approach contributes in the betterment of the society by helping the investigating agencies in crime detection and criminals’ identification, and thus reducing the crime rates.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

Notes

  1. Jessica Lal murder case, http://www.ndtv.com/topic/jessica-lal Accessed on April 28, 2013.

  2. Nithari case, http://wcd.nic.in/nitharireport.pdf Accessed on April 30, 2013.

  3. Mumbai blast, http://www.ipcs.org/pdf_file/issue/SR71-Final.pdf Accessed on April 28, 2013.

  4. Delhi rape case, http://www.dailymail.co.uk/news/article-2269725/Indian-teenager-accused-Delhi-gang-rape-faces-maximum-year-jail-term.html Accessed on April 28, 2013.

  5. National Crime Records Bureau, http://ncrb.gov.in Accessed on March 10, 2013.

  6. Committee to Protect Journalists, http://www.cpj.org Accessed on March 20, 2013.

  7. Crime alert, http://www.crimealert.org Accessed on March 25, 2013.

  8. NSW bureau of crime statistics and research, http://www.bocsar.nsw.gov.au Accessed on March 22, 2013.

  9. Jin F, Wang W, Xiao Y, Pan Z Proposal of Crime Data Mining Project. https://filebox.vt.edu/users/xykid/dataAnalysisProject/Checkpoint-II_Jin_Xiao_Pan_Wang.pdf. Accessed on May 30, 2013.

  10. Netbeans, http://netbeans.org Accessed on April 2, 2013.

  11. WEKA manual, http://www.inf.ufpr.br/lesoliveira/aprendizado/wekamanual.pdf Accessed on March 15, 2013.

  12. Crime wave in India, http://www.merinews.com/article/the-crime-wave-in-india/132433.shtml Accessed on April 18, 2013.

References

  • Adderley R, Townsley M, Bond J (2007) Use of data mining techniques to model crime scene investigator performance. Knowl Based Syst 20(2):170–176

    Article  Google Scholar 

  • Becker RF (2008) Criminal investigation, 3rd edn. Jones and Bartlett Learning Publishers, Burlington, MA

    Google Scholar 

  • Brantingham PJ, Brantingham PL (1984) Patterns in crime. Macmillan/McGraw-Hill School Division, New York

    Google Scholar 

  • Chen H, Chung W, Xu JJ, Wang G, Qin Y, Chau M (2004) Crime data mining: a general framework and some examples. Comput IEEE 37(4):50–56

    Article  Google Scholar 

  • Ehlers D (1998) Predicting crime: a statistical glimpse of the future? Nedbank ISS Crime Index. Halfway House: Inst for Security Stud 2(2)

  • Gorr W, Harries R (2003) Introduction to crime forecasting. Int J Forecast 19(4):551–555

    Article  Google Scholar 

  • Gorr W, Olligschlaegerb A, Thompson Y (2003) Short-term forecasting of crime. Int J Forecast 19(4):579–594

    Article  Google Scholar 

  • Green EJ, Booth CE, Biderman MD (1976) Cluster analysis of burglary M/Os. J Police Sci Adm 4(4):382–388

    Google Scholar 

  • Han J, Kamber M, Pei J (2012) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Publishers Inc, San Francisco, CA

    Google Scholar 

  • Hornik K, Feinerer I, Kober M, Buchta C (2012) Spherical k-means clustering. J Stat Softw Am Stat Assoc 50(10):1–22

    Google Scholar 

  • Hussain KZ, Durairaj M, Farzana GRJ (2012) Criminal behavior analysis by using data mining techniques. In: International conference on advances in engineering, science and management, IEEE, pp 656–658

  • Kaur N, Sahiwal JK, Kaur N (2012) Efficient k-means clustering algorithm using ranking method in data mining. Int J Adv Res Comput Eng Technol (IJARCET) 1(3):85–91

    Google Scholar 

  • Korukonda AP (2007) Technique without theory or theory from technique? An examination of practical, philosophical and foundational issues in data mining. AI Soc 21(3):347–355

    Article  Google Scholar 

  • Kulis B, Jordan MI (2011) Revisiting k-means: new algorithms via bayesian nonparametrics. In: 29th International conference machine learning. Omnipress, Edinburgh, pp 513–520

  • Li X, Juhola M (2014) Country crime analysis using self organizing map, with special regard to demographic factors. AI Soc 29(1): 53–68

    Google Scholar 

  • Malathi A, Baboo SS (2011) Evolving data mining algorithms on the prevailing crime trend—an intelligent crime prediction model. Int J Sci Eng Res 2(6)

  • Malathi A, Babboo SS, Anbarasi A (2011) An intelligent analysis of a city crime data using data mining. In: International conference information electronic engineering, vol 6. IACSIT Press, Singapore, pp 130–134

  • Mande U, Srinivas Y, Murthy JVR (2012a) Feature specific criminal mapping using data mining techniques and generalized gaussian mixture model. Int J Comput Sci Commun Netw 2(3):375–379

    Google Scholar 

  • Mande U, Srinivas Y, Murthy JVR (2012b) An intelligent analysis of crime data using data mining & auto correlation models. Int J Eng Res Appl (IJERA) 2(4):149–153

    Google Scholar 

  • Nath SV (2006) Crime pattern detection using data mining. International conference on web intelligence and intelligent agent technology, IEEE/WIC/ACM, pp 41–44

  • Okonkwo RO, Enem FO (2011) Combating crime and terrorism using data mining techniques. In: 10th International conference IT people centred development, Nigeria Computer Society, Nigeria

  • Sayal R, Kumar VV (2011) A novel similarity measure for clustering categorical data sets. Int J Comput Apps 17(1):25–30

    Google Scholar 

  • Siegel D, Bunt H, Zaitch D (2003) Global organized crime: trends and developments. Kluwer, London

    Book  Google Scholar 

  • Thongtae P, Srisuk S (2008) An analysis of data mining applications in crime domain. In: 8th International conference on computer and information technology workshops (ICCIT), IEEE, pp 122–126

  • Visher CA, Weisburd D (1998) Identifying what works: recent trends in crime prediction strategies. Crime Law Soc Change 28:223–242

    Article  Google Scholar 

  • Witten IH, Hall MA, Frank E (2011) Data mining: practical machine learning tools and techniques, 3rd edn. Morgan Kaufmann Publishers, Los Altos, CA

    Google Scholar 

  • Yu G, Shao S, Luo B (2008) Mining crime data by using new similarity measure. In: 2nd International conference genetic and evolutionary computing (WGEC), IEEE, Washington, USA, pp 389–392

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arti Jain.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tayal, D.K., Jain, A., Arora, S. et al. Crime detection and criminal identification in India using data mining techniques. AI & Soc 30, 117–127 (2015). https://doi.org/10.1007/s00146-014-0539-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00146-014-0539-6

Keywords

Navigation