Skip to main content

Prediction Model for Prevalence of Type-2 Diabetes Mellitus Complications Using Machine Learning Approach

  • Chapter
  • First Online:
Data Management and Analysis

Part of the book series: Studies in Big Data ((SBD,volume 65))

Abstract

Nowadays, most of the people are suffering from the attack of chronic diseases because of their lifestyle, food habits, and reduction in physical activities. Diabetes is one of the most common chronic diseases being suffered by the people of all ages. As a result, the healthcare sector is generating extensive data containing huge volume, enormous velocity, and a vast variety of heterogeneous sources. In such scenario, scientific solutions offer to harness these massive, heterogeneous and complex datasets to obtain more meaningful information. Moreover, machine learning algorithms can play a tremendous part in creating a statistical prediction-based model. The aim of this paper is to identify the prevalence of diabetes related to long-term complications among patients with type-2 diabetes mellitus. The processing and statistical analysis require machine learning environment known as Scikit-Learn, Pandas for Python, and R-Studio for R. In this work, machine learning approaches such as decision tree, random forest for developing classification system-based prediction model to assess type-2 diabetes mellitus chronic diseases have been studied. Additionally, we have proposed an algorithm which is solely based on random forest and tried to detect the complicated areas of type-2 diabetes patients.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ahmed, K. R. (2009). Incidence of diabetic retinopathy: A 15 year follow up in a hospital population (Bangladesh). Master’s thesis.

    Google Scholar 

  2. Al Jarullah, A. A. (2011). Decision tree discovery for the diagnosis of type II diabetes. In 2011 International Conference on Innovations in Information Technology (pp. 303–307). Piscataway: IEEE.

    Chapter  Google Scholar 

  3. Cottle, M., Hoover, W., Kanwal, S., Kohn, M., Strome, T., & Treister, N. (2013). Transforming health care through big data strategies for leveraging big data in the health care industry. Institute for Health Technology Transformation. http://ihealthtran.com/big-data-in-healthcare.

    Google Scholar 

  4. Friedman, J., Hastie, T., & Tibshirani, R. (2001). The Elements of Statistical Learning. Springer Series in Statistics (Vol. 1). New York: Springer.

    Google Scholar 

  5. Hersen, M., & Thomas, J. C. (2007) Handbook of Clinical Interviewing with Adults. Los Angeles: Sage Publications.

    Book  Google Scholar 

  6. Knowler, W. C., Barrett-Connor, E., Fowler, S. E., Hamman, R. F., Lachin, J. M., Walker, E. A., et al. (2002). Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. The New England Journal of Medicine, 346(6), 393–403.

    Article  Google Scholar 

  7. Mahmoodi, M., Hosseini-zijoud, S. M., Nabati, S., Modarresi, M., Mehrabian, M., Sayyadi, A., et al. (2013). The effect of white vinegar on some blood biochemical factors in type 2 diabetic patients. Journal of Diabetes and Endocrinology, 4(1), 1–5.

    Google Scholar 

  8. Pan, X. R., Li, G. W., Hu, Y. H., Wang, J. X., Yang, W. Y., An, Z. X., et al. (1997). Effects of diet and exercise in preventing NIDDM in people with impaired glucose tolerance: The Da Qing IGT and diabetes study. Diabetes care, 20(4), 537–544.

    Article  Google Scholar 

  9. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.

    MathSciNet  MATH  Google Scholar 

  10. Rahim, M. A. (2002). Diabetes in Bangladesh: Prevalence and determinants. Master’s thesis.

    Google Scholar 

  11. Rajesh, K., & Sangeetha, V. (2012). Application of data mining methods and techniques for diabetes diagnosis. International Journal of Engineering and Innovative Technology (IJEIT), 2(3), 224–229.

    Google Scholar 

  12. Saquib, N., Khanam, M. A., Saquib, J., Anand, S., Chertow, G. M., Barry, M., et al. (2013). High prevalence of type 2 diabetes among the urban middle class in Bangladesh. BMC Public Health, 13(1), 1032.

    Article  Google Scholar 

  13. Sharmila, K., & Manickam, S. (2016). Diagnosing diabetic dataset using Hadoop and k-means clustering techniques. Indian Journal of Science and Technology, 9(40), 1–5.

    Article  Google Scholar 

  14. Sharmila, K., & Vethamanickam, S. (2015). Survey on data mining algorithm and its application in healthcare sector using Hadoop platform. International Journal of Emerging Technology and Advanced Engineering, 5(1), 2250–2459. ISSN 2250-2459.

    Google Scholar 

  15. Srivastava, A., Han, E. H., Kumar, V., & Singh, V. (1999). Parallel formulations of decision-tree classification algorithms. In High Performance Data Mining (pp. 237–261). Berlin: Springer.

    Google Scholar 

  16. Vaz, N. C., Ferreira, A., Kulkarni, M., Vaz, F. S., & Pinto, N. (2011). Prevalence of diabetic complications in rural Goa, India. Indian Journal of Community Medicine: Official Publication of Indian Association of Preventive & Social Medicine, 36(4), 283.

    Article  Google Scholar 

  17. Wang, L., Ranjan, R., Kołodziej, J., Zomaya, A., & Alem, L. (2015). Software tools and techniques for big data computing in healthcare clouds. Future Generation Computer Systems, 43(C), 38–39.

    Article  Google Scholar 

  18. Wikipedia Contributors. (2018). Body mass index — Wikipedia, the free encyclopedia. https://en.wikipedia.org/w/index.php?title=Body_mass_index&oldid=893812047. Accessed 24 April 2018.

    Google Scholar 

  19. Wild, S., Roglic, G., Green, A., Sicree, R., & King, H. (2004). Global prevalence of diabetes: Estimates for the year 2000 and projections for 2030. Diabetes Care, 27(5), 1047–1053.

    Article  Google Scholar 

  20. Xu, W., Zhang, J., Zhang, Q., & Wei, X. (2017). Risk prediction of type II diabetes based on random forest model. In 2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB) (pp. 382–386). Piscataway: IEEE.

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muhammad Younus .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Younus, M., Munna, M.T.A., Alam, M.M., Allayear, S.M., Ara, S.J.F. (2020). Prediction Model for Prevalence of Type-2 Diabetes Mellitus Complications Using Machine Learning Approach. In: Alhajj, R., Moshirpour, M., Far, B. (eds) Data Management and Analysis. Studies in Big Data, vol 65. Springer, Cham. https://doi.org/10.1007/978-3-030-32587-9_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-32587-9_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-32586-2

  • Online ISBN: 978-3-030-32587-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics