Abstract
Nowadays, most of the people are suffering from the attack of chronic diseases because of their lifestyle, food habits, and reduction in physical activities. Diabetes is one of the most common chronic diseases being suffered by the people of all ages. As a result, the healthcare sector is generating extensive data containing huge volume, enormous velocity, and a vast variety of heterogeneous sources. In such scenario, scientific solutions offer to harness these massive, heterogeneous and complex datasets to obtain more meaningful information. Moreover, machine learning algorithms can play a tremendous part in creating a statistical prediction-based model. The aim of this paper is to identify the prevalence of diabetes related to long-term complications among patients with type-2 diabetes mellitus. The processing and statistical analysis require machine learning environment known as Scikit-Learn, Pandas for Python, and R-Studio for R. In this work, machine learning approaches such as decision tree, random forest for developing classification system-based prediction model to assess type-2 diabetes mellitus chronic diseases have been studied. Additionally, we have proposed an algorithm which is solely based on random forest and tried to detect the complicated areas of type-2 diabetes patients.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ahmed, K. R. (2009). Incidence of diabetic retinopathy: A 15 year follow up in a hospital population (Bangladesh). Master’s thesis.
Al Jarullah, A. A. (2011). Decision tree discovery for the diagnosis of type II diabetes. In 2011 International Conference on Innovations in Information Technology (pp. 303–307). Piscataway: IEEE.
Cottle, M., Hoover, W., Kanwal, S., Kohn, M., Strome, T., & Treister, N. (2013). Transforming health care through big data strategies for leveraging big data in the health care industry. Institute for Health Technology Transformation. http://ihealthtran.com/big-data-in-healthcare.
Friedman, J., Hastie, T., & Tibshirani, R. (2001). The Elements of Statistical Learning. Springer Series in Statistics (Vol. 1). New York: Springer.
Hersen, M., & Thomas, J. C. (2007) Handbook of Clinical Interviewing with Adults. Los Angeles: Sage Publications.
Knowler, W. C., Barrett-Connor, E., Fowler, S. E., Hamman, R. F., Lachin, J. M., Walker, E. A., et al. (2002). Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. The New England Journal of Medicine, 346(6), 393–403.
Mahmoodi, M., Hosseini-zijoud, S. M., Nabati, S., Modarresi, M., Mehrabian, M., Sayyadi, A., et al. (2013). The effect of white vinegar on some blood biochemical factors in type 2 diabetic patients. Journal of Diabetes and Endocrinology, 4(1), 1–5.
Pan, X. R., Li, G. W., Hu, Y. H., Wang, J. X., Yang, W. Y., An, Z. X., et al. (1997). Effects of diet and exercise in preventing NIDDM in people with impaired glucose tolerance: The Da Qing IGT and diabetes study. Diabetes care, 20(4), 537–544.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Rahim, M. A. (2002). Diabetes in Bangladesh: Prevalence and determinants. Master’s thesis.
Rajesh, K., & Sangeetha, V. (2012). Application of data mining methods and techniques for diabetes diagnosis. International Journal of Engineering and Innovative Technology (IJEIT), 2(3), 224–229.
Saquib, N., Khanam, M. A., Saquib, J., Anand, S., Chertow, G. M., Barry, M., et al. (2013). High prevalence of type 2 diabetes among the urban middle class in Bangladesh. BMC Public Health, 13(1), 1032.
Sharmila, K., & Manickam, S. (2016). Diagnosing diabetic dataset using Hadoop and k-means clustering techniques. Indian Journal of Science and Technology, 9(40), 1–5.
Sharmila, K., & Vethamanickam, S. (2015). Survey on data mining algorithm and its application in healthcare sector using Hadoop platform. International Journal of Emerging Technology and Advanced Engineering, 5(1), 2250–2459. ISSN 2250-2459.
Srivastava, A., Han, E. H., Kumar, V., & Singh, V. (1999). Parallel formulations of decision-tree classification algorithms. In High Performance Data Mining (pp. 237–261). Berlin: Springer.
Vaz, N. C., Ferreira, A., Kulkarni, M., Vaz, F. S., & Pinto, N. (2011). Prevalence of diabetic complications in rural Goa, India. Indian Journal of Community Medicine: Official Publication of Indian Association of Preventive & Social Medicine, 36(4), 283.
Wang, L., Ranjan, R., Kołodziej, J., Zomaya, A., & Alem, L. (2015). Software tools and techniques for big data computing in healthcare clouds. Future Generation Computer Systems, 43(C), 38–39.
Wikipedia Contributors. (2018). Body mass index — Wikipedia, the free encyclopedia. https://en.wikipedia.org/w/index.php?title=Body_mass_index&oldid=893812047. Accessed 24 April 2018.
Wild, S., Roglic, G., Green, A., Sicree, R., & King, H. (2004). Global prevalence of diabetes: Estimates for the year 2000 and projections for 2030. Diabetes Care, 27(5), 1047–1053.
Xu, W., Zhang, J., Zhang, Q., & Wei, X. (2017). Risk prediction of type II diabetes based on random forest model. In 2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB) (pp. 382–386). Piscataway: IEEE.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Younus, M., Munna, M.T.A., Alam, M.M., Allayear, S.M., Ara, S.J.F. (2020). Prediction Model for Prevalence of Type-2 Diabetes Mellitus Complications Using Machine Learning Approach. In: Alhajj, R., Moshirpour, M., Far, B. (eds) Data Management and Analysis. Studies in Big Data, vol 65. Springer, Cham. https://doi.org/10.1007/978-3-030-32587-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-32587-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32586-2
Online ISBN: 978-3-030-32587-9
eBook Packages: EngineeringEngineering (R0)