Abstract
Diabetes is a dangerous disease that can cause death without warning. In this paper we introduce a system that assists people by raising an alert so that precautions can be taken. It is a prediction system for diabetes that predicts whether a person is a candidate for the disease and, if so, at what age. The datasets come from Egyptian diabetes patients; two thirds of the data are used for training and one third for testing. The system is based on machine learning and uses the decision tree technique. Unlike previous papers, which focused on classification to answer only a yes-or-no question, this paper also adds a regression technique with a randomization code to predict the age. The results are promising: the system predicts the age at which diabetes occurs with an accuracy of 84%.
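The abstract names three ingredients: a 2/3 to 1/3 train/test split, a decision tree, and a regression step that predicts an age. The following is a minimal sketch of how those pieces could fit together, not the authors' implementation. The feature names (BMI, fasting glucose), the synthetic data generator, and every function name are hypothetical, and the data are randomly generated for illustration only. The "randomization code" mentioned in the abstract is not described here, so the sketch uses only a seeded shuffle for the split.

```python
import random
from statistics import mean

def split_two_thirds(rows, seed=0):
    """Shuffle a copy of rows and return (train, test) in a 2/3 : 1/3 split."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = (2 * len(shuffled)) // 3
    return shuffled[:cut], shuffled[cut:]

def sse(values):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)

def build_tree(rows, depth=0, max_depth=3, min_size=5):
    """Fit a small regression tree on rows of (features, target).

    Each split minimises the summed squared error of the two children.
    Returns a nested dict for an internal node or a number for a leaf.
    """
    targets = [t for _, t in rows]
    if depth >= max_depth or len(rows) < 2 * min_size:
        return mean(targets)
    best = None
    for j in range(len(rows[0][0])):
        for threshold in sorted({f[j] for f, _ in rows}):
            left = [r for r in rows if r[0][j] <= threshold]
            right = [r for r in rows if r[0][j] > threshold]
            if len(left) < min_size or len(right) < min_size:
                continue
            cost = sse([t for _, t in left]) + sse([t for _, t in right])
            if best is None or cost < best[0]:
                best = (cost, j, threshold, left, right)
    if best is None:  # no split satisfies min_size
        return mean(targets)
    _, j, threshold, left, right = best
    return {
        "feature": j,
        "threshold": threshold,
        "left": build_tree(left, depth + 1, max_depth, min_size),
        "right": build_tree(right, depth + 1, max_depth, min_size),
    }

def predict(tree, features):
    """Walk the tree from the root to a leaf and return the leaf value."""
    while isinstance(tree, dict):
        go_left = features[tree["feature"]] <= tree["threshold"]
        tree = tree["left"] if go_left else tree["right"]
    return tree

def make_synthetic_rows(n, seed=1):
    """ASSUMPTION: made-up data, not the Egyptian patient datasets."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        bmi = rng.uniform(20, 40)        # hypothetical feature
        glucose = rng.uniform(80, 160)   # hypothetical feature
        age = 90 - 0.6 * bmi - 0.15 * glucose + rng.gauss(0, 3)
        rows.append(((bmi, glucose), age))
    return rows

rows = make_synthetic_rows(150)
train, test = split_two_thirds(rows)
tree = build_tree(train)
mae = mean(abs(predict(tree, f) - t) for f, t in test)
print(f"train={len(train)} test={len(test)} mean abs error={mae:.1f} years")
```

The sketch reports mean absolute error on the held-out third. How the paper's 84% accuracy figure is computed for an age prediction is not stated in the abstract, so no attempt is made to reproduce it.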
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Orabi, K.M., Kamal, Y.M., Rabah, T.M. (2016). Early Predictive System for Diabetes Mellitus Disease. In: Perner, P. (ed.) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2016. Lecture Notes in Computer Science, vol 9728. Springer, Cham. https://doi.org/10.1007/978-3-319-41561-1_31
DOI: https://doi.org/10.1007/978-3-319-41561-1_31
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41560-4
Online ISBN: 978-3-319-41561-1
eBook Packages: Computer Science (R0)