Abstract
The coronary artery disease (CAD) occurs from the narrowing and damaging of major blood vessels or arteries. It has become the most life-threatening disease in the world, especially in the South Asian region. Its detection and treatment involve expensive medical facilities. The early detection of CAD, which is a major challenge, can minimize the patients’ suffering and expenses. The major challenge for CAD detection is incorporating numerous factors for detailed analysis. The goal of this study is to propose a new Clinical Decision Support System (CDSS) which may assist doctors in analyzing numerous factors more accurately than the existing CDSSs. In this paper, a Rule-Based Expert System (RBES) is proposed which involves five different Belief Rules, and can predict five different stages of CAD. The final output is produced by combining all BRBs and by using the Evidential Reasoning (ER). Performance evaluation is measured by calculating the success rate, error rate, failure rate and false omission rate. The proposed RBES has higher a success rate and false omission rate than other existing CDSSs.
You have full access to this open access chapter, Download conference paper PDF
1 Introduction
Coronary artery disease (CAD) is a condition when the coronary arteries become narrow or blocked. It is developed when bad cholesterols and plaques (fatty droplets) deposit inside the wall of arteries. The process is termed as atherosclerosis which means clogging of arteries, and reduces blood flow inside the heart muscle. Blood carries oxygen and essential nutrients to the heart [1]. Lack of sufficient blood supply can cause angina (chest pain), and lead to a heart attack by injuring heart muscle. The death toll due to heart disease is 16.3 million in America each year which has made it the leading cause of death in the United States. According to the American Heart Association (AHA), one person is suffered from a heart attack in every 40 s. Having zero risk factors of heart disease, any male has 3.6% and any female has less than 1% chance of getting cardiovascular disease in his/her lifetime. Moreover, the chances are 37.5% and 18.3 respectively [2] for having 2 risk factors. In Bangladesh, CAD is responsible for a 17% mortality rate [3]. The regular diagnostic approach of CAD relies on coronary angiogram test [4], echo-cardiogram ram (ECG) [5, 6], nuclear scan test and exercise stress test. ECG and exercise stress do not produce sustainable results for CAD prediction due to their non-invasiveness properties and numerous biases. Moreover, walking on a trade mill in a stress test makes the patient discomfort heart function than normal condition. Nowadays, support vector machine (SVM) [7, 8] and artificial neural network (ANN) [5, 8,9,10,11,12,13,14,15,16,17] based Clinical Decision Support Systems (CDSS) [18,19,20] are developed for CAD prediction. Unfortunately, SVM and ANN have no direct impact on the reasoning process due to their black-box- type modeling approaches. As a result, the degree of significance of individual factors cannot be resolved. So, human judgment and clinical data are both two essential factors for CAD diagnosis. For this purpose, CDSS combines both historical data and doctors’ domain specific knowledge. But clinical data, like clinical domain knowledge, signs, and symptom, contain various uncertainties [21,22,23], and pose challenges for selecting domain knowledge to construct knowledge base. Moreover finding the reasoning under uncertainty requires an excellent computational algorithm. To mitigate the challenges, researches introduced different CDSSs, based on the fuzzy interface system and Bayesian interface system, which also has limitations [10, 24,25,26]. In this paper, the proposed expert system can predict CAD by five classifications according to the severity. They are as follows:
Class A: (Normal or zero sign of heart disease).
Class B: (Unstable angina) - when new symptoms are introduced beside regular stable angina, and appears frequently (mostly when at rest), last long with more severity, and can lead to a heart attack. It can be treated with oral medications (such as nitroglycerine).
Class C: (Non-ST segment elevation myocardial infarction) - echocardiogram does not indicate the symptom of this type of myocardial infarction (MI) but chemical markers in the blood show the damage of heart muscles. The damage may not be significant and artery blockages are usually partial or temporary.
Class D: (ST-segment elevation myocardial infarction) - this type of MI is occurred quickly due to sudden blockage by blood clogging. It can be detected by ECG and chemical markers in the blood, and causes damage to vast heart muscles.
Class E: (Silent ischemia)- Patient with heart disease can be suffered from a sudden heart attack (called silent ischemia) without any prior or early warning, and diabetic patients are common victims of this type [1].
2 Related Research
Researchers recently worked on machine learning and rule-based systems for different purposes [30,31,32,33]. Many researchers developed belief-rule-based interference methodology by using evidential reasoning (RIMER) for CAD diagnosis [18, 27]. The RIMER process uses belief-rule-base for modeling clinical domain knowledge, and applies an evidential reasoning approach for implementing reasoning. Studies show that RIMER based clinical decision support systems are highly efficient in supporting and interacting with clinical domain knowledge under uncertainty. In [28], Multi-Criteria Decision Making Methods were presented for accessing CAD under uncertainty where presence and absence of CAD is predicted through using symptom and signs of CAD. But these approaches report neither the number of blocked arteries nor the significance of severity of the disease [8, 16, 26, 28, 29]. Weak parameters, like signs and symptoms, are used for predicting CAD as well as for predicting the similar types of diseases like mitral regurgitation, dilated cardiomyopathy, congenital heart disease, hyper-tropic cardiomyopathy, myocardial infarction etc. Some researchers developed the Medical Decision Support System (MDSS) to predict CAD. Other proposer polygenic risk scores (PRS), a nonlinear, for CAD prediction with accuracy an 0.92 under the receiver operating curve (AUC) [8].
Experimental analysis reveals that CAD diagnosis and its severity can be predicted significantly through clinical features along with pathological and demographic features [23, 25, 26, 28]. In this paper, we consider all these parameters, and proposed a cooperative-belief-rule based prototype (CDSS) to assist doctors for CAD analysis under uncertainty.
3 Methodology
3.1 Proposed Rule Based Expert System for CAD
In this paper, five separate BRBs are developed based on five distinct feature sets of patients such as i) patients’ pathological features, ii) patients’ physiological features, iii) patients’ demographic features, iv) patients’ behavioral features, and v) patients’ non-modifiable risk factors. The BRBs are as follows:
Here, S = {\( a_{1} ,a_{2} , \ldots ..a_{l} \) }, T = {\( b_{1} ,b_{2} , \ldots ..b_{m} \) }, X = {\( c_{1} ,c_{2} , \ldots ..c_{n} \) }, Y = {\( d_{1} ,d_{2} , \ldots ..d_{o} \) }, Z = {\( e_{1} ,e_{2} , \ldots ..e_{p} \) } represent the demographic, physiological, clinical, behavioral, and Non-modifiable features respectively (where l, m, n, o, and p indicate attributes` number for factors).
Suppose that \( P_{A} ,P_{B} , P_{C} ,P_{D} \, {\text{and}}\,P_{E} \) are the corresponding vectors for the five BRBs, and ω = [\( \upomega_{1} ,\upomega_{2} ,\upomega_{3} ,\upomega_{4} ,\upomega_{5} \)] represent the weight coefficients to the relative BRB where \( f_{A} , f_{B} ,f_{C} , f_{D} \,{\text{and}}\, f_{E} \) functions are for demographic, physiological, clinical, behavioral, and non-modifiable factors. To calculate the individual matching degree for each rule, the following equation is used:
Where u is utility value, aij is individual matching degree, Aij is jth referential value for ith attribute, and xi is the input for ith antecedent (Fig. 1).
To calculate activated weight to each rule the following equation is used:
Where wk is the kth rule’s activation weight and ak is the interrelation between attributes. To calculate ak, the following equations is used:
Where \( \bar{\delta }_{i} \) is the antecedent weight and \( \alpha_{i}^{k} \) represents individual matching degrees for ith attribute. Five separate BRBs to predict CAD are BRB_P, BRB_PH, BRB_D, BRB_B, and BRB_N. BRB_PH considers physiological factors like blood pressure and stress. BRB_P considers pathological factors like blood sugar level, low density lipoprotein, and triglyceride level. BRB_D considers factors like age and body mass index. BRB_B considers behavior factors like diet, smoking, and physical activities. BRB_N considers non-modifiable risk factors like gender, family history, and residential Area.
3.2 Uncertainties in the Attribute
Attributes like blood pressure, stress, blood sugar, lipoprotein, triglyceride, age, body mass index, unhealthy diet, smoking, family history, and race are categorized into five classes, namely Physiological, Pathological, Demographical, Behavioral, and Non-modifiable risk factors. All the attributes have uncertainties at some level except gender attribute (Table 1).
3.3 Explanation of Antecedent Attributes
Five different types of attributes have been considered in this research. Explanations of the numerical values of each attribute are as follows:
Physiological Factor
Blood Pressure (BP)
The blood pressure which creates heartbeats is known as BP. For BP, several numerical points namely Usual, Elevated, Hypertension Stage 1, Hypertension Stage 2, Hypertension Stage 3 are reflected and shown in Table 2.
Here, the referential numerical points are presented as in the Eq. (10).
Stress Score (SS)
Intermediate risk of heart problems can be expressed in the SS score. It can be distributed into several referential numerical points, namely regular, mildly irregular, moderately irregular and severely irregular are shown in Table 3.
The referential numerical values can be presented as in Eq. (11).
Pathological Factor
Blood Sugar Level.
It is the amount of sugar in the blood. Five referential points are shown in Table 4 and expressed by the Eq. (12).
Its measures triglycerides amount in blood. Four referential points are described in Table 5 and Eq. (13).
Low Density Lipoprotein (LDL)
It contains both lipid and protein, and carries cholesterol to body tissues. Five referential values related to LDL are shown in Table 6 and Eq. (14).
Demographic Factor
Age
Older people are more likely to be victims of coronary artery disease, especially, after the age of 65 years. Usually, aged people have higher chance of getting CAD (Table 7).
Four referential values, namely, young (< 35 years), mature (35–49 years), old (50–65), and extremely old (E), have been considered in the following equation from the above table.
Body Mass Index (BMI)
It indicates the amount of fat ratio. It is applicable for the age range from 18 to 65. It is the ratio of weight to height (Table 8).
Four referential values, namely, healthy weight (18.5–24.9), overweight (25–29.9), obese (30–39.9), and morbidly obese (>=40), have been considered in the following equation from the above table.
Behavior
Unhealthy Diet
Mediterranean diet can reduce the risk of CAD by 30%. It is mainly plant based food and categorized into four sections shown in Table 10 and expressed by Eq. (17) (Table 9).
Smoking
Smokers or exposers to smoke have a high risk of CAD. Smoking is categorized into four sections, and shown in Table 10 and expressed by Eq. (18).
Physical Activities
Inactive and less active people are at high risk to develop CAD. Physical activities are categorized into four sections which are shown in Table 11 and expressed by Eq. (19).
Non-modifiable risk Factors
Gender
Male has higher risk of CAD than female. Besides, male suffers from CAD in earlier age than female. But after the age of 70 years, both males and females have similar chances of getting heart disease (Table 12).
Family History
If parents have histories of heart disease, children have a high risk of developing CAD. The risk is even higher if parents have suffered before early 50 years of age. The numerical points for the family history are represented by 0 (No history of parent’s heart disease), 1 (History of parent’s heart disease), and 2 (History of parent’s heart disease before age of 50), and expressed in Table 13 and by Eq. (21).
Residential Area
People from mega-cities are more prone to CAD. This is because of a higher rate of diabetes and obesity. On the other hand, people from hill track areas are less likely to develop heart disease. The numerical points for the residential areas are 0(Mega City), (Rural Area), and 2 (Hill track area), and expressed in Table 14 and by Eq. (22).
3.4 Rule Base
All attributes from Eqs. (10) to (22) are applied as input variables to predict the CAD class. Sub rules 1 to 20 are expressed in Table 15 for the two Physiological factors from Eqs. (10) and (11).
A sub rule of the CAD can be shown as:
R3: IF blood pressure is usual AND stress score is significantly irregular
THEN Overall Prediction is
{(Stage 1, 0.6), (Stage 2, 0.3), (Stage 3, 0.1), (Stage 4, 0.0), (Stage 5, 0.0)}
In the R3, the antecedent attributes are and the consequence attributes are. The rule shows that patient with usual blood pressure and significantly irregular stress score has the probability of developing CAD are (Stage 1 is 60%), (Stage 2 is 30%), (Stage 3 is 10%), (Stage 4 is 0%), (Stage 5 is 0%). The summation of all referential values for R3 is (0.6 + 0.3 + 0.1 + 0.0 + 0.0 =) 1. If the summation of all referential values for a particular rule is 1, we can say that the rule is competed. For some missing attributes or ignorance, the summation may be less than 1 and the rule is incomplete [34].
3.5 Data Set Description
Dataset was collected from the National Heart Foundation, Bangladesh with proper authorization. The data set description is shown in Table 16.
4 Result and Discussion
In the binary diagnostic test, a positive or negative diagnosis is made for each patient. When the result of diagnosis is compared to the true condition, we find four possible outcomes: true positive, true negative, false positive, false negative.
4.1 Success Rate
It is the ratio of correctly identified patient’ numbers and total patients. Equation (23) is used to calculate the success rate and average success rate.
4.2 Error Rate
It is the ratio of incorrectly identified patients’ numbers and total patients. Equation (24) is used to calculate the error rate and average error rate.
4.3 Failure Rate
It is the ratio of the number of non-recognized patients to total patients. Equation (25) is used to calculate the failure rate and average failure rate.
4.4 False Omission Rate (fOR)
False omission rate is the ratio the of number of patients identified to a class A to the total number of patients belongs to a particular class except for class A. Eq. (26) is used to calculate the false omission rate and average false omission rate.
Table 17 explains the results obtained by the equation number (23), (24), (25), and (26). Class A is considered as CAD negative patients and the remaining classes are CAD positive patients. It is observed that the success rate of predicting class C type heart disease is the highest (94.08%) among five classes. On the other hand, the success rate of class E prediction is the lowest (50% only). Class E is very hard to predict as most of the time it does show any symptoms.
5 Conclusion
Heart disease is one of the major threats to public health and the reason for the main cause of death worldwide. Although numerous researches are carried out in this area, still there are challenges to diagnose CAD for treatment. In this paper, the proposed expert system results in an average accuracy rate of 89.90% which is the highest among other existing CDSS. The average false omission rate (3.54%) is also the lowest in this system than that of other CDSS. Our test results satisfy one of the main goals of this research. The average failure rate (11.10%) and average error rate (8.81%) also remain as marginal. Class E (silent Ischemia) success rate is the lowest among all classes. The reason is that Class E occurs suddenly without showing any warning signs of heart problems. It was noted that Class E is common to people with diabetes. It requires further research work to investigate whether or not diabetes influences the success rate in Class E type patients. Apart from this, our research concludes that RBES has a higher success rate and false omission rate than other existing CDSS.
References
Coronary Artery Disease. https://my.clevelandclinic.org/health/diseases. Accessed 21 Nov 2016
Benjamin, E.J., et al.: Heart disease and stroke statistics-2017 update: a report from the American Heart Association. Circulation 135, e146–e603 (2017)
Islam, A.M., Mohibullah, A., Paul, T.J.B.H.J.: Cardiovascular disease in Bangladesh: a review. Heart Journal, 31, 80–99 (2016)
Perez Ruiz De Garibay, A., Kellum, J., Honigschnabel, J., Kreymann, B.: Respiratory and metabolic acidosis correction with the advanced organ Support system. Intensive Care Medicine Experimental 7, 56 (2019)
Shadmi, R., Mazo, V., Bregman-Amitai, O., Elnekave, E.: IEEE: Fully-convolutional deep-learning based system for coronary calcium score prediction from non-contrast chest ct. In: 2018 IEEE 15th International Symposium on Biomedical Imaging, pp. 24–28 (2018)
Kirsch, J., Buitrago, I., Mohammed, T.L.H., Gao, T.M., Asher, C.R., Novaro, G.M.: Detection of coronary calcium during standard chest computed tomography correlates with multi-detector computed tomography coronary artery calcium score. Int. J. Cardiovascular Imag. 28, 1249–1256 (2012)
De Vijver, R.V., et el.: In-field detection of Altemaria solani in potato crops using hyperspectral imaging. Comput. Electron. Agric. 168, 105106 (2020)
Gola, D., Erdmann, J., Muller-Myhsok, B., Schunkert, H., Konig, I.R.: Polygenic risk scores outperform machine learning methods in predicting coronary artery disease status. Genetic Epidemiol. 44, 125–138 (2020)
Weber, M., et al.: Automatic identification of crossovers in cryo-EM images of murine amyloid protein A fibrils with machine learning. J. Microscopy 277, 12–22 (2020)
Tang, X.L., Xiao, M.Q., Liang, Y.J., Zhu, H.Z., Li, J.F.: Online updating belief-rule-base using Bayesian estimation. Knowledge-Based Syst. 171, 93–105 (2019)
Penson, A., et al.: Development of genome-derived tumor type prediction to inform clinical cancer care. JAMA Oncol. 6, 84–91 (2020)
Penafiel, S., et al.: IEEE: Associating risks of getting strokes with data from health checkup records using dempster-shafer theory. In: 2018 20th International Conference on Advanced Communication Technology, pp. 239–246 (2018)
Khanna, N.N., et al.: Nonlinear model for the carotid artery disease 10-year risk prediction by fusing conventional cardiovascular factors to carotid ultrasound image phenotypes: A Japanese diabetes cohort study. Echocardiography-A J. Cardiovascular Ultrasound Allied Techniques 36, 345–361 (2019)
Hyer, J.M., et al.: Can we improve prediction of adverse surgical outcomes? development of a surgical complexity score using a novel machine learning technique. J. Am. College Surgeons, pp. 43–52 (2020)
Chicco, D., Jurman, G.: Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone. BMC Med. Inform. Decis. Mak. 20, 16 (2020)
Apostolopoulos, I.D., Groumpos, P.P., Apostolopoulos, D.I.: A medical decision support system for The Prediction Of The Coronary Artery Disease Using Fuzzy Cognitive Maps. In: Kravets, A., Shcherbakov, M., Kultsova, M., Groumpos, P. (eds.) Creativity in Intelligent Technologies and Data Science, vol. 754, pp. 269–283 (2017)
Aggarwal, S., Azad, V., Bhattacharyya, S., De, S., Pan, I., Dutta, P.: A Hybrid System Based on FMM and MLP to Diagnose Heart Disease (2017)
Zhou, Z.-G., et al.: A cooperative belief rule based decision support system for lymph node metastasis diagnosis in gastric cancer. Knowledge-Based Syst. 85, 62–70 (2015)
Zhou, Z.-G., et al.: A bi-level belief rule based decision support system for diagnosis of lymph node metastasis in gastric cancer. Knowledge-Based Syst. 54, 128–136 (2013)
Christo, V.R.E., Nehemiah, H.K., Brighty, J., Kannan, A.: Feature selection and instance selection from clinical datasets using co-operative co-evolution and classification using random forest. IETE Journal of Research
Long, N.C., Meesad, P., Unger, H.: A highly accurate firefly based algorithm for heart disease prediction. Expert Syst. Appl. 42, 8221–8231 (2015)
Li, G.L., Zhou, Z.J., Hu, C.H., Chang, L.L., Zhang, H.T., Yu, C.Q.: An optimal safety assessment model for complex systems considering correlation and redundancy. Int. J. Approximate Reason. 104, 38–56 (2019)
Hossain, M.S., Ahmed, F., Fatema Tuj, J., Andersson, K.: A belief rule based expert system to assess tuberculosis under uncertainty. J. Med. Syst. 41, 43 (2017)
Sparapani, R., Logan, B.R., McCulloch, R.E., Laud, P.W.: Nonparametric competing risks analysis using bayesian additive regression trees. Stat. Methods Med. Res. 29, 57–77 (2020)
Onisko, A., Druzdzel, M.J., Austin, R.M.: Application of Bayesian network modeling to pathology informatics. Diagnostic Cytopathol. 47, 41–47 (2019)
Gupta, A., et al.: Probabilistic graphical modeling for estimating risk of coronary artery disease: applications of a flexible machine-learning method. Med. Decis. Mak. 39, 1032–1044 (2019)
Wang, Y., et al.: Reliability assessment model for industrial control system based on belief rule base. Int. J. Comput. Commun. Control 14, 419–436 (2019)
Kishore, A.H.N., Jayanthi, V.E.: Multi criteria decision making methods to predict the prevalence of coronary artery disease. J. Med. Imag. Health Inform. 8, 719–726 (2018)
Sofian, H., Ming, J.T.C., Mohamad, S., Noor, N.M.: Calcification detection using deep structured learning in intravascular ultrasound image for coronary artery disease. In: 2018 2nd International Conference on BioSignal Analysis, Processing and Systems (ICBAPS), pp. 47–52 (2018)
Sarker, I.H.: Context-aware rule learning from smartphone data: survey, challenges and future directions. J. Big Data 6(1), 1–25 (2019). https://doi.org/10.1186/s40537-019-0258-4
Sarker, I.H., Colman, A., Han, J.: RecencyMiner: mining recency-based personalized behavior from contextual smartphone data. J. Big Data 6(1), 1–21 (2019). https://doi.org/10.1186/s40537-019-0211-6
Sarker, I.H., Colman, A., Han, J., Khan, A.I., Abushark, Y.B., Salah, K.: BehavDT: a behavioral decision tree learning to build user-centric context-aware predictive model. Mobile Netw. Appl. 25, 1–11 (2019). https://doi.org/10.1007/s11036-019-01443-z
Sarker, I.H., Kayes, A.S.M., Watters, P.: Effectiveness analysis of machine learning classification models for predicting personalized context-aware smartphone usage. J. Big Data 6(1), 1–28 (2019). https://doi.org/10.1186/s40537-019-0219-y
Hossain, S., et al.: A belief rule based expert system to predict student performance under uncertainty. In: 2019 22nd International Conference on Computer and Information Technology (ICCIT), pp. 1–6. IEEE (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Hossain, S., Sarma, D., Chakma, R.J., Alam, W., Hoque, M.M., Sarker, I.H. (2020). A Rule-Based Expert System to Assess Coronary Artery Disease Under Uncertainty. In: Chaubey, N., Parikh, S., Amin, K. (eds) Computing Science, Communication and Security. COMS2 2020. Communications in Computer and Information Science, vol 1235. Springer, Singapore. https://doi.org/10.1007/978-981-15-6648-6_12
Download citation
DOI: https://doi.org/10.1007/978-981-15-6648-6_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6647-9
Online ISBN: 978-981-15-6648-6
eBook Packages: Computer ScienceComputer Science (R0)