A Rule-Based Expert System to Assess Coronary Artery Disease Under Uncertainty

Hossain, Sohrab; Sarma, Dhiman; Chakma, Rana Joyti; Alam, Wahidul; Hoque, Mohammed Moshiul; Sarker, Iqbal H.

doi:10.1007/978-981-15-6648-6_12

A Rule-Based Expert System to Assess Coronary Artery Disease Under Uncertainty

Sohrab Hossain⁹,
Dhiman Sarma¹⁰,
Rana Joyti Chakma¹⁰,
Wahidul Alam¹¹,
Mohammed Moshiul Hoque¹² &
…
Iqbal H. Sarker¹²

Conference paper
First Online: 19 July 2020

1883 Accesses
13 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1235))

Abstract

The coronary artery disease (CAD) occurs from the narrowing and damaging of major blood vessels or arteries. It has become the most life-threatening disease in the world, especially in the South Asian region. Its detection and treatment involve expensive medical facilities. The early detection of CAD, which is a major challenge, can minimize the patients’ suffering and expenses. The major challenge for CAD detection is incorporating numerous factors for detailed analysis. The goal of this study is to propose a new Clinical Decision Support System (CDSS) which may assist doctors in analyzing numerous factors more accurately than the existing CDSSs. In this paper, a Rule-Based Expert System (RBES) is proposed which involves five different Belief Rules, and can predict five different stages of CAD. The final output is produced by combining all BRBs and by using the Evidential Reasoning (ER). Performance evaluation is measured by calculating the success rate, error rate, failure rate and false omission rate. The proposed RBES has higher a success rate and false omission rate than other existing CDSSs.

You have full access to this open access chapter, Download conference paper PDF

1 Introduction

Coronary artery disease (CAD) is a condition when the coronary arteries become narrow or blocked. It is developed when bad cholesterols and plaques (fatty droplets) deposit inside the wall of arteries. The process is termed as atherosclerosis which means clogging of arteries, and reduces blood flow inside the heart muscle. Blood carries oxygen and essential nutrients to the heart [1]. Lack of sufficient blood supply can cause angina (chest pain), and lead to a heart attack by injuring heart muscle. The death toll due to heart disease is 16.3 million in America each year which has made it the leading cause of death in the United States. According to the American Heart Association (AHA), one person is suffered from a heart attack in every 40 s. Having zero risk factors of heart disease, any male has 3.6% and any female has less than 1% chance of getting cardiovascular disease in his/her lifetime. Moreover, the chances are 37.5% and 18.3 respectively [2] for having 2 risk factors. In Bangladesh, CAD is responsible for a 17% mortality rate [3]. The regular diagnostic approach of CAD relies on coronary angiogram test [4], echo-cardiogram ram (ECG) [5, 6], nuclear scan test and exercise stress test. ECG and exercise stress do not produce sustainable results for CAD prediction due to their non-invasiveness properties and numerous biases. Moreover, walking on a trade mill in a stress test makes the patient discomfort heart function than normal condition. Nowadays, support vector machine (SVM) [7, 8] and artificial neural network (ANN) [5, 8,9,10,11,12,13,14,15,16,17] based Clinical Decision Support Systems (CDSS) [18,19,20] are developed for CAD prediction. Unfortunately, SVM and ANN have no direct impact on the reasoning process due to their black-box- type modeling approaches. As a result, the degree of significance of individual factors cannot be resolved. So, human judgment and clinical data are both two essential factors for CAD diagnosis. For this purpose, CDSS combines both historical data and doctors’ domain specific knowledge. But clinical data, like clinical domain knowledge, signs, and symptom, contain various uncertainties [21,22,23], and pose challenges for selecting domain knowledge to construct knowledge base. Moreover finding the reasoning under uncertainty requires an excellent computational algorithm. To mitigate the challenges, researches introduced different CDSSs, based on the fuzzy interface system and Bayesian interface system, which also has limitations [10, 24,25,26]. In this paper, the proposed expert system can predict CAD by five classifications according to the severity. They are as follows:

Class A: (Normal or zero sign of heart disease).

Class B: (Unstable angina) - when new symptoms are introduced beside regular stable angina, and appears frequently (mostly when at rest), last long with more severity, and can lead to a heart attack. It can be treated with oral medications (such as nitroglycerine).

Class C: (Non-ST segment elevation myocardial infarction) - echocardiogram does not indicate the symptom of this type of myocardial infarction (MI) but chemical markers in the blood show the damage of heart muscles. The damage may not be significant and artery blockages are usually partial or temporary.

Class D: (ST-segment elevation myocardial infarction) - this type of MI is occurred quickly due to sudden blockage by blood clogging. It can be detected by ECG and chemical markers in the blood, and causes damage to vast heart muscles.

Class E: (Silent ischemia)- Patient with heart disease can be suffered from a sudden heart attack (called silent ischemia) without any prior or early warning, and diabetic patients are common victims of this type [1].

2 Related Research

Researchers recently worked on machine learning and rule-based systems for different purposes [30,31,32,33]. Many researchers developed belief-rule-based interference methodology by using evidential reasoning (RIMER) for CAD diagnosis [18, 27]. The RIMER process uses belief-rule-base for modeling clinical domain knowledge, and applies an evidential reasoning approach for implementing reasoning. Studies show that RIMER based clinical decision support systems are highly efficient in supporting and interacting with clinical domain knowledge under uncertainty. In [28], Multi-Criteria Decision Making Methods were presented for accessing CAD under uncertainty where presence and absence of CAD is predicted through using symptom and signs of CAD. But these approaches report neither the number of blocked arteries nor the significance of severity of the disease [8, 16, 26, 28, 29]. Weak parameters, like signs and symptoms, are used for predicting CAD as well as for predicting the similar types of diseases like mitral regurgitation, dilated cardiomyopathy, congenital heart disease, hyper-tropic cardiomyopathy, myocardial infarction etc. Some researchers developed the Medical Decision Support System (MDSS) to predict CAD. Other proposer polygenic risk scores (PRS), a nonlinear, for CAD prediction with accuracy an 0.92 under the receiver operating curve (AUC) [8].

Experimental analysis reveals that CAD diagnosis and its severity can be predicted significantly through clinical features along with pathological and demographic features [23, 25, 26, 28]. In this paper, we consider all these parameters, and proposed a cooperative-belief-rule based prototype (CDSS) to assist doctors for CAD analysis under uncertainty.

3 Methodology

3.1 Proposed Rule Based Expert System for CAD

In this paper, five separate BRBs are developed based on five distinct feature sets of patients such as i) patients’ pathological features, ii) patients’ physiological features, iii) patients’ demographic features, iv) patients’ behavioral features, and v) patients’ non-modifiable risk factors. The BRBs are as follows:

$$ D_{A} = f_{A} \left( {{\text{S}}, P_{A} } \right) $$

(1)

$$ D_{B} = f_{B} \left( {{\text{T}}, P_{B} } \right) $$

(2)

$$ D_{C} = f_{C} \left( {{\text{X}}, P_{C} } \right) $$

(3)

$$ D_{D} = f_{D} \left( {{\text{Y}}, P_{D} } \right) $$

(4)

$$ D_{E} = f_{E} \left( {{\text{Z}}, P_{E} } \right) $$

(5)

Here, S = {$ a_{1} ,a_{2} , \ldots ..a_{l} $ }, T = {$ b_{1} ,b_{2} , \ldots ..b_{m} $ }, X = {$ c_{1} ,c_{2} , \ldots ..c_{n} $ }, Y = {$ d_{1} ,d_{2} , \ldots ..d_{o} $ }, Z = {$ e_{1} ,e_{2} , \ldots ..e_{p} $ } represent the demographic, physiological, clinical, behavioral, and Non-modifiable features respectively (where l, m, n, o, and p indicate attributes` number for factors).

Suppose that $ P_{A} ,P_{B} , P_{C} ,P_{D} \, {\text{and}}\,P_{E} $ are the corresponding vectors for the five BRBs, and ω = [$ \upomega_{1} ,\upomega_{2} ,\upomega_{3} ,\upomega_{4} ,\upomega_{5} $] represent the weight coefficients to the relative BRB where $ f_{A} , f_{B} ,f_{C} , f_{D} \,{\text{and}}\, f_{E} $ functions are for demographic, physiological, clinical, behavioral, and non-modifiable factors. To calculate the individual matching degree for each rule, the following equation is used:

$$ \alpha_{i,j} = \frac{{u\left( {A_{i,j\, + \,1} } \right) - x_{i} }}{{u\left( {A_{i,j\, + \,1} } \right) - u\left( {A_{i,j} } \right)}} $$

(6)

Where u is utility value, a_ij is individual matching degree, A_ij is j^th referential value for i^th attribute, and x_i is the input for i^th antecedent (Fig. 1).

To calculate activated weight to each rule the following equation is used:

$$ w_{k} = \frac{{\theta_{k} \alpha_{k} }}{{\mathop \sum \nolimits_{i = 1}^{L} \theta_{i} \alpha_{i} }} $$

(7)

Where w_k is the k^th rule’s activation weight and a_k is the interrelation between attributes. To calculate a_k, the following equations is used:

$$ \alpha_{k} = \prod\nolimits_{i = 1}^{M} {\left( {\alpha_{i}^{k} } \right)^{{\bar{\delta }_{i}^{k} }} } $$

(8)

$$ \overline{\delta }_{i} = \frac{{\delta_{i} }}{{max_{i = 1, \ldots ,M} \left( {\delta_{i} } \right)}} $$

(9)

Where $ \bar{\delta }_{i} $ is the antecedent weight and $ \alpha_{i}^{k} $ represents individual matching degrees for i^th attribute. Five separate BRBs to predict CAD are BRB_P, BRB_PH, BRB_D, BRB_B, and BRB_N. BRB_PH considers physiological factors like blood pressure and stress. BRB_P considers pathological factors like blood sugar level, low density lipoprotein, and triglyceride level. BRB_D considers factors like age and body mass index. BRB_B considers behavior factors like diet, smoking, and physical activities. BRB_N considers non-modifiable risk factors like gender, family history, and residential Area.

3.2 Uncertainties in the Attribute

Attributes like blood pressure, stress, blood sugar, lipoprotein, triglyceride, age, body mass index, unhealthy diet, smoking, family history, and race are categorized into five classes, namely Physiological, Pathological, Demographical, Behavioral, and Non-modifiable risk factors. All the attributes have uncertainties at some level except gender attribute (Table 1).

Table 1. Uncertainties in the attributes

Abstract

1 Introduction

2 Related Research

3 Methodology

3.1 Proposed Rule Based Expert System for CAD

3.2 Uncertainties in the Attribute

3.3 Explanation of Antecedent Attributes

Blood Pressure (BP)

Stress Score (SS)

Blood Sugar Level.

Low Density Lipoprotein (LDL)

Age

Body Mass Index (BMI)

Unhealthy Diet

Smoking

Physical Activities

Gender

Family History

Residential Area

3.4 Rule Base

3.5 Data Set Description

4 Result and Discussion

4.1 Success Rate

4.2 Error Rate

4.3 Failure Rate

4.4 False Omission Rate (fOR)

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation