1 Introduction

An electrocardiogram (ECG) is a non-invasive examination that is widely used to reveal potential heart conditions: the electrical activity of the patient's heart is recorded directly and reflects its rhythm status. The ECG is appropriate for the diagnosis and treatment of various types of heart disease [9], and its automatic classification can provide an objective diagnosis and reduce diagnosis time [6]. Therefore, the detection and classification of ECG signals are of great clinical significance and also help to advance clinical research on cardiovascular diseases [8, 15]. Various feature classifiers have been developed to automatically detect ECG signals [27]. However, most methods extract features manually and then classify heartbeats with traditional classifiers [18], so achieving high accuracy requires considerable time to find and compute the best combination of features.

Deep learning has achieved considerable success in computer vision [30] owing to its robust feature extraction ability. In particular, the convolutional neural network (CNN) is the most widely used deep learning model [2, 13, 16, 19], attaining robust results in medical imaging, gene recognition, speech processing, sleep apnea detection and other applications [4, 28]. However, in ECG classification and detection, existing methods still exhibit three shortcomings: (1) the complexity of algorithms for QRS waves; (2) the complex rhythmic or morphological changes of irregular heartbeats, which make ECG feature recognition difficult; and (3) the need for large training samples and long training time to achieve the desired recognition accuracy [1, 22]. Given these problems in automatic ECG classification, this study proposes an automated arrhythmia classification approach based on a representative CNN whose initial parameters are optimized by differential evolution (DE).

The contributions of this study are as follows: (1) applying deep learning to ECG signal detection and classification, which removes the manual feature extraction required by traditional machine learning methods; (2) using the DE algorithm to optimize the CNN parameters, which improves the CNN's classification accuracy and training time; (3) enabling more accurate and automatic detection of arrhythmia with deep learning.

The remainder of this paper is organized as follows. Section 2 describes the studies related to CNN in ECG recognition. Section 3 presents the methodology of our proposed one-dimensional CNN (1D-CNN) and its optimization method, and Sect. 4 describes the experimental setup. Section 5 analyses and discusses the experimental results. Section 6 presents the conclusions and future work.

2 Background

Early detection is beneficial for diagnosing and treating various heart diseases and ensures a high survival rate of patients [5]. Examples of heart diseases are atrial premature beat, ventricular premature beat, left bundle branch block (LBBB) and right bundle branch block (RBBB). Many effective methods for automatically detecting cardiovascular diseases (CVDs) from ECG signals have been developed in the past few decades. For example, in the automatic detection of atrial fibrillation (AF), several detectors identify CVDs based on the absence of the P wave or the variation of the RR interval (the interval between successive R peaks, which mark the beginning of ventricular depolarization). Dash et al. proposed an automatic AF detection algorithm based on the randomness, variability and complexity of the heartbeat interval time series. Lian et al. presented an AF detection algorithm based on RR interval scatter plots and their variations [11]. In the method of [11], morphological features are extracted by a wavelet transform and independent component analysis. The wavelet features consist of fourth-order approximation coefficients and third- and fourth-order detail coefficients, while the features extracted by independent component analysis comprise a set of independent source signals recovered from the observed samples [14].

Although various feature classifiers have been developed, most involve manual extraction and traditional classification. These methods require considerable time to determine the best feature combination for high accuracy. Moreover, feature extraction in ECG signal processing requires expert knowledge of digital signal processing, so feature extraction or selection presents a challenge to researchers inside and outside the medical field. To overcome this drawback, several researchers have begun using neural networks to extract heartbeat features automatically. Escalona-Morán et al. presented a classification method based on the convolution of 2D heartbeats [10]: a series of three adjacent beats is converted into a 2D coupling matrix, enabling the convolution filter to capture the continuous waveform of adjacent heartbeats and the correlation between beats. This method achieved a final sensitivity of 76.8%, with positive predictive values of 74.0% for supraventricular ectopic beats (SVEB) and 93.8% for ventricular ectopic beats (VEB).

Deep learning, which exhibits strong feature extraction capability, has achieved considerable success in computer vision in recent years. In particular, the CNN model is the most widely used deep learning model, demonstrating robustness in its applications in medical imaging, gene recognition, speech processing, sleep apnea detection and other aspects. Accordingly, the use of deep neural networks (DNN) to automatically detect ECG signals has gained research interest. Salem et al. developed an automatic AF detection method based on CNN [23], where AF features are automatically learned and applied to the classification module. This method simplifies the feature extraction without requiring expert feature engineering to determine the suitability and criticality of features.

Bhagyalakshmi et al. developed a genetic bat-assisted support vector neural network (GB-SVNN) to classify ECG arrhythmia [3], obtaining a final accuracy of 0.9696 and sensitivity of 0.99. Zhang et al. proposed a multi-scale CNN (MCNN) that performs a timescale transformation of the input signals and detects AF from the transformed input [31], finding that network depth strongly correlates with detection performance.

Although the above methods are experimentally effective for specific CVD detection problems, their good performance typically relies on carefully selected clean data or a small number of subjects, so their applicability may be limited. Achieving a generalization capability that reliably detects CVDs from limited single-lead ECG records therefore remains a considerable challenge. Such generalization depends on how the CNN model is trained and usually requires a longer training time. In the present study, we propose using DE to optimize the initial weights of the 1D-CNN to ensure its generalization and reduce its training time.

3 Methodology

This section covers (1) the data source and signal preprocessing, (2) the implementation of the 1D-CNN and (3) the optimization of the 1D-CNN by the standard differential evolution (SDE) algorithm. These three aspects introduce the implementation and optimization methods for ECG classification. The specific process is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of ECG classification and recognition based on CNN

Figure 1 shows the entire process of ECG classification with the CNN, which starts from preprocessing the hexadecimal raw ECG files. After preprocessing, we obtain intuitive waveforms and extract the heartbeat segments to build the data set. With sufficient data extracted from the databases, the data set is divided into a 75% training set and a 25% test set. We then train the CNN repeatedly and obtain the required classification results.

3.1 Data source and signal preprocessing

The experimental data for this study are single-lead ECG signals from the MIT-BIH arrhythmia and Sudden Cardiac Death Holter (SCDH) databases [12], which are used to evaluate the proposed method. The MIT-BIH database contains ECG records sampled at 360 Hz in 30-min units, and the SCDH contains records sampled at 250 Hz. Unlike the equal-length records in MIT-BIH, the record lengths in the SCDH vary: for example, record No. 30 spans approximately 24 h, whereas record No. 52 spans approximately 7 h. This study needs data organized by heartbeat, so we extract the heartbeats from the records before the experiment. The process is divided into signal preprocessing and heartbeat location and capture.

3.1.1 Signal preprocessing

The ECG signals are preprocessed so that the experimental data retain only the frequencies and characteristics related to arrhythmia. Such preprocessing also eliminates noise and other interference and thus isolates the waveform mode. Accurate heartbeat capture therefore requires preprocessing of the ECG signals.

The preprocessing consists of four steps: band-pass filtering, ‘double slope’ preprocessing, waveform smoothing and window sliding. In signal filtering, the ECG signal is filtered by a 40-order FIR band-pass filter with a passband of 15–25 Hz, as suggested in the literature [25]. After band-pass filtering, the ‘double slope’ preprocessing is applied to make the waveform more prominent. Then, waveform smoothing is performed with a low-pass filter with a cut-off frequency of 5 Hz. Finally, a sliding window is applied to the waveform to increase its amplitude and smooth it further. The width of the sliding window is set to 17 sampling points.
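To make the pipeline concrete, the following is a minimal Python sketch of the four steps (assuming numpy/scipy; the ‘double slope’ step here is a hypothetical variant of slope-based QRS enhancement, since the paper does not give its exact formula):

```python
import numpy as np
from scipy.signal import firwin, lfilter

FS = 360  # MIT-BIH sampling rate in Hz; SCDH records use 250 Hz

def preprocess(ecg, fs=FS):
    """Band-pass -> 'double slope' -> smoothing -> sliding-window integration."""
    # 1) 40-order FIR band-pass filter with a 15-25 Hz passband (41 taps)
    bp = firwin(41, [15, 25], pass_zero=False, fs=fs)
    x = lfilter(bp, 1.0, ecg)

    # 2) 'Double slope': emphasize steep QRS flanks by combining the largest
    #    upward and downward slopes around each sample (assumed formulation)
    n = max(int(0.015 * fs), 1)  # ~15 ms slope window; assumed value
    y = np.zeros_like(x)
    for i in range(n, len(x) - n):
        left = (x[i] - x[i - n:i]).max() / n
        right = (x[i] - x[i + 1:i + n + 1]).max() / n
        y[i] = max(left + right, 0.0)

    # 3) Waveform smoothing with a 5 Hz low-pass filter
    lp = firwin(41, 5, fs=fs)
    y = lfilter(lp, 1.0, y)

    # 4) Sliding-window integration over 17 sampling points
    return np.convolve(y, np.ones(17) / 17, mode='same')
```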

Figure 2 compares the waveforms before and after preprocessing of the raw ECG data. After preprocessing, the original ECG signal shows single-mode peaks, each corresponding to a QRS wave. Compared with the original signal, the preprocessed signal is easier to locate and detect, thereby attaining the purpose of preprocessing.

Fig. 2
figure 2

Waveform comparison of record No. 100 in MIT-BIH before and after preprocessing

3.1.2 Heartbeat location and capture

Adaptive double threshold for QRS peak location

This study uses the double-threshold method [26] for the QRS peak detection logic. The detection starts when a wave peak exceeds the low threshold. The thresholds are lowered when the peak lies between the high and low thresholds and raised when the peak exceeds the high threshold. A peak below the low threshold is assumed to be noise.
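A minimal sketch of this adaptive logic in Python follows; the `decay` and `grow` factors are assumed values for illustration, not taken from the paper:

```python
import numpy as np

def detect_qrs(sig, th_low, th_high, decay=0.9, grow=1.1):
    """Adaptive double-threshold peak detection (sketch of the logic above)."""
    peaks = []
    for i in range(1, len(sig) - 1):
        # candidate peak: local maximum above the low threshold
        if sig[i] > th_low and sig[i] >= sig[i - 1] and sig[i] >= sig[i + 1]:
            peaks.append(i)
            if sig[i] > th_high:
                # confident detection: raise both thresholds
                th_low, th_high = th_low * grow, th_high * grow
            else:
                # weak detection (between thresholds): lower the thresholds
                th_low, th_high = th_low * decay, th_high * decay
        # peaks below the low threshold are treated as noise and ignored
    return peaks
```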

The sensitivity (SE) and positive predictive rate (P +) are used to evaluate the QRS detection algorithm. The evaluation indicators are defined as follows [21]:

$$\left\{\begin{array}{c}SE=\frac{TP}{TP+FN}\\ P+=\frac{TP}{TP+FP}\end{array}\right. ,$$
(1)

where \(TP\) represents the number of correctly detected beats, \(FN\) the number of missed beats and \(FP\) the number of falsely detected beats.

Heartbeat capture

Based on the located QRS peak, 100 sampling points are taken to its left and 150 sampling points to its right, so a single heartbeat with a total length of 250 sampling points is intercepted.
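A short sketch of this segmentation step (assuming numpy arrays and the peak indices from the detector above):

```python
import numpy as np

def capture_beats(sig, r_peaks, left=100, right=150):
    """Cut one 250-sample heartbeat around each located QRS peak:
    100 points to the left, 150 to the right, as in the text."""
    beats = []
    for r in r_peaks:
        if r - left >= 0 and r + right <= len(sig):
            beats.append(sig[r - left:r + right])
    return np.asarray(beats)  # shape: (n_beats, 250)
```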

The manually annotated type codes of the various heartbeats are stored in the MIT-BIH and SCDH arrhythmia databases. The heartbeat types comprise the normal beat (Normal), left bundle branch block beat (LBBB), right bundle branch block beat (RBBB), premature ventricular contraction (PVC), aberrated atrial premature beat (Aab), left or right bundle branch block (BBB), R-on-T premature ventricular contraction (RONT), nodal (junctional) premature beat (NPC) and premature or ectopic supraventricular beat (SVPB).

Tables 1 and 2 show the final heartbeat statistics. LBBB, RBBB, PVC and Normal are the four categories with the largest sample sizes among the 41 heartbeat types in the MIT-BIH arrhythmia database. In the SCDH database, Normal, PVC, SVPB and BBB are the four categories with the largest sample sizes. For the accuracy evaluation, we select these two groups of heartbeats as the training and testing data of the 1D-CNN.

Table 1 Summary of heartbeat data in MIT-BIH
Table 2 Summary of heartbeat data in SCDH

3.2 Implementation of 1D-CNN

We use a 1D-CNN because it is commonly applied to classify electrodermal activity signals and works similarly to the two-dimensional CNN (2D-CNN) [24]. The 1D-CNN receives lower-dimensional input elements, which leads to a simpler architecture [24]. Therefore, this study adopts the 1D-CNN model as the classifier according to the characteristics and data scales of the ECG signals in the MIT-BIH and SCDH arrhythmia databases. Designed on the basis of the 2D-CNN and its backpropagation (BP) training, the 1D-CNN predicts and classifies the ECG signals of different patients. This section briefly reviews the above process and explains the modifications needed to transform a 2D-CNN into a 1D-CNN.

The convolutional and pooling layers are stacked alternately to simplify the CNN and its network parameters, as shown in Table 3.

Table 3 Parameter design of 1D-CNN network

3.2.1 Forward propagation

Forward propagation is divided into four situations, as follows:

  1. (1)

    Forward propagation from the input layer to the convolution layer

In the CNN, forward propagation of the input layer is the first step. Generally, the input layer connects to a convolution layer, and the dimension of the convolution kernel corresponds to the input dimensionality. For example, if the input is a black-and-white image, then the corresponding convolution kernel is a 2D square matrix.

  2. (2)

    Forward propagation from the hidden layer to the convolution layer

The forward propagation of the hidden layer to the convolution layer is very similar to that of the input layer to the convolution layer. The only difference is that the input is from the hidden layer rather than the matrix formed by the original image samples.

  3. (3)

    Forward propagation from the hidden layer to the pooling layer

The processing logic of the pooling layer is relatively simple: its purpose is only to reduce and summarize the input matrix. For example, if the input is an N × N square matrix and the pooling size is k × k, then the size of the output matrix is \(\frac{N}{k}\times \frac{N}{k}\). The two pooling layers of the 1D-CNN use average pooling with sizes of 5 and 3, respectively (see the sketch at the end of this subsection).

  4. (4)

    Forward propagation from the hidden layer to the fully connected layer

The forward propagation from the hidden layer to the fully connected layer puts the multiple outputs together as the input of the fully connected layer. For example, if the outputs of the hidden layers are three N × N square matrices, then the input length of the fully connected layer is 3 × N × N. After computing the fully connected layer, the final output is obtained by applying the \(softmax\) activation function.
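A minimal numpy sketch of the two layer types just described (hypothetical helper names; `W` and `b` stand for the fully connected layer's weight matrix and bias):

```python
import numpy as np

def avg_pool1d(x, k):
    """Non-overlapping 1D average pooling: length N -> length N // k.
    The two pooling layers of this 1D-CNN use k = 5 and k = 3 (Sect. 3.2)."""
    n = len(x) // k
    return x[:n * k].reshape(n, k).mean(axis=1)

def fc_softmax(feature_maps, W, b):
    """Fully connected layer plus softmax: the hidden-layer outputs are put
    together into one vector, passed through the linear layer and turned
    into class probabilities."""
    x = np.concatenate([np.ravel(f) for f in feature_maps])
    z = W @ x + b
    e = np.exp(z - z.max())  # numerically stable softmax
    return e / e.sum()
```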

3.2.2 Backpropagation

BP uses the gradient descent algorithm to update the weights and offsets of the CNN. Since the output error must be propagated backward step by step, each layer must keep the intermediate error variables (delta) and the derivative of its activation function.

3.2.3 Update the weights and offsets of each layer

The update is divided into two situations: the hidden layers and the fully connected layer. In the first situation, given that the pooling layer has no weights or biases, only the convolutional layer is updated.

3.2.4 Changes needed for 1D-CNN implementation

The most crucial difference between the 1D-CNN and the 2D-CNN is that the weights and the input and output elements are 1D vectors rather than matrices. Therefore, the matrix operations must be changed accordingly: convolution, pooling, full connection and error calculation all need to be reformulated for the 1D case. At the same time, \(conv2D\) in the forward propagation and backpropagation must be changed to \(conv1D\).
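For illustration, a ‘valid’ 1D convolution replacing \(conv2D\) could look as follows (a sketch, not the paper's Matlab implementation):

```python
import numpy as np

def conv1d(x, kernel):
    """'Valid' 1D convolution: both the input and the kernel are 1D vectors.
    Equivalent to np.convolve(x, kernel, mode='valid')."""
    k = len(kernel)
    return np.array([np.dot(x[i:i + k], kernel[::-1])  # flipped kernel
                     for i in range(len(x) - k + 1)])
```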

3.3 Optimization of 1D-CNN based on SDE

The DE algorithm is another excellent optimizer alongside genetic algorithms, particle swarm optimization and other evolutionary algorithms. It has a simple structure, few control parameters, easy real-valued coding and a fast convergence speed, which has been proven theoretically [20]. According to how the parameters are set, DE can be divided into SDE and adaptive DE. Considering algorithm complexity and running-time cost, this study selects the SDE algorithm, which has a relatively simple structure, to optimize the initial parameters of the 1D-CNN. In SDE, the vector containing the D optimization variables is called an individual and the \(i\mathrm{th}\) individual is expressed as:

$${X}_{i,G}=\left[{x}_{1,i,G},\cdots ,{x}_{j,i,G},\cdots ,{x}_{D,i,G}\right] ,$$
(2)

where \(i=1,2,\cdots ,NP\), \(NP\) is the population size, \(G\) is the evolution generation and \(j\) indexes the \(j\mathrm{th}\) optimization variable.

Figure 3 shows the optimization flow chart of the SDE for 1D-CNN and its basic operations include initialization, mutation, crossover and selection. Individuals are randomly generated in the search space by initialization and new individuals are generated by mutation and crossover. The selection determines the individuals entering the next generation. This process is repeated until the termination condition is reached.

Fig. 3
figure 3

Flow chart of SDE in optimizing the initial weights of the 1D-CNN

3.3.1 Mutation

When the population evolves to the \(G\)th generation, the mutation operation is carried out on the parent individual \({X}_{i,G}\) to obtain the mutated individual:

$${V}_{i,G+1}={X}_{r1,G}+F\cdot \left({X}_{r2,G}-{X}_{r3,G}\right) .$$
(3)

The subscripts \(r1\), \(r2\) and \(r3\) are mutually different integers randomly selected between 1 and \(NP\) and different from \(i\). \({X}_{r1,G}\) is called the base vector, \(\left({X}_{r2,G}-{X}_{r3,G}\right)\) is the difference vector and \(F\) is the mutation operator. If a parameter of the mutated individual exceeds the boundary, its value is replaced by the boundary value.

The mutation strategy used by the SDE is usually called DE/rand/1, where ‘rand’ means that the base vector is randomly selected from the population and ‘1’ is the number of difference vectors. The DE/rand/1 strategy has good global convergence but the disadvantage of a slow convergence speed.
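A minimal sketch of DE/rand/1 in Python (the boundary values lo and hi are assumed; Sect. 4.2 initializes weights in (−1, 1)):

```python
import numpy as np

def mutate(pop, i, F, lo=-1.0, hi=1.0):
    """DE/rand/1 mutation (Eq. 3): V = X_r1 + F * (X_r2 - X_r3), with r1, r2,
    r3 mutually different and different from i; out-of-range entries are
    replaced by the boundary value as described in the text."""
    NP = len(pop)
    r1, r2, r3 = np.random.choice([j for j in range(NP) if j != i],
                                  size=3, replace=False)
    v = pop[r1] + F * (pop[r2] - pop[r3])
    return np.clip(v, lo, hi)
```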

3.3.2 Crossover

The trial individuals generated by the crossover operation are as follows:

$${U}_{i,G+1}=\left[{u}_{1,i,G+1},\cdots ,{u}_{j,i,G+1},\cdots ,{u}_{D,i,G+1}\right] ,$$
(4)
$${u}_{j,i,G+1}=\left\{\begin{array}{ll}{v}_{j,i,G+1}&\mathrm{if}\;{r}_{j}\left[0,1\right)\le CR\;\mathrm{or}\;j=r\left(i\right)\\{x}_{j,i,G}&\mathrm{otherwise}\end{array}\right. ,$$
(5)

where \({r}_{j}\left[0,1\right)\) is the random number drawn for the \(j\)th variable and \(CR\) is the crossover operator. \(r(i)\) is a randomly selected integer between 1 and \(D\), which ensures that \({U}_{i,G+1}\) obtains at least one variable from \({V}_{i,G+1}\).
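A sketch of this binomial crossover, continuing the mutation sketch above:

```python
import numpy as np

def crossover(x, v, CR):
    """Binomial crossover (Eq. 5): each variable comes from the mutant v with
    probability CR; one index r(i) is forced so that at least one variable
    of the trial vector is taken from v."""
    D = len(x)
    mask = np.random.rand(D) < CR
    mask[np.random.randint(D)] = True  # the forced index j == r(i)
    return np.where(mask, v, x)
```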

3.3.3 Selection

For the minimization problem, the individual with the smaller objective function value is selected from the trial individual \({U}_{i,G+1}\) and the parent individual \({X}_{i,G}\) to enter the population of the next generation:

$${X}_{i,G+1}=\left\{\begin{array}{ll}{U}_{i,G+1}&\mathrm{if}\;f\left({U}_{i,G+1}\right)<f\left({X}_{i,G}\right)\\{X}_{i,G}&\mathrm{otherwise}\end{array}\right. ,$$
(6)

where \(f\left(X\right)\) represents the objective function (written as \(f\) to avoid confusion with the mutation operator \(F\)). In the experiments below, we use the accuracy of heartbeat type prediction to evaluate the optimization effect of the model; thus, the positive prediction rate P + of the 1D-CNN is used as the fitness function, to be maximized.
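Combining the three operators, one SDE generation can be sketched as follows (using the `mutate` and `crossover` sketches above; since the P + fitness is maximized here, the comparison of Eq. (6) is flipped):

```python
def evolve(pop, fitness, F=0.5, CR=0.9):
    """One SDE generation. `fitness` evaluates a chromosome, e.g. the P+ of
    the 1D-CNN initialized with these weights; higher is better."""
    scores = [fitness(x) for x in pop]
    for i in range(len(pop)):
        u = crossover(pop[i], mutate(pop, i, F), CR)
        fu = fitness(u)
        if fu > scores[i]:  # greedy selection (Eq. 6, maximization form)
            pop[i], scores[i] = u, fu
    return pop, scores
```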

3.3.4 Control parameters

The control parameters of the SDE mainly include the population size NP, the mutation operator F and the crossover operator CR, which remain unchanged during the evolution [7]. Based on experience, the control parameters of the SDE are selected according to the rules below (a configuration sketch follows the list).

  1. (1)

    Population size. According to experience, the population size is commonly set between \(5D\) and \(10D\) to ensure that the algorithm has enough different mutation vectors.

  2. (2)

    Mutation operator. The mutation operator \(F\in \left[0,2\right]\) is a real constant factor that determines the amplification of the difference vector. The size of \(F\) is negatively related to the convergence rate; if the population converges prematurely, \(F\) should be increased. In this study, the initial value of \(F\) is set to 0.5.

  3. (3)

    Crossover operator. The crossover operator \(CR\in [0,1]\) is a real constant factor whose size is positively related to the convergence speed. Its initial value is set to 0.9 to prevent overfitting.

  4. (4)

    Maximum evolutionary generation. The initial value of this parameter is 30, and control experiments with 30 and 50 generations are carried out to ensure reliability while shortening the experimental time.
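Collecting these rules, the control parameters used in this study can be summarized in one configuration sketch (values taken from the text; Sect. 4.2 sets NP = 30):

```python
# SDE control parameters (fixed during the evolution); values from the text.
SDE_PARAMS = {
    "NP": 30,     # population size (rule of thumb 5D-10D; 30 used in Sect. 4.2)
    "F": 0.5,     # mutation operator, F in [0, 2]
    "CR": 0.9,    # crossover operator, CR in [0, 1]
    "G_max": 30,  # maximum evolutionary generations (30 or 50 in experiments)
}
```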

3.3.5 Conversion of weight into the chromosome representation

A matrix reshaping operation splits the organizational structure of the corresponding weights in the 1D-CNN, which are then flattened and mapped into the chromosome representation of DE. The structure mapping from chromosome to weights is shown in Fig. 4, where \({\widehat{\mathrm{X}}}_{\mathrm{s}*d}\) represents the mapping of the chromosome to a weight, the subscript s represents the size of the weight and d represents the weight dimension.

Fig. 4
figure 4

Mapping from chromosome to weights
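A minimal sketch of this two-way mapping (hypothetical helper names; `shapes` lists the shape of each weight array of the 1D-CNN):

```python
import numpy as np

def weights_to_chromosome(weights):
    """Flatten every weight array of the 1D-CNN into one chromosome vector."""
    return np.concatenate([w.ravel() for w in weights])

def chromosome_to_weights(chrom, shapes):
    """Split the chromosome back into arrays of the original shapes
    (the mapping of Fig. 4)."""
    out, pos = [], 0
    for s in shapes:
        n = int(np.prod(s))
        out.append(chrom[pos:pos + n].reshape(s))
        pos += n
    return out
```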

4 Experimental setup

In this section, we introduce ECG detection and classification before and after using SDE to optimize the initial weights of 1D-CNN.

4.1 Experimental process before DE optimization

4.1.1 Simulation

The experiments are simulated on the MatlabR2017b platform. All records in the MIT-BIH and SCDH databases are two-lead ECG records, but our simulation experiment only needs the ECG signal of the first lead and the labels marking the corresponding heartbeats.

The most significant advantage of the CNN over traditional machine learning lies in feature extraction and selection, because it automatically acquires features from large data sets without manual design and selection. The data of the input layer take the QRS peak as the midpoint, and each heartbeat is sampled to 250 points around it so that the network learns the morphological structure of the heartbeat waveform.

In the MIT-BIH arrhythmia database, the four types of manually labeled beats, Normal, LBBB, RBBB and PVC, account for over 88% of the total heartbeats in the whole database. To ensure the training effect of the CNN and improve the classification accuracy, we use these four heartbeat types as the first group of data to train the CNN for ECG recognition. For the same reason, the four types of beats in the SCDH database, namely, Normal, PVC, SVPB and BBB, are used to train the 1D-CNN.

The experimental process is as follows:

  1. (1)

    Binary files in the MIT-BIH and SCDH databases are decoded and read based on records.

  2. (2)

    The 48 records of the MIT-BIH arrhythmia database and the 23 records of the SCDH database are scanned globally using the manual labels to extract Normal, LBBB, RBBB and PVC beats from the former and Normal, PVC, SVPB and BBB beats from the latter.

  3. (3)

    From each of the four data types obtained in Steps 1 and 2, 5000 records are taken from MIT-BIH and 1000 records from SCDH and combined into a data set. At the same time, the corresponding label set is constructed.

  4. (4)

    The data set processed in Step 3 is randomly divided into a 75% training set and a 25% test set.

  5. (5)

    According to the parameter settings in Table 3, a 7-layer CNN framework is designed. The initial learning rate is set to 0.01, the batch size to 16 and the epoch to 30.

  6. (6)

    CNN is established and initialized.

  7. (7)

    According to the batch size set in Step 5, the input matrices are constructed and fed into the CNN, and training is started.

4.1.2 Algorithm performance analysis

Evaluation index setting

  1. (1)

    Confusion matrix

The confusion matrix presents the classification results in matrix form: each column represents a predicted category, whose total gives the number of predictions in it, and each row represents an actual category, whose total gives the number of actual instances.

  2. (2)

    Average accuracy

Given that the training and test samples are randomly selected from Tables 1 and 2, each classification result may contain contingencies. The average accuracy evaluates the classification performance of the algorithm and avoids the impact of such contingency on the analysis of results. Ten independent experiments are repeated for the classification algorithm, with 30 epochs per experiment, and the average classification accuracy over the 10 runs is calculated.

  3. (3)

    Loss function

Cross-entropy is used as the loss function to evaluate the gap between the predictions and the labels after each training pass. The simplified formula of cross-entropy is:

$$Loss=-\sum_{i=1}^{n}{y}_{i}{\mathrm{log}}_{2}\left({a}_{i}\right),$$
(7)

where \({y}_{i}\) is the expected output and \({a}_{i}\) is the actual output of the neuron.
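A direct implementation of Eq. (7), with a small epsilon added to avoid log(0) (the epsilon is an implementation detail, not from the paper):

```python
import numpy as np

def cross_entropy(y, a, eps=1e-12):
    """Eq. (7): Loss = -sum_i y_i * log2(a_i), where y is the expected output
    (one-hot label) and a the actual output of the softmax layer."""
    return -np.sum(y * np.log2(np.clip(a, eps, 1.0)))
```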

Performance evaluation

The following experimental results are obtained on the MIT-BIH test set under different activation functions according to the above process. Tables 4, 5 and 6 show the classification results of the 1D-CNN described in Sect. 3.2. These results are the averages of 10 independent experiments.

Table 4 Confusion matrix of 5000 sample test set with sigm as the activation function
Table 5 Confusion matrix of 5000 sample test set with tanh as the activation function
Table 6 Confusion matrix of 5000 sample test set with ReLU as the activation function

The results in Table 4, with sigm as the activation function, show an average classification accuracy of 96.0%, derived from per-class accuracies of Normal 99.8%, LBBB 97.5%, RBBB 97.4% and PVC 89.5%.

The results in Table 5, with tanh as the activation function, show an average classification accuracy of 94.3%, derived from per-class accuracies of Normal 99.5%, LBBB 93.3%, RBBB 97.3% and PVC 87.1%.

The results in Table 6, with ReLU as the activation function, show an average classification accuracy of 98.9%, derived from per-class accuracies of Normal 99.9%, LBBB 99.2%, RBBB 99.5% and PVC 96.9%.

The following experimental results are obtained on the SCDH test set under different activation functions according to the above process. Tables 7, 8 and 9 show the classification results of the 1D-CNN described in Sect. 3.2. These results are the averages of 10 independent experiments.

Table 7 Confusion matrix of 1000 sample test set with sigm as activation function
Table 8 Confusion matrix of 1000 sample test set with tanh as activation function
Table 9 Confusion matrix of 1000 sample test set with ReLU as activation function

The above results on the two databases show that the 1D-CNN described in Sect. 3.2 achieves good ECG recognition with 30 epochs, except when using the sigm function on the SCDH database. Comparing the three activation functions, ReLU yields the highest accuracy, reaching 98.9% and 89.3% on the two databases, respectively.

4.2 Experimental process after DE optimization

4.2.1 Simulation

The simulation experiment based on the SDE is implemented on MatlabR2017b. The optimization objective is the set of initial parameters of the 1D-CNN model built in Sect. 3.2. The 1D-CNN is still trained on the two-lead ECG records from the MIT-BIH and SCDH databases with the corresponding heartbeat labels. Given the vast amount of computation needed, the ECG data set is reduced in this section.

For both the MIT-BIH and SCDH arrhythmia databases, 1000 samples are taken for each of the four signal types, giving 4000 heartbeat samples in total, of which 75% are selected as the training set and 25% as the testing set [29].
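A small sketch of this random 75/25 split (assuming numpy arrays of beats and labels; the seed is an assumption for reproducibility):

```python
import numpy as np

def split_dataset(beats, labels, train_frac=0.75, seed=0):
    """Random 75/25 split of the 4000-beat data set."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(beats))
    n_train = int(train_frac * len(beats))
    tr, te = idx[:n_train], idx[n_train:]
    return beats[tr], labels[tr], beats[te], labels[te]
```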

The experimental process is as follows:

  1. (1)

    A data set with 4000 heartbeats in total is imported and the corresponding label set is built.

  2. (2)

    The data set is randomly divided into a 75% training set and a 25% test set.

  3. (3)

    According to the parameter design in Sect. 3.2, a 1D-CNN is established and initialized, and the parameters of the SDE are set: the population size is set to 30, the maximum evolutionary generation to 30 or 50, the initial value of the mutation operator to 0.5 and the initial value of the crossover operator to 0.9.

  4. (4)

    The total number of parameters to be optimized is calculated and the parameter vector, called the chromosome, is generated.

  5. (5)

    According to the data in Step 4, the population elements are initialized to random numbers in (−1, 1), a population random matrix of size [30, 732] is generated and the corresponding fitness values are initialized.

  6. (6)

    The chromosome variables in DE are split and transformed into the organizational structure of the corresponding weights in the 1D-CNN and then assigned to those weights.

  7. (7)

    The optimal fitness and the optimal parameter vector of the population in this iteration are calculated through mutation, crossover and selection. In addition, the positive prediction rate P + of the 1D-CNN is used as the fitness function, as mentioned in Sect. 3.3.3.

  8. (8)

    Step 7 is repeated until the end of the iteration. The final fitness and parameter vector are recorded and the parameter vector is split and assigned to 1D-CNN.

4.2.2 Optimization analysis of standard differential evolution algorithm

Given that the fitness function is the positive prediction rate of the 1D-CNN, it can be used to analyse the optimization performance of the algorithm. To keep the problem general, no parameters are adjusted other than the mutation operator \(F\) and the crossover operator \(CR\).

  1. (1)

    Optimization analysis based on 30 evolutionary generations

According to the literature, mutation operator values below 0.4 or above 1.2 are only occasionally effective [17]. Therefore, in the experiment, we choose F = 0.5 and \(CR\) = 0.9 as the initial values and test the optimization results step by step with the parameter combinations F = 0.5, 0.8, 1.0, 1.2 and \(CR\) = 0.9, 0.8.

Although the final evaluation standard is the P + result of the 1D-CNN, the average accuracy is still used to evaluate the classification performance of the algorithm. Ten independent experiments are carried out with each DE parameter combination, with 10 epochs per experiment. Table 10 shows the optimization results on the MIT-BIH arrhythmia database, reported as the average P + over the 10 repeated experiments. Table 11 shows the corresponding optimization results on the SCDH database, also averaged over 10 repeated experiments.

Table 10 Statistics of SDE optimization results based on 30 generations with MIT-BIH
Table 11 Statistics of SDE optimization results based on 30 generations with SCDH
  2. (2)

    Optimization analysis based on 50 evolutionary generations

Consistent with the parameter settings for 30 evolutionary generations, F = 0.5 and CR = 0.9 are selected as the initial values and the optimization results are tested step by step with the parameter combinations F = 0.5, 0.8, 1.0, 1.2 and CR = 0.9, 0.8. Tables 12 and 13 show the corresponding optimization results on the MIT-BIH and SCDH arrhythmia databases.

Table 12 Statistics of SDE optimization results based on 50 generations with MIT-BIH
Table 13 Statistics of SDE optimization results based on 50 generations with SCDH

As shown in the figures below, for 30 generations, Fig. 5 shows the fitness lines of the different SDE parameter configurations under the three activation functions on the MIT-BIH database and Fig. 6 shows the corresponding fitness lines on the SCDH database. Similarly, for 50 generations, Fig. 7 shows the fitness lines of the different DE parameter combinations under the three activation functions on the MIT-BIH database and Fig. 8 shows the fitness lines on the SCDH database.

Fig. 5
figure 5

Fitness line based on 30 generations with MIT-BIH. a sigm, b tanh, c ReLU

Fig. 6
figure 6

Fitness line based on 30 generations with SCDH. a sigm, b tanh, c ReLU

Fig. 7
figure 7

Fitness line based on 50 generations with MIT-BIH. a sigm, b tanh, c ReLU

Fig. 8
figure 8

Fitness line based on 50 generations with SCDH. a sigm, b tanh, c ReLU

The four parameter configurations are represented by four colors in the figures. Comparing the fitness lines measured by P + between 30 and 50 generations shows further improvements in convergence at later generations. For example, with the sigm activation function, parameter configuration 4 improves convergence at 48 generations, whereas the remaining configurations have already converged. Configuration 1 shows improved convergence for tanh, while configuration 2 shows improved convergence for ReLU at later generations. The optimal fitness associated with each activation function is thus contributed by different parameter configurations under different maximum generations.


5 Results and comparative analysis of 1D-CNN before and after optimization

According to the statistics of the SDE optimization results in Tables 10, 11, 12 and 13, the optimal SDE parameter configurations associated with each activation function for 30 and 50 generations are used to determine the initial weights of the 1D-CNN. Then, the performances of the optimized and unoptimized 1D-CNNs at the selected epochs are recorded. Tables 14 and 15 show the P + of the 1D-CNN at different training stages before and after optimization by the SDE for 30 and 50 generations on the MIT-BIH and SCDH databases, respectively. The better results between the optimized and unoptimized 1D-CNNs are bolded.

Table 14 P + comparison of different stages before and after DE with MIT-BIH
Table 15 P + comparison of different stages before and after DE with SCDH

Referring to Tables 14 and 15, comparing the unoptimized 1D-CNN after 30 epochs of training with the optimized 1D-CNN after 10 epochs shows that the latter is equally good or better regardless of the number of generations. The reason is that the optimized initial parameters of the 1D-CNN have structural features, so when they are used in training, the gradient descent algorithm in BP converges faster on the loss function, resulting in fewer training epochs and higher accuracy. The results in Tables 14 and 15 also show that the final optimization result of 50 generations is slightly better than that of 30 generations under the same parameters. Therefore, the initial weights produced by the optimal parameters of 50 generations are used to analyse the processing time of the SDE-optimized 1D-CNN. With a total of 4000 samples, Tables 16 and 17 show the processing times of the optimized and unoptimized 1D-CNNs at the selected epochs on the two databases. The results are the averages of 10 independent experiments.

Table 16 Time statistics of different stages before and after DE with MIT-BIH
Table 17 Time statistics of different stages before and after DE with SCDH

On the MIT-BIH and SCDH databases, the accuracy of the optimized 1D-CNN after 10 epochs equals or surpasses that of the unoptimized 1D-CNN after 30 epochs. Therefore, we mainly compare the times consumed to reach these two accuracies; the shorter time is bolded. According to the corresponding columns in Tables 16 and 17, the required time decreases significantly. Moreover, this reduction grows with the model complexity and the total number of samples; in other words, a higher model complexity and a larger sample size make the effect of DE optimization more apparent.

The results in Tables 18 and 19 show that on both MIT-BIH and SCDH, the different combinations of the mutation operator F and the crossover operator CR determine the convergence rate and optimization effect in the SDE stage: the value of F correlates negatively and the value of CR positively with the convergence rate. However, the best classification accuracy of the 1D-CNN does not necessarily come from the best optimization result of the SDE stage. The reason is the difference between the two optimization strategies: DE performs stochastic optimization based on an evolutionary process, while the 1D-CNN performs gradual optimization based on gradient descent. Therefore, when the classification accuracy of the 1D-CNN is not very high, practical development must avoid blindly pursuing a larger DE optimization effect, which only wastes computing resources. Repeated experiments and cyclic training over the DE parameter combinations are a reasonable way to determine the better combinations and achieve better final classification accuracy and convergence.

Table 18 Parameter comparison of different activation functions with MIT-BIH
Table 19 Parameter comparison of different activation functions with SCDH

6 Conclusions and future works

In this study, we propose a 1D-CNN method for ECG classification with initial weights optimized by the SDE. Comparing the experimental results before and after optimization shows the feasibility of using DE to optimize the initial parameters of the 1D-CNN. The specific results are as follows:

With a total of 4000 samples, the accuracy of the optimized 1D-CNN after 10 epochs reaches or exceeds that of the unoptimized 1D-CNN after 30 epochs of training. The 1D-CNN using the ReLU activation function shows the highest accuracy. After 10 epochs, the accuracy improves from 97.6% to 99.5% on the MIT-BIH arrhythmia database and from 80.2% to 88.5% on the SCDH. The training time decreases from 28.12 s to 9.22 s and from 28.96 s to 10.35 s, respectively. Overall, on the two databases the optimized 1D-CNN improves accuracy by 1.9% and 8.3% and reduces training time by 67.2% and 64.2%, respectively.

Therefore, the original 1D-CNN has a faster convergence speed and less training time and the classification accuracy improves further through SDE optimization. One of the experimental findings in this study shows that different parameter configurations of SDE affect the 1D-CNN's accuracy differently with the active functions. Future research can then focus on adaptive differential optimization to determine the parameters corresponding to the active functions of 1D-CNN and improve the effectiveness and efficiency of ECG classification.