A Systematic Survey of Data Augmentation of ECG Signals for AI Applications

Rahman, Md Moklesur; Rivolta, Massimo Walter; Badilini, Fabio; Sassi, Roberto

doi:10.3390/s23115237

Open AccessArticle

A Systematic Survey of Data Augmentation of ECG Signals for AI Applications

¹

Dipartimento di Informatica, Università degli Studi di Milano, 20133 Milan, Italy

²

School of Nursing, University of California, San Francisco, CA 94143, USA

³

AMPS-LLC, New York, NY 10025, USA

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(11), 5237; https://doi.org/10.3390/s23115237

Submission received: 27 April 2023 / Revised: 23 May 2023 / Accepted: 27 May 2023 / Published: 31 May 2023

(This article belongs to the Special Issue ECG Signal Processing Techniques and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

AI techniques have recently been put under the spotlight for analyzing electrocardiograms (ECGs). However, the performance of AI-based models relies on the accumulation of large-scale labeled datasets, which is challenging. To increase the performance of AI-based models, data augmentation (DA) strategies have been developed recently. The study presented a comprehensive systematic literature review of DA for ECG signals. We conducted a systematic search and categorized the selected documents by AI application, number of leads involved, DA method, classifier, performance improvements after DA, and datasets employed. With such information, this study provided a better understanding of the potential of ECG augmentation in enhancing the performance of AI-based ECG applications. This study adhered to the rigorous PRISMA guidelines for systematic reviews. To ensure comprehensive coverage, publications between 2013 and 2023 were searched across multiple databases, including IEEE Explore, PubMed, and Web of Science. The records were meticulously reviewed to determine their relevance to the study’s objective, and those that met the inclusion criteria were selected for further analysis. Consequently, 119 papers were deemed relevant for further review. Overall, this study shed light on the potential of DA to advance the field of ECG diagnosis and monitoring.

Keywords:

ECG augmentation; artificial intelligence; electrocardiogram; AI in cardiology; data augmentation

1. Introduction

Cardiovascular diseases (CVDs) are significant contributors to worldwide fatalities, accounting for approximately 33% of all deaths across the world [1]. Because of its low cost, simplicity, and non-invasive nature, the electrocardiogram (ECG) is the most used technique for diagnosing and monitoring CVDs in both clinical and telemedicine settings [2]. The ECG provides specific information about the structure and electrical activity of the heart by detecting alterations in the shape and timing of the captured waveforms. In clinical applications, prompt and precise ECG interpretation is essential. For instance, in patients suffering from myocardial infarction (MI), the quicker the abnormal condition is detected, the higher the chance of avoiding threats to life and recovery. However, due to the complexity of the ECG signal and the overlapping noise, diagnosing certain conditions may take a long time (for instance, for rare events where 24 h or longer Holter recordings are necessary) and require significant human review [3]. Moreover, due to the unique individual characteristics of the ECG, features may vary significantly in different patients under different physiological conditions, posing considerable difficulties for the recognition of ECG patterns. Signal interpretation can thus be a time-consuming and complex process which leads to subjective ambiguity and human mistakes in their analysis, even for highly-trained specialists. To tackle these problems, automatic systems for interpreting ECG signals have been developed since the 1960s [4].

The accurate and automatic detection of CVDs from ECG signals is a topic of relevant clinical interest, particularly with the increasing spread of new wearable technology, the modern implementation of artificial intelligence (AI), and the growth of digital health solutions. Computer-assisted analysis of ECG signals allows achieving objective results with less room for inter-operator and operator-specific errors. The three main components of an automated system for ECG interpretation are signal preprocessing, feature extraction, and classification. In the preprocessing phase, signals are denoised, segmented, and normalized. In the feature extraction phase, time domain and frequency domain features are extracted. Lastly, the extracted features are used to classify the signals into distinct diseases.

In recent times, AI, but especially deep learning (DL), has shown remarkable results in the classification of ECG signals [5,6]. This is likely due to the robust capability of DL for feature extraction. Deep neural networks consist of a composition of layers (i.e., mathematical functions), which makes DL highly adept at processing complex biomedical signals, including ECG signals. The composition, indeed, resembles the mathematical steps typically employed in traditional ECG signal processing algorithms. For example, the famous Pan–Tompkins algorithm [7], which detects the timing of each heartbeat within an ECG signal, involves the use of a first filter, a rectification operation, and a second filter: these steps are exactly what is commonly employed in DL architectures as a sequence of convolutional layers and non-linear activation functions. Moreover, with each layer, more abstract and high-level features can be extracted which can be leveraged to identify CVDs. As a result, compared with conventional machine learning (ML), and provided sufficient expression capability, DL potentially reaches a better ability to represent the most relevant features of complex data.

DL models are data-hungry by nature, i.e., the performance of a DL model relies heavily on the availability of high-quality labeled datasets and a large number of training samples. Training a DL model with insufficient data or an unbalanced dataset may result in poor performance, a non-converging training phase, and biased classification outcomes. To circumvent these issues, the DL model needs a large and balanced dataset. However, obtaining a large number of training samples and a balanced dataset is difficult due to, for example, the low incidence rate of abnormal cardiac events and the availability of expert cardiologists to accurately label (annotate) the waveforms. In addition, the number of ECG recordings annotated by cardiologists is limited because only expert physicians are capable of accurately annotating the recordings. To guarantee performance, the building of a DL model is frequently accompanied by a data augmentation (DA) technique, which provides additional and presumably non-redundant training data by deforming the training set. DA aims to improve the generalization ability of trained models by reducing overfitting and broadening the decision boundary of the model.

In image recognition, DA has already reached a stable state. To improve the performance, most state-of-the-art convolutional neural network (CNN) architectures use some form of DA technique. For example, residual networks (ResNet) use color augmentation, scaling, and cropping [8], DenseNet uses mirroring and translation [9], and inception networks use mirroring and cropping [10]. Unfortunately, in the case of ECG data, such kinds of random transformations are not effective because the relative amplitudes of different cardiac beat segments (P waves, QRS complex, T waves, ST segments etc.) carry relevant information for the diagnosis. For example, time inversion assumes that it is normal that the sequence of ECG waves is reversed. Although these can be effective for augmenting the data (e.g., spectral), these techniques are likely not effective for ECGs. Another example could be randomly cropping and merging the ECG data which may easily transform the normal (sinus) rhythm into an arrhythmic pattern.

To the best of our knowledge, no comprehensive review of DA techniques of AI applications on ECG signals has been carried out yet. Since its practical significance and potential for the development of ECG classification models, we considered the DA techniques of the AI methods applied in ECG classification worth a review. In this study, papers were systematically reviewed and their characteristics were highlighted. By reviewing all methods, a taxonomy of the ECG DA methods is proposed for the first time and illustrated in Figure 1. The taxonomy mainly breaks down into two categories: basic DA techniques and advanced DA techniques. The basic DA techniques include all methods based on random and non-random transformations of the ECG signal in the magnitude domain, time domain, and time–magnitude domain. Conversely, advanced DA techniques include alternative methods to model the data such as, for example, statistical models and learning-based models. Advanced DA techniques follow the distributions of features present in the dataset to produce novel pattern combinations. Many statistical models, such as the Gaussian mixture model (GMM) and Markov chain (MC) models have been proposed for generating new samples [11,12,13]. Learning-based models such as variational autoencoder (VAE) [14] and generative adversarial networks (GANs) [15], are relatively new models developed recently for the generation of ECGs. The paper is structured as follows:

A comprehensive review of the latest techniques for analyzing ECG signals using DA methods.
A detailed taxonomy and categorization of ECG DA techniques, along with their various applications, datasets, and AI techniques.
A comprehensive discussion of research gaps and open issues in the field that need further investigation.

2. Method

2.1. Literature Search Strategy

A comprehensive search of the published research was conducted through three databases: IEEE Xplore, PubMed, and Web of Science. The search criteria covered a broad spectrum of aspects, including signal types (i.e., ECG), various AI techniques, and diverse DA techniques. To ensure the relevance and accuracy of the findings, only articles published in the last ten years (between 1 January 2013 and 31 January 2023) in peer-reviewed English language journals, conference proceedings, chapters, and magazine articles were included. The comprehensive search query, including the specific search terms used, is detailed in Table 1.

2.2. Study Selection

In order to ensure a rigorous and systematic approach to select articles, we followed the guidelines set forth by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [16]. First, we utilized reference manager software to eliminate the duplicates. We then thoroughly screened the remaining works by evaluating titles and abstracts, and subsequently conducted a full reading of the selected papers to apply inclusion/exclusion criteria. To provide transparency and clarity of the selection process, a flowchart summarizing our research is reported in Figure 2. This method enabled us to effectively narrow down our options and identify studies that were directly relevant to our research objectives.

2.3. Results of the Research

Implementing the search query and inclusion/exclusion criteria, we initially obtained a list with 625 articles; we then excluded 193 duplicates using reference manager software or manual review. After screening the remaining 432 papers based on their titles and abstracts, 350 papers were selected for full-text evaluation according to our inclusion/exclusion criteria. The inclusion and exclusion criteria are presented in Table 2. Ultimately, 119 papers met the criteria and were kept for further analysis.

In the following sections, we present the major outputs of the literature review.

3. ECG Applications and Datasets

3.1. Typical ECG Applications

The most common implementation of AI in ECG analysis is automatic ECG interpretation [14,15,17,18]. Other important applications include localizing and annotating specific rhythms and beats, which can aid in the detection of conditions such as MI [13] and fetal heart rate series classification [19]. Moreover, recent advancements in biometric-based human identification show great promise for accurate recognition based on ECG data [20,21]. In addition, ECG analysis can also be utilized for detecting emotions and stress [22], pain [23], sleep-apnea [24,25], identification of COVID-19 infections [26,27,28,29], assessment of signal quality [30,31], and many other potential applications. In this study, we considered all applications as long as they investigated on DA of ECG via AI techniques.

3.2. Datasets

Most of the reviewed studies used a few ECG databases. The MIT-BIH AD database was used in 46% of the studies, followed by Physionet-2017 at 13%, PTB at 7%, Physionet-2020 at 5%, and Physionet-2021 at 3%. INCART, CPSC-2018, and PTB-XL were used in 2% of the studies each. The specific characteristics of these databases are hereafter described in detail.

MIT-BIH AD: The MIT-BIH Arrhythmia Database contains a collection of 48 ECG ambulatory records of two leads, each spanning 30 minutes, gathered during the period from 1975 to 1979 [32]. These recordings, sampled at 360 Hz with 11-bit resolution over a 10 mV range, were collected from 47 individuals who were subjected to testing in the BIH Arrhythmia Laboratory. Within this dataset, we find several different types of cardiac abnormalities (CA), including but not limited to atrial fibrillation (AF), atrial bigeminy, atrial flutter, ventricular premature beat, right bundle branch block (RBBB), and left bundle branch block (LBBB).
PhysioNet-2017: This dataset is a comprehensive collection of 8528 single-lead ECG data records obtained from 3658 individuals [33]. The ECG data are uniformly sampled at a rate of 300 Hz and span a duration of 9 to 61 seconds. The dataset encompasses four distinct rhythm categories, namely normal, AF, noise, and other.
INCART: The St. Petersburg INCART dataset consists of 75 records extracted from 32 24-h Holter recordings where patients were diagnosed with various heart complications such as coronary artery disease, ischemia, conduction abnormalities, and arrhythmia. The records are sampled at a frequency of 257 Hz, ensuring that subtle changes in heart function are captured. Each record spans 30 min and contains 12 standard leads.
CPSC-2018: The China Physiological Signal Challenge-2018 dataset is a comprehensive collection of $6, 877$ recordings of 12-lead ECG data, encompassing a diverse range of patients across genders and medical conditions [34]. The recordings were gathered from 11 hospitals, contributing to data’s diversity. Each ECG recording is sampled at 500 Hz, providing high-resolution physiological signal data for analysis. The recordings range in length from 6 to 60 s. This dataset comprises of nine different types of CAs, including AF, LBBB, RBBB, normal, premature atrial contraction, premature ventricular contraction, intrinsic paroxysmal atrioventricular block, ST-segment depression, and ST-segment elevation.
PTB: The PTB dataset comprises 549 ECG records consisting of 15 leads (12 standard leads and 3 Frank leads) obtained from 290 individuals [35]. The records were sampled at a rate of 1000 Hz with 16-bit resolution. Each individual has up to five records, which allows for a longitudinal view of their health status. Among the subjects, 216 have been diagnosed with one of 8 different types of heart diseases, which include MI, cardiomyopathy/heart failure, bundle branch block, dysrhythmia, myocardial hypertrophy, valvular heart disease, and myocarditis. The remaining 52 individuals represent a healthy control group, which serves as a point of reference for comparison. However, the health status of 22 individuals remains unknown.
PTB-XL: The PTB-XL dataset, a comprehensive collection of clinical ECGs, comprises 21,837 records taken from 18,885 patients [36]. These ECGs are 10 s in length and were captured at two different sampling rates, 100 Hz and 500 Hz, with 16-bit resolution, ensuring that the data were of high accuracy. Within this dataset, there are several distinct ECG rhythms and abnormalities, including normal, MI, conduction disturbance, and hypertrophy.
PhysioNet-2021: The Physionet-2021 includes 12-lead ECG recordings from a large cohort of 6877 patients suffering from various CAs [37]. These recordings have been collected from six different hospital systems, located in four different countries spread across three continents. The dataset is available publicly as training data, with over 88,000 ECGs shared for this purpose. Some of the previously described databases were later included and are now part of Physionet-2021 (e.g., INCART, PTB, and PTB-XL).

4. Basic Data Augmentation Methods

The concept of basic DA techniques of ECGs originally comes from the random transformation of image and time series, such as scaling, flipping, noise addition, etc. There are three basic DA techniques for ECGs: time domain, magnitude domain, and hybrid domain. Time domain transformations change the ECG along the time axis, i.e., the data points on the ECG are moved to different time steps than the original sequence. Magnitude domain transformation is different from time domain transformation because the time steps stay the same while only the values of the elements (mV) change. For example, scaling, adding noise, dropping, etc., are all ways to change the values of the elements. Hybrid methods use both time and magnitude domains. Namely, basic DA generates pattern

x^{^{'}}

using some random transformation functions.

x^{^{'}} \leftarrow f (x)

(1)

where

x

is defined as

x = {[x_{1}, x_{2}, \dots, x_{N}]}^{⊺}

with N the number of time steps from the original dataset. Each

x_{n}

represents the ECG amplitude at time n for each of the L measurements collected by multiple electrodes concurrently. For example, the standard clinical ECG is typically stored in a

N \times 12

matrix (or its transpose). Based on the search query, we obtained several papers that are related to the basic DA method. The summary of basic DA is tabulated in Table 3. In the following, we provide details about the most commonly used basic DA operations in the paper we analyzed.

Noise addition: The ECG signal $x$ is modified by adding Gaussian random noise $n$ . The noise $n$ is generated by a random generator with a mean of 0 and a standard deviation of $σ$ . Mathematically, the generated signal can be expressed as: $x + n$ [19,38,39,40,41,42,43,44,45].
Scaling: Each lead of the ECG signal is scaled by a random factor that is drawn from a normal distribution [21,25,38,39,40,42,46]. The operation is typically performed by multiplying the ECG signal by a diagonal matrix.
Time inversion: Given the ECG signal $x$ , the temporally inverted version of the signal is expressed as $x^{^{'}} = {[x_{N}, x_{N - 1}, \dots, x_{1}]}^{⊺}$ [38,40,41,47].
Spatial inversion: The amplitude of the ECG $x$ is multiplied by −1, causing a spatial inversion of the ECG. The transformed signal can be mathematically expressed as $- x$ [38,40,41].
Time-spatial inversion: Temporal–spatial inversion is a fusion of spatial and temporal inversion. Temporal–spatial reversal performs a horizontal reverse on the segments that have been vertically reversed [41,47,48].
Permutation: The ECG signal is split into multiple segments and shuffled so as to randomly alter the temporal position of each segment and then recombined [38,40].
Dropping: Masking the input signal randomly with a certain probability [38,40,44,49].
Cutout: Randomly cut out (set to zero) a portion of the input signal with the width of magnitude × signal length [19,38,40,50] (same as dropping, but with each portion having the same given length).
Sine: A sine wave is added to the ECG signal. The sine wave’s frequency and amplitude are chosen randomly [38,40].
Square pulse: The ECG signal is added with a square pulse whose frequency and amplitude vary randomly [38,40].
Time warping: Randomly chosen segments of the original ECG signals are stretched or compressed along the time axis. Random signal segments along the time axis are stretched and compressed by dynamic time warping [38,51].
Baseline wandering: To create baseline wandering, different sinusoidal signals with random low frequencies and phases are generated and added to $x$ [51,52].
Lead removal: Lead removal is the process of picking a single lead at random and setting all of its time signal values to zero [52] (similar to dropping and cutout, but setting to zero an entire lead at once).
Lead order shuffling: Lead order shuffling involves changing the placement order of all leads, or a subset, in a random fashion [39,52].
High-pass filter: High-pass filtering employs a Butterworth filter with a fixed cutoff frequency (e.g., $0.5$ Hz) to filter signals and eliminate baseline wander noise across all leads [52].
Low-pass filter: A low-pass filter, specifically a Butterworth filter with a certain cutoff frequency (e.g., 47 Hz), is used to eliminate high-frequency noise from the noise for all leads [52]. Sometimes, this operation is referred to as Gaussian blur, as a one-dimensional Gaussian kernel is employed to “blur” (low-pass filter) the signal for all leads.
Band-pass filter: A band-pass filter eliminates baseline drift and high-frequency signal components for all leads by employing a Butterworth filter with a certain low cut-off frequency, e.g., $0.5$ Hz and a high cut-off frequency e.g., 47 Hz [39,45,52].
Sigmoid compression: Sigmoid compression applies a sigmoidal activation function to the ECG signal for all leads [52].
Powerline noise: Powerline noise refers to the interference pickup from powerlines at $f = 50$ Hz (or 60 Hz) and its higher harmonics. While powerline noise can be a nuisance for ECG, it can be added to the original signal as a form of DA to increase the variability of ECG signal [46,51,53].
Electromyographic (EMG) noise: EMG noise indicates the high-frequency noise induced by muscle contractions. Simulated EMG noise is added to the clean ECG signal using an appropriate signal processing technique, such as adding the two signals together or convolving the ECG signal with the EMG noise signal [51,53].
Baseline shift: Baseline shift refers to changes in the baseline that occur as a result of variations in electrode–skin impedance brought by electrode movements. In this operation, a direct current offset can be added to the ECG signal to simulate baseline shift noise. The direct current offset is randomly generated and varies within a certain range to make it more realistic [46,51,53].
Peak alteration: Peak alteration in ECG refers to any change or deviation from the normal shape and duration of the QRS complex or T-wave peaks in an ECG [21,54].
Mix-up: New signals are generated by linearly interpolating two other real signals, using different weights for each one [55].

Table 3. Summary of basic DA methods for ECG classification using AI techniques.

Type	Lead	Input	Classifier	Improvem. after DA	Dataset	Refs.
CA	12	ECG	CNN	2.24%	Physionet-2020	[56]
CA	12	ECG	CNN-LSTM	3%	Physionet-2020	[39]
CA	12	ECG	ResNet	−0.063–2.54%	CPSC-2018	[52]
CA	12	ECG	CNN	–	Physionet-2020	[50]
CA	12	ECG	CNN	–	Physionet-2020	[57]
CA	12	ECG	ResNet	1.4–3.5%	ICBEB and PTB-XL	[46]
CA	1	ECG	CNN	–	MIT-BIH AD	[58]
CA	1	Spectral	Residual Attention	0.8%	MIT-BIH AD	[59]
CA	12	ECG	CNN	7.73%	Physionet-2021	[40]
CA	1	ECG	CNN	–	MIT-BIH AD	[60]
CA	12	ECG	ResNet	40%	INCART	[54]
CA	2	ECG	CNN	2.3%	Physionet-2017	[61]
CA	1	Spectral	CNN	–	MIT-BIH AD	[62]
CA	1	ECG	CNN	0.028%	MIT-BIH AD	[63]
CA	1	Spectral	CNN	–	MIT-BIH AD	[64]
CA	12	ECG	CNN	–	Physionet-2020	[65]
CA	8	ECG	CNN	–	Private	[43]
CA	12	ECG	CNN	1%	Physionet-2020	[66]
CA	12	Spectral	CNN	4.64%	PTB	[67]
CA	1	Spectral	CNN	–	Physionet-2017	[68]
CA	1	ECG	CNN	5%	MIT-BIH AD	[47]
CA	12	ECG	CNN	–	Physionet-2021	[45]
CA	1	ECG	BeatGAN	0.28%	MIT-BIH AD	[69]
CA	1	ECG	ResNet-LSTM	–	MIT-BIH AD, AFDB and Physionet-2017	[70]
CA	1	Spectral	Residual-Attention	–	MIT-BIH AD and Supraventricular Arrhythmia	[71]
CA	1	Spectral	CNN	–	MIT-BIH AD	[72]
CA	1	ECG	LSTM	42%	Physionet-2017	[73]
CA	2	ECG	CNN-RNN	–	Private	[74]
CA	1	ECG	CNN-LSTM	3%	MIT-BIH AD	[75]
CA	1	ECG	CNN-RNN	1.91%	Physionet-2017	[55]
CA	–	Spectral	CNN	–	MIT-BIH AD and PTB	[76]
CA	1	ECG	CNN	–	Physionet-2017	[44]
CA	1	ECG	CNN	–	Physionet-2017	[77]
CA	1	ECG	CNN	–	Physionet-2017	[49]
CA	1	ECG	ResNet-RNN	–	Physionet-2017	[78]
CA	12	ECG	CNN	–	Physionet-2021	[79]
CA	1	ECG	CNN	0.62–5.61%	MIT-BIH AD	[80]
CA	1	Spectral	Transformer	–	MIT-BIH AD	[81]
Biometric	1	ECG	CNN	–	CYBHi and UofTDB	[82]
Biometric	1	ECG	CNN	0.19%	PTB and LivDet2015 [83]	[84]
Biometric	1	ECG	CNN	12%	Physionet-2018	[21]
Frailty Identification	1	ECG	LSTM	3.2%	Private	[42]
Sleep apnea	1	ECG	CNN	–	Private	[25]
Peak detection	2	ECG	CNN	2.5%	MIT-BIH-NST	[85]
QA	1	ECG	CNN	2%	Physionet-2017	[86]
QA	12	Spectral	CNN	2.91%	PhysioNet-2011	[30]
QA	2	ECG	U-Net	–	QT [87]	[31]
Cardiac auscultation	2	Spectral	CNN	2–9%	Private	[88]
COVID-19	12	Image	CNN	–	COVID-ECG [89]	[27]
COVID-19	12	Image	CNN	–	COVID-ECG [89]	[29]
COVID-19	12	Image	CNN	−0.02%	COVID-ECG [89]	[28]
Emotion	1	ECG	CNN-SVM	20%	MAHNOB-HCI [90]	[41]
Emotion	1	ECG	CNN	59%	Dreamer	[91]
Fetal ECG	1	ECG	LSTM	10%	NIFECGC	[19]

5. Advanced Data Augmentation Techniques

Basic DA often alters the properties of ECG signals, resulting in the creation of “noise” rather than augmenting the dataset with meaningful samples. These augmented samples may have detrimental effects on ECG classification. For example, in [52], the authors reported that horizontal flipping and vertical flipping DA operations have detrimental effects on their classifier. To tackle the limitations of basic DA techniques, advanced DA techniques can be a reasonable alternative. Based on the search query, we obtained several papers that are related to advanced DA methods. The summary of advanced DA is tabulated in Table 4. Advanced DA techniques can be categorized into two types: statistical generative models and learning-based models. These two approaches are described in the following subsections.

5.1. Statistical Generative Model

ECG DA approaches based on statistical generative models typically involve modeling the dynamics of the ECG with, in fact, statistical models. For example, in [11], the authors proposed a GMM to solve the class imbalance issues of the AF detector. The GMM model showed better performance compared to oversampling the minority class. Silva et al. [12] designed a cardiorespiratory signal synthesizer by conditional sampling from a multimodally trained stochastic system of Gaussian copulas integrated with an MC. Zhu et al. [13] proposed a novel DA technique that took into account both probability distribution and geometry. In their technique, they introduced variations to the data distribution along the geodesic in a Wasserstein space, which is a mathematical concept used to measure the distance between two probability distributions. To calculate the ground metric of the Wasserstein space, they analyzed the cardiovascular characteristics of ECG signals, enabling them to compare their geometry. Then, the augmented samples were fed to the multi-feature transformer mode with real samples. The result was a significant improvement in performance: the AUCROC on the PTB-XL dataset increased by 6–17% compared to the unaugmented dataset.

5.2. Learning Based-Models

In the field of AI, DL-based generative models have emerged as a powerful tool for generating diverse data samples. These models have received widespread attention due to their ability to produce high-quality synthetic data that resemble real-world data. While there are several generative models available, not all of them have been utilized for ECG DA. Therefore, in the following subsection, we focused on the specific methods that have been employed for DA, which can potentially help to overcome the challenges of limited labeled ECG data in various AI-based ECG applications.

5.2.1. Embedding Space

ECG DA techniques should not only be able to make diverse samples but also be able to imitate the features of real ECG. Due to the manifold unfolding in feature space, it is hypothesized that simple transformations applied to encoded inputs instead of the raw inputs would yield more convincing synthetic data. For example, Zhang et al. [48] used basic DA techniques for representational learning in the embedding space. Their learning model comprises two modules: an encoder and a classifier. The encoder generates representations using the temporal–spatial reverse detection approach, while the classifier is responsible for performing the temporal–spatial reverse detection task during the learning phase. After completing the learning process, the trained encoder is transferred to the second stage (the classifier) to be applied in different downstream tasks.

5.2.2. Deep Generative Models

To generate realistic high-dimensional data, such as images, time series, and sequence data, deep generative models (DGMs) have lately demonstrated promising results. Based on the obtained results regarding the ECG, we can categorize DGMs into two broad categories: encoder–decoder networks and generative adversarial networks. In the following sections, we provide details about these two DGMs.

Variational Autoencoder (VAE): The VAE is a powerful DL architecture that has revolutionized the field of unsupervised learning. At the heart of a VAE, there are three critical components: an encoder, a decoder, and a loss function. The encoder and decoder are two distinct types of neural network, each responsible for encoding high-dimensional or structural inputs into a lower-dimensional latent space and decoding them back into high-dimensional outputs, respectively. The loss function used in VAEs is the negative log-likelihood, augmented with a regularizer to ensure that the generated outputs remain consistent with the input data. By sampling vectors from the latent space and transforming them through the decoder, VAEs can generate entirely new patterns, making them a powerful tool for data synthesis and augmentation. In [14], the author used vector quantized VAE (VQ-VAE) to augment the training samples of spectral images of 12 leads ECG. They reported that their method improved the performance by

6 %

compared with the unaugmented data. Al Nazi et al. [101] used a VAE model to increase the variations of ECG data. Thiam et al. [23] used deep denoising convolutional autoencoders (DDCAE). Their approach involves optimizing both the joint representation of input channels generated by a multimodal DDCAE and the additional neural network, trained simultaneously, performing the classification task.

Generative Adversarial Networks (GAN): The GAN is a type of DL framework introduced by Ian Goodfellow and his colleagues in June 2014 [137]. GANs became a common way to make new samples be included in the training set. GANs primarily utilize adversarial training to simultaneously optimize two neural networks: a generator and a discriminator. The generator network generates a sample supposed to be similar to those of the original distribution. This is achieved by extracting a random value from a multivariate normal distribution and feed it as input to the generator. The discriminator compares the output from the generator with the original samples and gives out a number between 0 and 1, indicating the probability of identifying a synthetic ECG rather than a real one.

In the context of ECG DA, the authors of [15,17,18,24,94,100,105,107,108,109,110,111,112,114,115,116,117,118,119,122,123,126] used GAN to augment the samples of the minor classes of the MIT-BIH AD. The augmented samples were then fed to a DL model for ECG beat classification, which demonstrated a notable improvement ranging from 0.24–32% compared to the unaugmented samples. While other studies such as [106,113] also employed GANs to augment ECG samples, they used ML-based classifiers, namely, random forest (RF) and support vector machines (SVM), respectively.

Zhou et al. [93] proposed conditional GAN (CGAN) to generate versatile ECG for improving the training efficiency of the DL model. Their methods improved the performance 1.3–2.6% on two different datasets, i.e., MIT-BIH AD and Physionet-2017. Instead of using the ECG signal as a GAN input, some researchers transformed the ECG signal into spectral images. For example, the authors in [11] converted ECG signals into images by using a logarithmic spectrogram.

Xiong et al. [92] designed an ECG generator that consists of three components: clinical ECG recordings, a mathematical model that uses ordinary differential equations, and a 37-layer convolutional recurrent network (CRN) for style transfer. At first, the mathematical model was utilized to create ECG waveforms that represent an idealized heart rate or pacing of the RR intervals using parameters for the mean and standard deviation of the heart rate. These ECG waveforms were then fed into the neural network for style transfer. The authors discovered that their network boosted the accuracy of AF detection by

3 %

when DA was employed.

Fangyu et al. [102] developed a novel approach to detect abnormal ECG signals with higher accuracy. To address the challenge of imbalanced data affecting model learning, they designed two DA techniques (BiLSTM-CNN and TimeGAN) to improve the semantic information of various features. Additionally, they proposed a contrastive learning framework to ensure consistency in data representation across two different channels. By maximizing the similarity of data representations and calculating contrastive loss, they obtained more complete data category embedding and correlation, which ultimately improved performance by 3% compared to the model without contrastive learning.

Some researchers only used the GANs for ECG synthesis. ECG synthesis has the potential to improve our understanding of the underlying mechanisms of various heart conditions and to develop more accurate diagnostic models. However, it is important to validate the accuracy and reliability of the models built using synthetic ECG signals before deploying them in clinical settings. Based on our search criteria, we found papers that used generative methods only for ECG synthesis; the summary of the methods is tabulated in Table 5. In these papers, different metrics were used to evaluate the performance of GAN models for ECG synthesis. The choice of the metric depends on the specific goals of the research and the characteristics of the generated ECG signals. Some commonly used metrics for evaluating the performance of GAN models for ECG synthesis include:

Mean Squared Error (MSE) and Root MSE (RMSE): Both MSE and RMSE are based on the average squared difference between the generated ECG signals and the ground truth ECG signals. A lower error indicates better performance.
Signal-to-Noise Ratio (SNR): The SNR metric calculates the ratio of the signal power to the noise power in the generated ECG signals. A higher SNR value indicates better performance.
Fré chet Inception Distance (FID): The FID metric measures the distance between the distribution of the generated ECG signals and the distribution of the real ECG signals. A lower FID value indicates better performance.
Maximum Mean Discrepancy (MMD): The MMD metric measures the distance between two distributions by comparing the mean of their feature representations in a reproducing Kernel Hilbert Space. If the MMD is small, it means that the two distributions are similar in the feature space, and the model trained on one distribution can generalize well to the other distribution.
Dice Coefficient (DC): The DC metric is used to measure the similarity or overlap between two sets or binary masks. The DC ranges from 0 to 1, where 0 indicates no overlap between the sets and 1 indicates a perfect match.
Percent Mean Square Difference (PMSD): The PMSD metric is calculated as the square of the difference between the values of the generated and real ECG, divided by the average of the values, and expressed as a percentage. A lower PMSD value indicates better performance.
Kernel Maximum Mean Difference (KMMD): The KMMD metric is an extension of MMD that maps data to a high-dimensional space using a kernel function to measure similarity between data points. It is used in generative models to evaluate the quality of generated data by comparing them to real data. A high KMMD value means that generated data are different from real data, while a low KMMD value means they are similar.

6. Discussion

Small-scale and imbalanced datasets limit the application of AI-based models in cardiology. Undoubtedly, DA is an effective way of solving such problems and has been widely used in various domains. However, DA for ECG signals poses challenges different than in other domains. One of the main problems of applying DA in this context is that ECG signals contain fine-grained information such as relative amplitudes of ECG waveforms (down to a few microvolts) and temporal relationships between data points (down to a few milliseconds). This fine-grained information plays a significant role in the AI-based classifier. Indeed, a synthetic ECG signal can be advantageous for the AI-based model if its fine-grained information is present in the generated sample. Otherwise, DA may have a detrimental effect on the classifier. Universality is another important issue of DA techniques, i.e, DA techniques depend on the input type, input shape (number of data points and number of leads), number of parameters (hyper-parameters) of the AI-based method/DL model, or applications. Moreover, the effectiveness of the same DA varies on ECG rhythms, so applying the same DA techniques for two different types of rhythms could increase performance in one case and degrade performance in another.

Various DA methods have been proposed to generate synthetic ECGs from real ones and improve the performance of the AI-based models. We mostly categorized the DA techniques into two broad categories: basic and advanced. The basic DA techniques are generally simple to apply and relatively fast to compute. Many operations in basic DA showed promising results compared to unaugmented datasets. We however do not recommend the use of time inversion, spatial inversion, permutation, and lead shuffling, whereas a careful design of all other basic DA techniques is needed. For example, scaling the QRS complex of a healthy subject induces symptoms of cardiac hypertrophy. Artificially prolonging the PR interval is instead a symptom of atrioventricular block. In another example, in the context of MI, the lead order is essential for the correct localization of the infarcted area. These techniques may generate non-physiological ECGs or new ECGs belonging to other diagnostic classes, potentially leading to detrimental effects rather than being advantageous. In the field of advanced DA for ECG analysis, researchers have extensively explored the use of GAN-based methods. Of note, most of these works have been on synthesizing or generating ECG beats from the MIT-BIH AD dataset, primarily emphasizing beat-level variations. However, it is important to note that there is limited research available on rhythm generation using advanced DA techniques for specific applications. The focus has primarily been on beat-level augmentation, rather than capturing the broader rhythmic patterns present in ECG data. As a result, it is challenging to generalize which DA techniques would be optimal for specific applications where rhythm generation is the primary concern.

In addition, the improvements obtained vary greatly among different DA techniques, datasets, preprocessing steps, and applications. Generally, it is not possible to determine which augmentation method works best for a given dataset and only empirical tests can drive the selection. On the other hand, advanced DA techniques, such as generative models, can generate higher-quality synthetic data which preserve the statistical properties of the original data distribution, making them more representative and similar to real-world data. Therefore, these advanced DA techniques hold great potential in improving the accuracy and robustness of AI-based models.

In Section 4 and Section 5, we specifically discussed methods to generate ECG signals. However, several studies implemented DL models with spectral images as inputs (see Table 3 and Table 4). With spectral methods, the ECG signals are transformed into spectral images which are then used for classification instead of the raw ECG signals [11,14,26,59,62,95]. DA is performed directly on the spectral images rather than the ECG signals. Despite their use being motivated by the fact that ECG features are often accompanied by changes in frequency band energy, the role of the phase of the sinusoidal components (in which the ECG is decomposed) is overlooked, which could potentially reduce the DL performance. Although spectral methods have shown promising results in certain research settings, interpretability and explainability are further compromised because variations in spectral images cannot straightforwardly be associated with changes in ECG features known to cardiologists.

In our analysis, we also have come to the conclusion that there are still obstacles and challenges that need to be addressed. First, there is currently no clear consensus on the most effective proportion of real and synthetic ECGs to use in order to enhance performance and address overfitting issues. Some studies have investigated the impact of different ratios of real and synthetic ECGs on classification performance and have shown that increasing the number of synthetic ECG samples may not necessarily result in an increase in enhancement effect [46,92,136]. Researchers have used varying ratios of synthetic ECG and real ECG to improve AI-based model performance, but the optimal ratio depends on the specific application and must be determined based on the dataset being used. Therefore, further research is needed to explore the most effective use of real and synthetic ECGs in AI-based models for various applications. Second, quantifying the quality of generated synthetic ECGs is challenging since there is no universal method to quantify its similarity with real ECG signals. How does one quantify whether a synthetic ECG is “real” enough? One method is simply visual inspection: “Does it look right?” However, this often requires domain expertise and cannot be scaled. Most studies have not considered any evaluation scheme to quantify how much closer the synthetic ECG signal is to the real ECG signal (they rely on an increase in classification performance). There are several possible solutions for quantifying the quality of the synthetic ECGs. To begin with, we can extract some important features, i.e., heart rate, the amplitude of the QRS complex, peak-to-peak differences, etc., for every real and synthetic ECG. Then, some distance metrics such as Wasserstein distance, Kullback-–Leibler divergence, or Kolmogorov–-Smirnov could be used to quantify the similarity of the distributions.

In conclusion, we recommend conducting further research to overcome these issues. One promising avenue for exploration is the combination of various approaches of DA in order to expand the datasets. This could involve augmenting data in both the input space and feature space. For instance, adversarial learning can be used to provide secondary augmentation on synthetic ECGs generated using basic DA, potentially leading to a higher degree of variation in the synthetic ECGs. Moreover, combining meta-learning with DA might reveal why DA affects the performance of AI-based model for ECG classification. While DA using adversarial learning is currently popular, it is still important to figure out how to enhance the quality of synthetic ECGs. There is room for development in the areas of improving sample quality and evaluating their efficacy across a variety of datasets.

7. Conclusions

Collecting large-scale ECG datasets is challenging due to constraints on available patients, expert cardiologists, recording duration, and operational complexity. Data augmentation can be considered as an effective strategy to augment small-scale datasets and unbalanced minority classes of samples for addressing overfitting and to boost the performance of AI models. The paper discussed the current level of data augmentation research for ECG interpretation using artificial intelligence techniques. After examining the studies, we may conclude that effectiveness of DA methods can vary depending on the specific application. What works well for one application may not be suitable for others. Therefore, advancements in this field necessitate further investigations. In general, we found that data augmentation boosts the performance of automatic ECG analysis. In summary, the paper presented the practical suggestions and performance outcomes presented in the literature. It may provide guidance and help for ECG research and assist to model the inter-patient variability of ECG interpretation.

Author Contributions

Conceptualization, M.M.R., M.W.R., F.B. and R.S.; methodology, M.M.R. and M.W.R.; formal analysis, M.M.R.; data curation, M.M.R. and M.W.R.; writing—original draft preparation, M.M.R.; writing—review and editing, M.W.R., F.B. and R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data generated and analyzed during this study are available from the corresponding author by request.

Acknowledgments

Md Moklesur Rahman acknowledges support from a PhD fellowship funded by Cardiocalm srl, https://www.cardiocalm.com/, Italy.

Conflicts of Interest

The authors declare no conflict of interest.

References

Deaton, C.; Froelicher, E.S.; Wu, L.H.; Ho, C.; Shishani, K.; Jaarsma, T. The global burden of cardiovascular disease. Eur. J. Cardiovasc. Nurs. 2011, 10, S5–S13. [Google Scholar] [CrossRef] [PubMed]
Isais, R.; Nguyen, K.; Perez, G.; Rubio, R.; Nazeran, H. A low-cost microcontroller-based wireless ECG-blood pressure telemonitor for home care. In Proceedings of the International Conference of the IEEE Engineering in Medicine and Biology Society, Cancun, Mexico, 17–21 September 2003; Volume 4, pp. 3157–3160. [Google Scholar]
Acharya, U.R.; Fujita, H.; Lih, O.S.; Hagiwara, Y.; Tan, J.H.; Adam, M. Automated detection of arrhythmias using different intervals of tachycardia ECG segments with convolutional neural network. Inf. Sci. 2017, 405, 81–90. [Google Scholar] [CrossRef]
Hongo, R.H.; Goldschlager, N. Status of computerized electrocardiography. Cardiol. Clin. 2006, 24, 491–504. [Google Scholar] [CrossRef] [PubMed]
Rajpurkar, P.; Hannun, A.Y.; Haghpanahi, M.; Bourn, C.; Ng, A.Y. Cardiologist-level arrhythmia detection with convolutional neural networks. arXiv 2017, arXiv:1707.01836. [Google Scholar]
Ribeiro, A.H.; Ribeiro, M.H.; Paixão, G.M.; Oliveira, D.M.; Gomes, P.R.; Canazart, J.A.; Ferreira, M.P.; Andersson, C.R.; Macfarlane, P.W.; Meira, W., Jr.; et al. Automatic diagnosis of the 12-lead ECG using a deep neural network. Nat. Commun. 2020, 11, 1760. [Google Scholar] [CrossRef]
Pan, J.; Tompkins, W.J. A real-time QRS detection algorithm. IEEE Trans. Biomed. Eng. 1985, BME-32, 230–236. [Google Scholar] [CrossRef]
Ismael, S.A.A.; Mohammed, A.; Hefny, H. An enhanced deep learning approach for brain cancer MRI images classification using residual networks. Artif. Intell. Med. 2020, 102, 101779. [Google Scholar] [CrossRef]
Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely connected convolutional networks. In Proceedings of the Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 4700–4708. [Google Scholar]
Habibzadeh, M.; Jannesari, M.; Rezaei, Z.; Baharvand, H.; Totonchi, M. Automatic white blood cell classification using pre-trained deep learning models: Resnet and inception. In Proceedings of the International Conference on Machine Vision, Vienna, Austria, 13 April 2018; Volume 10696, pp. 274–281. [Google Scholar]
Hatamian, F.N.; Ravikumar, N.; Vesal, S.; Kemeth, F.P.; Struck, M.; Maier, A. The effect of data augmentation on classification of atrial fibrillation in short single-lead ECG signals using deep neural networks. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, 4–8 May 2020; pp. 1264–1268. [Google Scholar]
Silva, D.; Leonhardt, S.; Antink, C.H. Copula-Based Data Augmentation on a Deep Learning Architecture for Cardiac Sensor Fusion. IEEE J. Biomed. Health Inform. 2020, 25, 2521–2532. [Google Scholar] [CrossRef]
Zhu, J.; Qiu, J.; Yang, Z.; Weber, D.; Rosenberg, M.A.; Liu, E.; Li, B.; Zhao, D. GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction. arXiv 2022, arXiv:2208.01220. [Google Scholar]
Liu, H.; Zhao, Z.; Chen, X.; Yu, R.; She, Q. Using the VQ-VAE to improve the recognition of abnormalities in short-duration 12-lead electrocardiogram records. Comput. Methods Programs Biomed. 2020, 196, 105639. [Google Scholar] [CrossRef]
Ma, S.; Cui, J.; Chen, C.L.; Chen, X.; Ma, Y. An Effective Data Enhancement Method for Classification of ECG Arrhythmia. Measurement 2022, 203, 111978. [Google Scholar] [CrossRef]
Liberati, A.; Altman, D.G.; Tetzlaff, J.; Mulrow, C.; Gøtzsche, P.C.; Ioannidis, J.P.; Clarke, M.; Devereaux, P.J.; Kleijnen, J.; Moher, D. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: Explanation and elaboration. J. Clin. Epidemiol. 2009, 62, e1–e34. [Google Scholar] [CrossRef] [PubMed]
Golany, T.; Radinsky, K. PGANs: Generative adversarial networks for ECG synthesis to improve patient-specific deep ECG classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 557–564. [Google Scholar]
Golany, T.; Radinsky, K.; Freedman, D. SimGANs: Simulator-based generative adversarial networks for ECG synthesis to improve deep ECG classification. In Proceedings of the International Conference on Machine Learning, Virtual Event, 13–18 July 2020; pp. 3597–3606. [Google Scholar]
Shokouhmand, A.; Tavassolian, N. Fetal Electrocardiogram Extraction Using Dual-Path Source Separation of Single-Channel Non-Invasive Abdominal Recordings. IEEE Trans. Biomed. Eng. 2022, 70, 283–295. [Google Scholar] [CrossRef] [PubMed]
Labati, R.D.; Muñoz, E.; Piuri, V.; Sassi, R.; Scotti, F. Deep-ECG: Convolutional Neural Networks for ECG biometric recognition. Pattern Recognit. Lett. 2019, 126, 78–85. [Google Scholar] [CrossRef]
Barros, A.; Resque, P.; Almeida, J.; Mota, R.; Oliveira, H.; Rosário, D.; Cerqueira, E. Data improvement model based on ECG biometric for user authentication and identification. Sensors 2020, 20, 2920. [Google Scholar] [CrossRef]
Chen, G.; Zhu, Y.; Hong, Z.; Yang, Z. EmotionalGAN: Generating ECG to enhance emotion state classification. In Proceedings of the International Conference on Artificial Intelligence and Computer Science, Wuhan, China, 12–13 July 2019; pp. 309–313. [Google Scholar]
Thiam, P.; Kestler, H.A.; Schwenker, F. Multimodal Deep Denoising Convolutional Autoencoders for Pain Intensity Classification based on Physiological Signals. In Proceedings of the ICPRAM, Valletta, Malta, 22–24 February 2020; pp. 289–296. [Google Scholar]
Wicaksono, P.; Philip, S.; Alam, I.N.; Isa, S.M. Dealing with Imbalanced Sleep Apnea Data Using DCGAN. Trait. Signal 2022, 39, 1527–1536. [Google Scholar] [CrossRef]
Huysmans, D.; Castro, I.; Borzée, P.; Patel, A.; Torfs, T.; Buyse, B.; Testelmans, D.; Van Huffel, S.; Varon, C. Capacitively-Coupled ECG and Respiration for Sleep—Wake Prediction and Risk Detection in Sleep Apnea Patients. Sensors 2021, 21, 6409. [Google Scholar] [CrossRef]
Sobahi, N.; Sengur, A.; Tan, R.S.; Acharya, U.R. Attention-based 3D CNN with residual connections for efficient ECG-based COVID-19 detection. Comput. Biol. Med. 2022, 143, 105335. [Google Scholar] [CrossRef]
Shahin, I.; Nassif, A.B.; Alsabek, M.B. COVID-19 Electrocardiograms Classification using CNN Models. In Proceedings of the International Conference on Developments in eSystems Engineering, Sharjah, United Arab Emirates, 7–10 December 2021; pp. 448–452. [Google Scholar]
Anwar, T.; Zakir, S. Effect of image augmentation on ECG image classification using deep learning. In Proceedings of the International Conference on Artificial Intelligence, Islamabad, Pakistan, 5–7 April 2021; pp. 182–186. [Google Scholar]
Bassiouni, M.M.; Hegazy, I.; Rizk, N.; El-Dahshan, E.S.A.; Salem, A.M. Automated detection of covid-19 using deep learning approaches with paper-based ecg reports. Circuits Syst. Signal Process. 2022, 41, 5535–5577. [Google Scholar] [CrossRef]
Liu, G.; Han, X.; Tian, L.; Zhou, W.; Liu, H. ECG quality assessment based on hand-crafted statistics and deep-learned S-transform spectrogram features. Comput. Methods Programs Biomed. 2021, 208, 106269. [Google Scholar] [CrossRef]
Jimenez-Perez, G.; Alcaine, A.; Camara, O. Delineation of the electrocardiogram with a mixed-quality-annotations dataset using convolutional neural networks. Sci. Rep. 2021, 11, 863. [Google Scholar] [CrossRef] [PubMed]
Moody, G.; Mark, R. The MIT-BIH Arrhythmia Database on CD-ROM and software for use with it. In Proceedings of the Computers in Cardiology, Chicago, IL, USA, 23–26 September 1990; pp. 185–188. [Google Scholar] [CrossRef]
Clifford, G.D.; Liu, C.; Moody, B.; Li-wei, H.L.; Silva, I.; Li, Q.; Johnson, A.; Mark, R.G. AF classification from a short single lead ECG recording: The PhysioNet/computing in cardiology challenge 2017. In Proceedings of the Computing in Cardiology (CinC), Rennes, France, 24–27 September 2017; pp. 1–4. [Google Scholar]
Liu, F.; Liu, C.; Zhao, L.; Zhang, X.; Wu, X.; Xu, X.; Liu, Y.; Ma, C.; Wei, S.; He, Z.; et al. An open access database for evaluating the algorithms of electrocardiogram rhythm and morphology abnormality detection. J. Med. Imaging Health Inform. 2018, 8, 1368–1373. [Google Scholar] [CrossRef]
Bousseljot, R.; Kreiseler, D.; Schnabel, A. Nutzung der EKG-Signaldatenbank CARDIODAT der PTB über das Internet. 1995. Available online: https://www.degruyter.com/document/doi/10.1515/bmte.1995.40.s1.317/html (accessed on 11 January 2023).
Wagner, P.; Strodthoff, N.; Bousseljot, R.D.; Kreiseler, D.; Lunze, F.I.; Samek, W.; Schaeffter, T. PTB-XL, a large publicly available electrocardiography dataset. Sci. Data 2020, 7, 154. [Google Scholar] [CrossRef] [PubMed]
Reyna, M.A.; Sadr, N.; Alday, E.A.P.; Gu, A.; Shah, A.J.; Robichaux, C.; Rad, A.B.; Elola, A.; Seyedi, S.; Ansari, S.; et al. Will two do? Varying dimensions in electrocardiography: The PhysioNet/Computing in Cardiology Challenge 2021. In Proceedings of the Computing in Cardiology (CinC), Brno, Czech Republic, 13–15 September 2021; Volume 48, pp. 1–4. [Google Scholar]
Nonaka, N.; Seita, J. RandECG: Data Augmentation for Deep Neural Network based ECG classification. In Proceedings of the Advances in Artificial Intelligence: Selected Papers from the Annual Conference of Japanese Society of Artificial Intelligence, Virtual Event, Japan, 8–11 June 2021; pp. 178–189. [Google Scholar]
Hasani, H.; Bitarafan, A.; Baghshah, M.S. Classification of 12-lead ECG signals with adversarial multi-source domain generalization. In Proceedings of the Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Nonaka, N.; Seita, J. Electrocardiogram classification by modified EfficientNet with data augmentation. In Proceedings of the Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Guo, G.; Gao, P.; Zheng, X.; Ji, C. Multimodal Emotion Recognition Using CNN-SVM with Data Augmentation. In Proceedings of the International Conference on Bioinformatics and Biomedicine. IEEE, Las Vegas, NV, USA, 6–8 December 2022; pp. 3008–3014. [Google Scholar]
Eskandari, M.; Parvaneh, S.; Ehsani, H.; Fain, M.; Toosizadeh, N. Frailty Identification Using Heart Rate Dynamics: A Deep Learning Approach. IEEE J. Biomed. Health Inform. 2022, 26, 3409–3417. [Google Scholar] [CrossRef] [PubMed]
Xu, X.; Xu, H.; Wang, L.; Zhang, Y.; Xiao, F. Hygeia: A multilabel deep learning-based classification method for imbalanced electrocardiogram data. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022. [Google Scholar] [CrossRef]
Loh, J.; Wen, J.; Gemmeke, T. Low-Cost DNN Hardware Accelerator for Wearable, High-Quality Cardiac Arrythmia Detection. In Proceedings of the International Conference on Application-Specific Systems, Architectures and Processors, Manchester, UK, 6–8 July 2020; pp. 213–216. [Google Scholar] [CrossRef]
Liu, Y.; Xie, H.; Cao, Q.; Yan, J.; Wu, F.; Zhu, H.; Pan, Y. Multi-Label Classification of Multi-lead ECG Based on Deep 1D Convolutional Neural Networks With Residual and Attention Mechanism. In Proceedings of the Computing in Cardiology (CinC), Brno, Czech Republic, 3–15 September 2021; Volume 48, pp. 1–4. [Google Scholar]
Qiu, J.; Oppelt, M.P.; Nissen, M.; Anneken, L.; Breininger, K.; Eskofier, B. Improving Deep Learning-based Cardiac Abnormality Detection in 12-Lead ECG with Data Augmentation. In Proceedings of the International Conference of the Engineering in Medicine &Biology Society, Glasgow, UK, 11–15 July 2022; pp. 945–949. [Google Scholar]
Cayce, G.I.; Depoian, A.C.; Bailey, C.P.; Guturu, P. Improved Neural Network Arrhythmia Classification Through Integrated Data Augmentation. In Proceedings of the 2022 IEEE MetroCon, Hurst, TX, USA, 3 November 2022; pp. 1–3. [Google Scholar]
Zhang, W.; Geng, S.; Hong, S. A simple self-supervised ECG representation learning method via manipulated temporal–spatial reverse detection. Biomed. Signal Process. Control 2023, 79, 104194. [Google Scholar] [CrossRef]
Zihlmann, M.; Perekrestenko, D.; Tschannen, M. Convolutional recurrent neural networks for electrocardiogram classification. In Proceedings of the Computing in Cardiology (CinC), Rennes, France, 24–27 September 2017; pp. 1–4. [Google Scholar]
Duan, R.; He, X.; Ouyang, Z. MADNN: A multi-scale attention deep neural network for arrhythmia classification. In Proceedings of the Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Mehari, T.; Strodthoff, N. Self-supervised representation learning from 12-lead ECG data. Comput. Biol. Med. 2022, 141, 105114. [Google Scholar] [CrossRef]
An, J.; Gregg, R.E.; Borhani, S. Effective Data Augmentation, Filters, and Automation Techniques for Automatic 12-Lead ECG Classification Using Deep Residual Neural Networks. In Proceedings of the International Conference of the Engineering in Medicine &Biology Society, Glasgow, UK, 11–15 July 2022; pp. 1283–1287. [Google Scholar]
Friesen, G.M.; Jannett, T.C.; Jadallah, M.A.; Yates, S.L.; Quint, S.R.; Nagle, H.T. A comparison of the noise sensitivity of nine QRS detection algorithms. IEEE Trans. Biomed. Eng. 1990, 37, 85–98. [Google Scholar] [CrossRef]
Do, E.; Boynton, J.; Lee, B.S.; Lustgarten, D. Data Augmentation for 12-lead ECG Beat Classification. SN Comput. Sci. 2022, 3, 70. [Google Scholar]
Wang, M.; Rahardja, S.; Fränti, P.; Rahardja, S. Single-lead ECG recordings modeling for end-to-end recognition of atrial fibrillation with dual-path RNN. Biomed. Signal Process. Control 2023, 79, 104067. [Google Scholar] [CrossRef]
Sigurthorsdottir, H.; Van Zaen, J.; Delgado-Gonzalo, R.; Lemay, M. ECG classification with a convolutional recurrent neural network. In Proceedings of the Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Oppelt, M.P.; Riehl, M.; Kemeth, F.P.; Steffan, J. Combining scatter transform and deep neural networks for multilabel electrocardiogram signal classification. In Proceedings of the Computing in Cardiology, Rimini, Italy, 13-16 September 2020; pp. 1–4. [Google Scholar]
Acharya, U.R.; Oh, S.L.; Hagiwara, Y.; Tan, J.H.; Adam, M.; Gertych, A.; San Tan, R. A deep convolutional neural network model to classify heartbeats. Comput. Biol. Med. 2017, 89, 389–396. [Google Scholar] [CrossRef] [PubMed]
Xu, P.; Liu, H.; Xie, X.; Zhou, S.; Shu, M.; Wang, Y. Interpatient ECG Arrhythmia Detection by Residual Attention CNN. Comput. Math. Methods Med. 2022, 2022, 2323625. [Google Scholar] [CrossRef] [PubMed]
Mahmud, T.; Fattah, S.A.; Saquib, M. DeepArrNet: An efficient deep CNN architecture for automatic arrhythmia detection and classification from denoised ECG beats. IEEE Access 2020, 8, 104788–104800. [Google Scholar] [CrossRef]
Yu, Z.; Chen, J.; Liu, Y.; Chen, Y.; Wang, T.; Nowak, R.; Lv, Z. DDCNN: A Deep Learning Model for AF Detection from a Single-Lead Short ECG Signal. IEEE J. Biomed. Health Inform. 2022, 26, 4987–4995. [Google Scholar] [CrossRef] [PubMed]
Ullah, H.; Bin Heyat, M.B.; AlSalman, H.; Khan, H.M.; Akhtar, F.; Gumaei, A.; Mehdi, A.; Muaad, A.Y.; Islam, M.S.; Ali, A.; et al. An Effective and Lightweight Deep Electrocardiography Arrhythmia Recognition Model Using Novel Special and Native Structural Regularization Techniques on Cardiac Signal. J. Healthc. Eng. 2022, 2022, 3408501. [Google Scholar] [CrossRef] [PubMed]
Liu, S.; Zhou, B.; Ding, Q.; Hooi, B.; bo Zhang, Z.; Shen, H.; Cheng, X. Time series anomaly detection with adversarial reconstruction networks. IEEE Trans. Knowl. Data Eng. 2022, 35, 4293–4306. [Google Scholar] [CrossRef]
Sangeetha, D.; Selvi, S.; Ram, M.S.A. A CNN based similarity learning for cardiac arrhythmia prediction. In Proceedings of the International Conference on Advanced Computing. IEEE, Chennai, India, 18–20 December 2019; pp. 244–248. [Google Scholar]
Goodfellow, S.D.; Shubin, D.; Greer, R.W.; Nagaraj, S.; McLean, C.; Dixon, W.; Goodwin, A.J.; Assadi, A.; Jegatheeswaran, A.; Laussen, P.C.; et al. Rhythm classification of 12-lead ECGs using deep neural networks and class-activation maps for improved explainability. In Proceedings of the 2020 Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Weber, L.; Gaiduk, M.; Scherz, W.D.; Seepold, R. Cardiac abnormality detection in 12-lead ECGs with deep convolutional neural networks using data augmentation. In Proceedings of the 2020 Computing in Cardiology, Rimini, Italy, 13–16 September 2020; pp. 1–4. [Google Scholar]
Natesan, P.; Gothai, E. Classification of multi-lead ECG signals to predict myocardial infarction using CNN. In Proceedings of the International Conference on Computing Methodologies and Communication, Erode, India, 11–13 March 2020; pp. 1029–1033. [Google Scholar]
Almalchy, M.T.; ALGayar, S.M.S.; Popescu, N. Atrial fibrillation automatic diagnosis based on ECG signal using pretrained deep convolution neural network and SVM multiclass model. In Proceedings of the International Conference on Communications, Bucharest, Romania, 18–20 June 2020; pp. 197–202. [Google Scholar]
Zhou, B.; Liu, S.; Hooi, B.; Cheng, X.; Ye, J. BeatGAN: Anomalous Rhythm Detection using Adversarially Generated Time Series. In Proceedings of the IJCAI, Macao, China, 10–16 August 2019; pp. 4433–4439. [Google Scholar]
Kim, Y.K.; Lee, M.; Song, H.S.; Lee, S.W. Automatic cardiac arrhythmia classification using residual network combined with long short-term memory. IEEE Trans. Instrum. Meas. 2022, 71, 1–17. [Google Scholar] [CrossRef]
Xie, H.; Liu, H.; Zhou, S.; Gao, T.; Shu, M. A lightweight 2-D CNN model with dual attention mechanism for heartbeat classification. Appl. Intell. 2022, 1–16. [Google Scholar] [CrossRef]
Shanmugavadivel, K.; Sathishkumar, V.; Kumar, M.S.; Maheshwari, V.; Prabhu, J.; Allayear, S.M. Investigation of Applying Machine Learning and Hyperparameter Tuned Deep Learning Approaches for Arrhythmia Detection in ECG Images. Comput. Math. Methods Med. 2022, 2022, 8571970. [Google Scholar] [CrossRef]
Cao, P.; Li, X.; Mao, K.; Lu, F.; Ning, G.; Fang, L.; Pan, Q. A novel data augmentation method to enhance deep neural networks for detection of atrial fibrillation. Biomed. Signal Process. Control 2020, 56, 101675. [Google Scholar] [CrossRef]
Li, D.; Li, X.; Zhao, J.; Bai, X. Automatic staging model of heart failure based on deep learning. Biomed. Signal Process. Control 2019, 52, 77–83. [Google Scholar] [CrossRef]
He, J.; Rong, J.; Sun, L.; Wang, H.; Zhang, Y. An advanced two-step DNN-based framework for arrhythmia detection. In Proceedings of the Advances in Knowledge Discovery and Data Mining: Pacific-Asia Conference, PAKDD 2020, Singapore, 11–14 May 2020; pp. 422–434. [Google Scholar]
Pal, A.; Srivastva, R.; Singh, Y.N. CardioNet: An efficient ECG arrhythmia classification system using transfer learning. Big Data Res. 2021, 26, 100271. [Google Scholar] [CrossRef]
Nankani, D.; Dutta Baruah, R. An End-to-End framework for automatic detection of Atrial Fibrillation using Deep Residual Learning. In Proceedings of the TENCON 2019—2019 IEEE Region 10 Conference (TENCON), Kochi, India, 17–20 October 2019; pp. 690–695. [Google Scholar] [CrossRef]
Zhou, Y.; Hong, S.; Shang, J.; Wu, M.; Wang, Q.; Li, H.; Xie, J. K-margin-based residual-convolution-recurrent neural network for atrial fibrillation detection. arXiv 2019, arXiv:1908.06857. [Google Scholar]
Han, H.; Park, S.; Min, S.; Choi, H.S.; Kim, E.; Kim, H.; Park, S.; Kim, J.; Park, J.; An, J.; et al. Towards High Generalization Performance on Electrocardiogram Classification. In Proceedings of the Computing in Cardiology (CinC), Brno, Czech Republic, 13–15 September 2021; Volume 48, pp. 1–4. [Google Scholar] [CrossRef]
Sabor, N.; Gendy, G.; Mohammed, H.; Wang, G.; Lian, Y. Robust Arrhythmia Classification Based on QRS Detection and a Compact 1D-CNN for Wearable ECG Devices. IEEE J. Biomed. Health Inform. 2022, 26, 5918–5929. [Google Scholar] [CrossRef]
Bing, P.; Liu, Y.; Liu, W.; Zhou, J.; Zhu, L. Electrocardiogram classification using TSST-based spectrogram and ConViT. Front. Cardiovasc. Med. 2022, 9, 983543. [Google Scholar] [CrossRef]
da Silva Luz, E.J.; Moreira, G.J.P.; Oliveira, L.S.; Schwartz, W.R.; Menotti, D. Learning Deep Off-the-Person Heart Biometrics Representations. IEEE Trans. Inf. Forensics Secur. 2018, 13, 1258–1270. [Google Scholar] [CrossRef]
Mura, V.; Orrù, G.; Casula, R.; Sibiriu, A.; Loi, G.; Tuveri, P.; Ghiani, L.; Marcialis, G.L. LivDet 2017 Fingerprint Liveness Detection Competition 2017. In Proceedings of the International Conference on Biometrics, Gold Coast, QLD, Australia, 20–23 February 2018; pp. 297–302. [Google Scholar] [CrossRef]
Hammad, M.; Wang, K. Parallel score fusion of ECG and fingerprint for human authentication based on convolution neural network. Comput. Secur. 2019, 81, 107–122. [Google Scholar] [CrossRef]
Yun, D.; Lee, H.C.; Jung, C.W.; Kwon, S.; Lee, S.R.; Kim, K.; Kim, Y.S.; Han, S.S. Robust R-peak detection in an electrocardiogram with stationary wavelet transformation and separable convolution. Sci. Rep. 2022, 12, 19638. [Google Scholar] [CrossRef]
Huerta, Á.; Martínez-Rodrigo, A.; Rieta, J.J.; Alcaraz, R. ECG Quality Assessment via Deep Learning and Data Augmentation. In Proceedings of the Computing in Cardiology (CinC), Brno, Czech Republic, 13–15 September 2021; Volume 48, pp. 1–4. [Google Scholar]
Laguna, P.; Mark, R.G.; Goldberg, A.; Moody, G.B. A database for evaluation of algorithms for measurement of QT and other waveform intervals in the ECG. In Proceedings of the Computers in Cardiology, Lund, Sweden, 7–10 September 1997; pp. 673–676. [Google Scholar]
Yhdego, H.; Kidane, N.; Mckenzie, F.; Audette, M. ECG-based virtual pathology stethoscope tracking using transfer learning. In Proceedings of the Spring Simulation Conference, Fairfax, VA, USA, 18–21 May 2020; pp. 1–7. [Google Scholar]
Khan, A.H.; Hussain, M.; Malik, M.K. ECG Images dataset of Cardiac and COVID-19 Patients. Data Brief 2021, 34, 106762. [Google Scholar] [CrossRef]
Soleymani, M.; Lichtenauer, J.; Pun, T.; Pantic, M. A multimodal database for affect recognition and implicit tagging. IEEE Trans. Affect. Comput. 2011, 3, 42–55. [Google Scholar] [CrossRef]
Nita, S.; Bitam, S.; Heidet, M.; Mellouk, A. A new data augmentation convolutional neural network for human emotion recognition based on ECG signals. Biomed. Signal Process. Control 2022, 75, 103580. [Google Scholar] [CrossRef]
Xiong, Z.; Stiles, M.K.; Gillis, A.M.; Zhao, J. Enhancing the detection of atrial fibrillation from wearable sensors with neural style transfer and convolutional recurrent networks. Comput. Biol. Med. 2022, 146, 105551. [Google Scholar] [CrossRef] [PubMed]
Zhou, X.; Zhu, X.; Nakamura, K.; Noro, M. Electrocardiogram Quality Assessment with a Generalized Deep Learning Model Assisted by Conditional Generative Adversarial Networks. Life 2021, 11, 1013. [Google Scholar] [CrossRef] [PubMed]
Y ang, H.; Liu, J.; Zhang, L.; Li, Y.; Zhang, H. ProEGAN-MS: A progressive growing generative adversarial networks for electrocardiogram generation. IEEE Access 2021, 9, 52089–52100. [Google Scholar] [CrossRef]
Ma, K.; Chang’an, A.Z.; Yang, F. Multi-classification of arrhythmias using ResNet with CBAM on CWGAN-GP augmented ECG Gramian Angular Summation Field. Biomed. Signal Process. Control 2022, 77, 103684. [Google Scholar] [CrossRef]
Suh, J.; Kim, J.; Lee, E.; Kim, J.; Hwang, D.; Park, J.; Lee, J.; Park, J.; Moon, S.Y.; Kim, Y.; et al. Learning ECG representations for multi-label classification of cardiac abnormalities. In Proceedings of the Computing in Cardiology (CinC), Brno, Czech Republic, 13-15 September 2021; Volume 48, pp. 1–4. [Google Scholar]
Sarkar, P.; Etemad, A. CardioGAN: Attentive generative adversarial network with dual discriminators for synthesis of ECG from PPG. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual Event, 2–9 February 2021; Volume 35, pp. 488–496. [Google Scholar]
Tan, S.; Androz, G.; Chamseddine, A.; Fecteau, P.; Courville, A.; Bengio, Y.; Cohen, J.P. Icentia11k: An unsupervised representation learning dataset for arrhythmia subtype discovery. arXiv 2019, arXiv:1910.09570. [Google Scholar]
Fonseca, K.; Osorio, S.; Castillo, J.; Fajardo, C. Contrastive learning for atrial fibrillation detection in challenging scenarios. In Proceedings of the European Signal Processing Conference, Belgrade, Serbia, 29 August– 2 September 2022; pp. 1218–1222. [Google Scholar]
Adib, E.; Afghah, F.; Prevost, J.J. Arrhythmia Classification Using CGAN-Augmented ECG Signals. In Proceedings of the International Conference on Bioinformatics and Biomedicine, Las Vegas, NV, USA, 6–8 December 2022; pp. 1865–1872. [Google Scholar]
Al Nazi, Z.; Biswas, A.; Rayhan, M.A.; Abir, T.A. Classification of ECG signals by dot residual LSTM network with data augmentation for anomaly detection. In Proceedings of the International Conference on Computer and Information Technology, Dhaka, Bangladesh, 18–20 December 2019; pp. 1–5. [Google Scholar]
Li, F.; Chang, H.; Jiang, M.; Su, Y. A Contrastive Learning Framework for ECG Anomaly Detection. In Proceedings of the International Conference on Intelligent Computing and Signal Processing, Xi’an, China, 15–17 April 2022; pp. 673–677. [Google Scholar]
Wang, P.; Hou, B.; Shao, S.; Yan, R. ECG arrhythmias detection using auxiliary classifier generative adversarial network and residual network. IEEE Access 2019, 7, 100910–100922. [Google Scholar] [CrossRef]
Banerjee, R.; Ghose, A. Synthesis of realistic ECG waveforms using a composite generative adversarial network for classification of atrial fibrillation. In Proceedings of the European Signal Processing Conference, Dublin, Ireland, 23–27 August 2021; pp. 1145–1149. [Google Scholar]
Xia, Y.; Xu, Y.; Chen, P.; Zhang, J.; Zhang, Y. Generative adversarial network with transformer generator for boosting ECG classification. Biomed. Signal Process. Control 2023, 80, 104276. [Google Scholar] [CrossRef]
Sun, H.; Zhang, F.; Zhang, Y. An LSTM and GAN Based ECG Abnormal Signal Generator. In Proceedings of the Advances in Artificial Intelligence and Applied Cognitive Computing: Proceedings from ICAI’20 and ACC’20; Springer: Berlin/Heidelberg, Germany, 2021; pp. 743–755. [Google Scholar] [CrossRef]
Brophy, E.; De Vos, M.; Boylan, G.; Ward, T. Multivariate Generative Adversarial Networks and Their Loss Functions for Synthesis of Multichannel ECGs. IEEE Access 2021, 9, 158936–158945. [Google Scholar] [CrossRef]
Shaker, A.M.; Tantawi, M.; Shedeed, H.A.; Tolba, M.F. Generalization of Convolutional Neural Networks for ECG Classification Using Generative Adversarial Networks. IEEE Access 2020, 8, 35592–35605. [Google Scholar] [CrossRef]
Hossain, K.F.; Kamran, S.A.; Tavakkoli, A.; Pan, L.; Ma, X.; Rajasegarar, S.; Karmaker, C. ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks. In Proceedings of the 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), Pasadena, CA, USA, 13–16 December 2021; pp. 50–56. [Google Scholar] [CrossRef]
Liu, J.; Xia, X.; Peng, X.; Hui, J.; Han, C. Research on ECG Signal Classification Based on Data Enhancement of Generative Adversarial Network. In Proceedings of the Artificial Intelligence and Security: International Conference, ICAIS 2022, Qinghai, China, 15–20 July 2022; pp. 405–419. [Google Scholar]
Rath, A.; Mishra, D.; Panda, G.; Satapathy, S.C. Heart disease detection using deep learning methods from imbalanced ECG samples. Biomed. Signal Process. Control 2021, 68, 102820. [Google Scholar] [CrossRef]
Wang, H.; Zhou, Y.; Zhou, B.; Niu, X.; Zhang, H.; Wang, Z. Interactive ECG annotation: An artificial intelligence method for smart ECG manipulation. Inf. Sci. 2021, 581, 42–59. [Google Scholar] [CrossRef]
Wang, X.; Chen, B.; Zeng, M.; Wang, Y.; Liu, H.; Liu, R.; Tian, L.; Lu, X. An ECG Signal Denoising Method Using Conditional Generative Adversarial Net. IEEE J. Biomed. Health Inform. 2022, 26, 2929–2940. [Google Scholar] [CrossRef] [PubMed]
Wang, Y.; Sun, L.; Subramani, S. CAB: Classifying arrhythmias based on imbalanced sensor data. KSII Trans. Internet Inf. Syst. 2021, 15, 2304–2320. [Google Scholar]
Du, C.; Liu, P.X.; Zheng, M. Classification of imbalanced electrocardiosignal data using convolutional neural network. Comput. Methods Programs Biomed. 2022, 214, 106483. [Google Scholar] [CrossRef]
Islam, M.S.; Islam, M.N.; Hashim, N.; Rashid, M.; Bari, B.S.; Farid, F.A. New Hybrid Deep Learning Approach Using BiGRU-BiLSTM and Multilayered Dilated CNN to Detect Arrhythmia. IEEE Access 2022, 10, 58081–58096. [Google Scholar] [CrossRef]
He, Y.; Fu, B.; Yu, J.; Li, R.; Jiang, R. Efficient learning of healthcare data from IoT devices by edge convolution neural networks. Appl. Sci. 2020, 10, 8934. [Google Scholar] [CrossRef]
Golany, T.; Lavee, G.; Yarden, S.T.; Radinsky, K. Improving ECG classification using generative adversarial networks. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 13280–13285. [Google Scholar]
Ma, S.; Cui, J.; Xiao, W.; Liu, L. Deep Learning-Based Data Augmentation and Model Fusion for Automatic Arrhythmia Identification and Classification Algorithms. Comput. Intell. Neurosci. 2022, 2022, 1577778. [Google Scholar] [CrossRef]
Guryanova, V. Online augmentation for quality improvement of neural networks for classification of single-channel electrocardiograms. In Proceedings of the Analysis of Images, Social Networks and Texts: International Conference, AIST 2019, Kazan, Russia, 17–19 July 2019; pp. 37–49. [Google Scholar]
Shin, D.H.; Park, R.C.; Chung, K. Decision Boundary-Based Anomaly Detection Model Using Improved AnoGAN from ECG Data. IEEE Access 2020, 8, 108664–108674. [Google Scholar] [CrossRef]
Rafi, T.H.; Woong Ko, Y. HeartNet: Self Multihead Attention Mechanism via Convolutional Network with Adversarial Data Synthesis for ECG-Based Arrhythmia Classification. IEEE Access 2022, 10, 100501–100512. [Google Scholar] [CrossRef]
Golany, T.; Freedman, D.; Radinsky, K. ECG ODE-GAN: Learning ordinary differential equations of ECG dynamics via generative adversarial learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual Event, 2–9 February 2021; Volume 35, pp. 134–141. [Google Scholar]
Hernandez-Matamoros, A.; Fujita, H.; Perez-Meana, H. A novel approach to create synthetic biomedical signals using BiRNN. Inf. Sci. 2020, 541, 218–241. [Google Scholar] [CrossRef]
Jia, Z.; Hong, F.; Ping, L.; Shi, Y.; Hu, J. Enabling On-Device Model Personalization for Ventricular Arrhythmias Detection by Generative Adversarial Networks. In Proceedings of the ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, USA, 5–9 December 2021; pp. 163–168. [Google Scholar]
Sun, L.; Wang, Y.; Qu, Z.; Xiong, N.N. BeatClass: A Sustainable ECG Classification System in IoT-Based eHealth. IEEE Internet Things J. 2022, 9, 7178–7195. [Google Scholar] [CrossRef]
Maweu, B.M.; Shamsuddin, R.; Dakshit, S.; Prabhakaran, B. Generating Healthcare Time Series Data for Improving Diagnostic Accuracy of Deep Neural Networks. IEEE Trans. Instrum. Meas. 2021, 70, 1–15. [Google Scholar] [CrossRef]
Liu, H.; Zhao, Z.; She, Q. Self-supervised ECG pre-training. Biomed. Signal Process. Control 2021, 70, 103010. [Google Scholar] [CrossRef]
Dahal, K.; Ali, M.H. A Hybrid GAN-Based DL Approach for the Automatic Detection of Shockable Rhythms in AED for Solving Imbalanced Data Problems. Electronics 2022, 12, 13. [Google Scholar] [CrossRef]
Deng, Y.; Gao, Z.; Xu, S.; Ren, P.; Wen, Y.; Mao, Y.; Li, Z. ST-Net: Synthetic ECG tracings for diagnosing various cardiovascular diseases. Biomed. Signal Process. Control 2020, 61, 101997. [Google Scholar] [CrossRef]
Li, W.; Tang, Y.M.; Yu, K.M.; To, S. SLC-GAN: An automated myocardial infarction detection model based on generative adversarial networks and convolutional neural networks with single-lead electrocardiogram synthesis. Inf. Sci. 2022, 589, 738–750. [Google Scholar] [CrossRef]
Zhang, Y.; Zhao, Z.; Deng, Y.; Zhang, X. FHRGAN: Generative adversarial networks for synthetic fetal heart rate signal generation in low-resource settings. Inf. Sci. 2022, 594, 136–150. [Google Scholar] [CrossRef]
Furdui, A.; Zhang, T.; Worring, M.; Cesar, P.; El Ali, A. AC-WGAN-GP: Augmenting ECG and GSR Signals Using Conditional Generative Models for Arousal Classification. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing and ACM International Symposium on Wearable Computers, Virtual Event, USA, 21–26 September 2021; pp. 21–22. [Google Scholar]
Garg, A.; Karimian, N. ECG Biometric Spoofing Using Adversarial Machine Learning. In Proceedings of the International Conference on Consumer Electronics, Las Vegas, NV, USA, 10–12 January 2021; pp. 1–5. [Google Scholar] [CrossRef]
Hu, J.; Li, Y. Electrocardiograph Based Emotion Recognition via WGAN-GP Data Enhancement and Improved CNN. In Proceedings of the Intelligent Robotics and Applications: International Conference, ICIRA 2022, Harbin, China, 1–3 August 2022; pp. 155–164. [Google Scholar]
Munia, M.S.; Nourani, M.; Houari, S. Biosignal oversampling using wasserstein generative adversarial network. In Proceedings of the International Conference on Healthcare Informatics, Oldenburg, Germany, 30 November–3 December 2020; pp. 1–7. [Google Scholar]
Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
Kalyakulina, A.I.; Yusipov, I.I.; Moskalenko, V.A.; Nikolskiy, A.V.; Kosonogov, K.A.; Osipov, G.V.; Zolotykh, N.Y.; Ivanchenko, M.V. LUDB: A new open-access validation tool for electrocardiogram delineation algorithms. IEEE Access 2020, 8, 186181–186190. [Google Scholar] [CrossRef]
Kuznetsov, V.; Moskalenko, V.; Gribanov, D.; Zolotykh, N.Y. Interpretable feature generation in ECG using a variational autoencoder. Front. Genet. 2021, 12, 638191. [Google Scholar] [CrossRef] [PubMed]
Ye, F.; Zhu, F.; Fu, Y.; Shen, B. ECG Generation With Sequence Generative Adversarial Nets Optimized by Policy Gradient. IEEE Access 2019, 7, 159369–159378. [Google Scholar] [CrossRef]
Seo, H.C.; Yoon, G.W.; Joo, S.; Nam, G.B. Multiple electrocardiogram generator with single-lead electrocardiogram. Comput. Methods Programs Biomed. 2022, 221, 106858. [Google Scholar] [CrossRef] [PubMed]
Xu, B.; Liu, R.; Shu, M.; Shang, X.; Wang, Y. An ECG denoising method based on the generative adversarial residual network. Comput. Math. Methods Med. 2021, 2021, 5527904. [Google Scholar] [CrossRef]
Hazra, D.; Byun, Y.C. SynSigGAN: Generative adversarial networks for synthetic biomedical signal generation. Biology 2020, 9, 441. [Google Scholar] [CrossRef] [PubMed]
Soleimani, R.; Lobaton, E. Enhancing Inference on Physiological and Kinematic Periodic Signals via Phase-Based Interpretability and Multi-Task Learning. Information 2022, 13, 326. [Google Scholar] [CrossRef]
Nankani, D.; Baruah, R.D. Investigating deep convolution conditional GANs for electrocardiogram generation. In Proceedings of the International Joint Conference on Neural Networks, Glasgow, UK, 19–24 July 2020; pp. 1–8. [Google Scholar]
Lee, J.; Oh, K.; Kim, B.; Yoo, S.K. Synthesis of electrocardiogram V-lead signals from limb-lead measurement using R-peak aligned generative adversarial network. IEEE J. Biomed. Health Inform. 2019, 24, 1265–1275. [Google Scholar] [CrossRef]
Zhu, F.; Ye, F.; Fu, Y.; Liu, Q.; Shen, B. Electrocardiogram generation with a bidirectional LSTM-CNN generative adversarial network. Sci. Rep. 2019, 9, 6734. [Google Scholar] [CrossRef]
Huang, S.; Wang, P.; Li, R. Noise ECG generation method based on generative adversarial network. Biomed. Signal Process. Control 2023, 81, 104444. [Google Scholar] [CrossRef]
Singh, P.; Pradhan, G. A New ECG Denoising Framework Using Generative Adversarial Network. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021, 18, 759–764. [Google Scholar] [CrossRef]
Chen, F.; Pan, Y.; Li, K.; Cheng, K.T.; Huan, R. Standard 12-lead ECG synthesis using a GA optimized BP neural network. In Proceedings of the International Conference on Advanced Computational Intelligence, Wuyi, China, 27–29 March 2015; pp. 289–293. [Google Scholar]
Abdelmadjid, M.A.; Boukadoum, M. Neural Network-Based Signal Translation with Application to the ECG. In Proceedings of the IEEE Interregional NEWCAS Conference, Quebec City, QC, Canada, 19–22 June 2022; pp. 542–546. [Google Scholar] [CrossRef]

Figure 1. Taxonomy of ECG DA techniques.

Figure 2. The search method for identifying relevant studies.

Table 1. List of search queries and the final query.

Parameter	Search Query
Signal type (Q1)	“ECG” OR “electrocardiography” OR “electrocardiogram” OR “EKG”
AI technique (Q2)	“DNN” OR “deep learning” OR “neural network” OR “AI" OR “artificial intelligence” OR “machine learning”
DA technique (Q3)	“augmentation” OR “synthesis” OR “generation”
Specific technique (Q4)	“GAN" OR “generative adversarial network” OR “normalizing flow” OR “stable diffusion”
Final query	Q1 AND Q2 AND (Q3 OR Q4)

Table 2. Inclusion and exclusion criteria for selecting papers.

Inclusion Criteria	Exclusion Criteria
Works published in the period between 1 January 2013 and 31 January 2023	Review papers and non-English written papers
Applying DA only to the ECG	Not applying DA and not providing a clear description of DA and datasets
With a clear description of DA	Not considering the ECG signal
Inclusion of AI technique	Not reporting performance metrics

Table 4. Summary of advanced DA methods for ECG classification using AI techniques.

Types	Lead	DA Methods	Input	Classifier	Improvem. after DA	Dataset	Refs.
CA	1	Style-transfer	ECG	CRN	3%	Physionet-2017 & Private	[92]
CA	2	CGAN	ECG	CNN	1.3–2.6%	MIT-BIH AD & Physionet-2017	[93]
CA	12	VAE	Spectral	CNN	0–6%	Private	[14]
CA	1	GAN	ECG	CNN	1%	MIT-BIH AD	[94]
CA	1	GAN	ECG	CNN	1.3%	MIT-BIH AD	[15]
CA	1	Embedding space	ECG	CNN	–	Physionet-2017	[48]
CA	1	GAN	Spectral	CBAM-ResNet	–	MIT-BIH AD	[95]
CA	12	Embedding space	ECG	Self-supervised	–	Physionet-2021	[96]
CA	1	GAN	Spectral	CNN	3%	Physionet-2017	[11]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[17]
CA	1	GAN	ECG	CNN	5-37%	MIT-BIH AD	[18]
CA	1	GAN	ECG-PPG	CNN	–	BIDMC	[97]
CA	1	MC	ECG	CNN	–	MIT-BIH AD	[12]
CA	1	Embedding space	ECG	CNN	5.8%	ICENTIA11K [98]	[99]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[100]
CA	1	VAE	ECG	CNN-LSTM	2%	MIT-BIH AD	[101]
CA	1 & 12	BiLSTM-CNN & TimeGAN	ECG	CNN	–	MIT-BIH AD & PTB	[102]
CA	12	GAN	ECG	ResNet	5%	CPSC-2018	[103]
CA	1	GAN	ECG	CRNN	14%	Physionet-2017	[104]
CA	1	GAN	ECG	Bi-LSTM	1.9%	MIT-BIH AD	[105]
CA	1	GAN	ECG	RF	11%	MIT-BIH AD	[106]
CA	1	GAN	ECG	LSTM	–	MIT-BIH AD & MIT-BIH NSR	[107]
CA	1	GAN	ECG	CNN	1.45%	MIT-BIH AD	[108]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[109]
CA	1	GAN	ECG	CNN-LSTM	2.65%	MIT-BIH AD	[110]
CA	1 & 12	GAN	ECG	CNN	–	MIT-BIH AD & PTB	[111]
CA	1	GAN	ECG	CNN	0.24%	MIT-BIH AD	[112]
CA	2	GAN	ECG	SVM	32%	MIT-BIH AD	[113]
CA	1	GAN	ECG	Bi-LSTM	2–51%	MIT-BIH AD	[114]
CA	1	VAE & GAN	ECG	CNN	5%	MIT-BIH AD	[115]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[116]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[117]
CA	1	GAN	ECG	LSTM	–	MIT-BIH AD	[118]
CA	1	GAN	ECG	ResNet-BiLSTM-attention	–	MIT-BIH AD	[119]
CA	1	AE	ECG	CNN	–	Physionet-2017	[120]
CA	1	GAN	Spectral	CNN	–	MIT-BIH AD	[121]
CA	1	GAN	ECG	Multi-head Attention	5–10%	MIT-BIH AD	[122]
CA	1	GAN	ECG	CNN	–	MIT-BIH AD	[123]
CA	1	GAN	ECG	CNN	32%	MIT-BIH AD	[113]
CA	1	BiRNN	ECG	Ensemble Bagged Trees	–	MIT-BIH AD	[124]
CA	1	GAN	ECG	CNN	4.8–8.1%	Private	[125]
CA	1	GAN	ECG	LSTM	4%	MIT-BIH AD	[126]
CA	1	GMM	ECG	ResNet	6.7%	MIT-BIH AD	[127]
CA	12	Embedding space	Spectral	Self-supervised	–	Private	[128]
CA	1	GAN	ECG	CNN	–	AHADB, VFDB, & CUDB	[129]
MI	1	Encoder-decoder	ECG	CNN	–	PTB	[130]
MI	12	Wasserstein Geodesic Perturbation	ECG	MFT	6–17%	PTB-XL	[13]
MI	1	GAN	ECG	CNN	4–6%	PTB	[131]
Fetal	1	GAN	ECG	CNN	12%	CTU-UHB	[132]
Emotion	1	GAN	ECG	LSTM	17%	CASE	[133]
Biometric	1	GAN	ECG	CNN	–	ECG-ID	[134]
Sleep-Apnea	1	GAN	ECG	CNN-LSTM	1.78	Apnea-ECG & MIT-BIH AD	[24]
Emotion	–	GAN	ECG	CNN	5.64%	Private	[135]
MI	12	GAN	ECG	SVM	0.75%	PTB	[136]
Emotion	1	GAN	ECG	SVM	–	DECAF	[22]
Pain intensity	1	DDCAE	ECG	NN	–	BioVid Heat Pain	[23]

Table 5. Summary of generative methods for ECG synthesis using AI techniques.

Lead	Input	Method	Metric	Dataset	Refs.
1	ECG	GAN	MMD (3.83 $\times 10^{- 3}$ )	LUDB [138]	[139]
1	ECG	GAN	KMMD (5.53)	MIT-BIH AD	[140]
1	ECG	GAN	MSE (0.017–0.099)	PTB-XL	[141]
1	ECG	GAN	SNR (40.85 dB)	MIT-BIH AD	[142]
1	ECG	GAN	RMSE (0.126)	MIT-BIH AD	[143]
1	ECG	AE	MSE (0.2)	MIT-BIH AD	[144]
1	ECG	GAN	FID (4.77–17.19)	MIT-BIH AD	[145]
2	ECG	GAN	PMSD (7.21%)	–	[146]
1	ECG	BiLSTM-CNN GAN	RMSE (0.276)	–	[147]
12	ECG	U-Net generator	DC (0.868)	Private and INCART	[148]
1	ECG	GAN	RMSE (0.015–0.028)	MIT-BIH AD	[149]
12	ECG	Genetic Algorithm-NN	RMSE (44.9–90) $μ$ V	PTB	[150]
12	ECG	CycleGAN	MSE ([0.5–31] $\times 10^{- 3}$ )	Private	[151]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rahman, M.M.; Rivolta, M.W.; Badilini, F.; Sassi, R. A Systematic Survey of Data Augmentation of ECG Signals for AI Applications. Sensors 2023, 23, 5237. https://doi.org/10.3390/s23115237

AMA Style

Rahman MM, Rivolta MW, Badilini F, Sassi R. A Systematic Survey of Data Augmentation of ECG Signals for AI Applications. Sensors. 2023; 23(11):5237. https://doi.org/10.3390/s23115237

Chicago/Turabian Style

Rahman, Md Moklesur, Massimo Walter Rivolta, Fabio Badilini, and Roberto Sassi. 2023. "A Systematic Survey of Data Augmentation of ECG Signals for AI Applications" Sensors 23, no. 11: 5237. https://doi.org/10.3390/s23115237

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Systematic Survey of Data Augmentation of ECG Signals for AI Applications

Abstract

1. Introduction

2. Method

2.1. Literature Search Strategy

2.2. Study Selection

2.3. Results of the Research

3. ECG Applications and Datasets

3.1. Typical ECG Applications

3.2. Datasets

4. Basic Data Augmentation Methods

5. Advanced Data Augmentation Techniques

5.1. Statistical Generative Model

5.2. Learning Based-Models

5.2.1. Embedding Space

5.2.2. Deep Generative Models

6. Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI