Original Research Article
Automatic identification of respiratory diseases from stethoscopic lung sound signals using ensemble classifiers

https://doi.org/10.1016/j.bbe.2020.11.003

Abstract

This paper investigates the application of different homogeneous ensemble learning methods to perform multi-class classification of respiratory diseases. The case sample involved a total of 215 subjects and consisted of 308 clinically acquired lung sound recordings and 1176 recordings obtained from the ICBHI Challenge database. These recordings corresponded to a wide range of conditions, including healthy, asthma, pneumonia, heart failure, bronchiectasis or bronchitis, and chronic obstructive pulmonary disease. Feature representation of the lung sound signals was based on Shannon entropy, logarithmic energy entropy, and spectrogram-based spectral entropy. Decision trees and discriminant classifiers were employed as base learners to build bootstrap aggregation and adaptive boosting ensembles. The optimal structure of the investigated ensemble models was identified through Bayesian hyperparameter optimization and then compared to classifiers typically used in the literature. Experimental results showed that boosted decision trees provided the best overall accuracy, sensitivity, specificity, F1-score, and Cohen's kappa coefficient of 98.27%, 95.28%, 98.9%, 93.61%, and 92.28%, respectively. Among the baseline methods, the support vector machine (SVM) performed best, though slightly below the boosted trees, with an average accuracy of 98.20%, sensitivity of 91.5%, and specificity of 98.55%. Despite their simplicity, the investigated ensemble classification methods exhibited a promising performance in detecting a wide range of respiratory disease conditions. Moreover, the adopted data fusion approach offers a promising alternative for reducing the effect of imbalanced data in clinical applications in general and in respiratory sound analysis studies in particular.
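For illustration, the multi-class metrics reported above can be computed from a confusion matrix as macro-averages over the six classes. The following Python sketch (using scikit-learn, with placeholder label arrays rather than the study's actual predictions) shows one such computation.

```python
# Illustrative computation of the multi-class metrics reported above.
# The label arrays are placeholders; sensitivity and specificity are derived
# per class from the confusion matrix and then macro-averaged.
import numpy as np
from sklearn.metrics import confusion_matrix, accuracy_score, f1_score, cohen_kappa_score

y_true = np.array([0, 1, 2, 3, 4, 5, 0, 1, 2, 2])   # hypothetical ground-truth labels
y_pred = np.array([0, 1, 2, 3, 4, 5, 0, 1, 1, 2])   # hypothetical predictions

cm = confusion_matrix(y_true, y_pred)
tp = np.diag(cm)
fn = cm.sum(axis=1) - tp
fp = cm.sum(axis=0) - tp
tn = cm.sum() - (tp + fn + fp)

sensitivity = np.mean(tp / (tp + fn))      # macro-averaged recall
specificity = np.mean(tn / (tn + fp))      # macro-averaged specificity
print(accuracy_score(y_true, y_pred),
      sensitivity, specificity,
      f1_score(y_true, y_pred, average='macro'),
      cohen_kappa_score(y_true, y_pred))
```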

Introduction

According to a recent report published by the Forum of International Respiratory Societies (FIRS), respiratory diseases (RDs) are among the leading causes of severe illness worldwide, with a death toll exceeding 4 million lives annually [1]. In 2017, the World Health Organization (WHO) declared that chronic RDs accounted for more than 10% of the global disease burden, second only to cardiovascular diseases [2].

The diagnostic process of RDs involves auscultation, a clinical examination in which the internal sounds of air moving into and out of the lungs are listened to. Pulmonary sounds are commonly auscultated through the anterior and posterior chest walls or the trachea using a stethoscope [3]. During auscultation, a medical practitioner examines a patient to identify adventitious (atypical) lung sounds superimposed on regular breathing patterns. Examples of commonly heard adventitious lung sounds (ALS) include coarse or fine crackles, pleural rubs, wheezes, and stridors [3], [4]. Many RDs that obstruct or restrict the respiratory pathways are characterized by the presence of ALS during breathing. These sound types can be distinctly identified based on their characteristic frequency, pitch, intensity, and energy. For example, both wheezes and stridors are continuous high-pitched sounds, occurring at frequencies of around 400 Hz and 500 Hz, respectively. Lower-pitched wheeze sounds are also known as rhonchi. High-pitched wheeze sounds may result from inflamed or narrowed bronchial tubes and are thus an indication of asthma or chronic obstructive pulmonary disease [5]. Stridors usually occur due to tracheal or laryngeal edema [6]. On the other hand, crackles are discontinuous high-pitched (fine) or low-pitched (coarse) waves associated with pneumonia, bronchitis, or heart failure [7]. Similarly, pleural rubs are low-pitched rhythmic sounds associated with an inflamed lung lining due to pleural effusion.
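As an illustration of how such frequency-based distinctions can be examined computationally, the short Python sketch below estimates the dominant frequency of a sound segment from its power spectral density; the signal and sampling rate are synthetic placeholders rather than actual lung sound data.

```python
# Illustrative sketch: estimating the dominant frequency of a sound segment,
# e.g. to check whether energy is concentrated near the wheeze range (~400 Hz).
# The signal and sampling rate are synthetic placeholders.
import numpy as np
from scipy.signal import welch

fs = 4000                                       # assumed sampling rate (Hz)
t = np.arange(0, 2.0, 1 / fs)
x = np.sin(2 * np.pi * 400 * t) + 0.1 * np.random.randn(t.size)  # toy wheeze-like tone

f, pxx = welch(x, fs=fs, nperseg=1024)          # power spectral density estimate
dominant_freq = f[np.argmax(pxx)]
print(f"Dominant frequency: {dominant_freq:.1f} Hz")
```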

Regardless of the type of stethoscopic system used, auscultation is considered one of the safest, easiest, and cheapest examination procedures. Moreover, it provides a patient-friendly and non-invasive solution for monitoring the function of the lungs and other respiratory organs [4]. These qualities are of great value in resource-constrained primary care settings where advanced diagnostic technologies, such as spirometry and radiography, are inaccessible. However, despite being routinely used in healthcare settings, there is a consensus among clinicians that standard pulmonary auscultation has some notable limitations. Firstly, acquiring good auscultatory skills requires extensive training and expertise. Moreover, efficient detection of adventitious sounds is sensitive to the level of experience and auditory acuity of the healthcare professional. Even when auscultation is performed by an expert practitioner, abnormal patterns are sometimes overlooked or misinterpreted during the examination [8]. Thus, subjectivity and inter-observer variability in observations and interpretations may limit the diagnostic effectiveness of this approach. These challenges have given rise to the significance of computer-aided auscultation systems that can perform automated identification of ALS and RDs.

Recently, researchers have proposed various artificial intelligence solutions for the identification of adventitious lung sounds. Generally, the proposed approaches were based on feature extraction paired with different classification models. At the feature extraction stage, breathing sounds were commonly characterized using several signal processing techniques, including higher-order statistics [9], spectrograms or scalograms [10], [11], wavelet transform coefficients [12], [13], the Hilbert–Huang transform [14], and mel-frequency cepstral coefficients (MFCC) [15]. These feature extraction methods have been utilized in conjunction with standard machine learning or deep learning methods such as naive Bayes classifiers [9], k-nearest neighbors [14], support vector machines [16], artificial neural networks (ANN) [13], convolutional neural networks (CNN) [17], and recurrent neural networks (RNN) [18]. Reported accuracies ranged between 70.2% and 97% for wheezes [19], [20], [21], [22], between 86% and 97.5% for crackles [23], [24], and reached 99% for normal sound types. In [9], discriminating between normal, fine crackle, coarse crackle, mono-wheeze, and poly-wheeze sounds using a tree-based classifier provided an overall accuracy of 94%.
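A typical instance of such a feature–classifier pairing can be sketched as follows in Python, where MFCCs summarize each recording and a support vector machine performs the classification; the file names, labels, and parameters are purely illustrative and do not reproduce any specific study cited above.

```python
# Illustrative MFCC + SVM pipeline of the kind surveyed above.
# The file list and labels are hypothetical placeholders.
import numpy as np
import librosa
from sklearn.svm import SVC

def mfcc_features(path, sr=4000, n_mfcc=13):
    """Load a recording and summarize its MFCCs by their time-axis means."""
    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

files = ["rec_001.wav", "rec_002.wav", "rec_003.wav", "rec_004.wav"]  # hypothetical recordings
labels = [0, 0, 1, 1]                                                  # hypothetical class labels

X = np.vstack([mfcc_features(f) for f in files])
clf = SVC(kernel="rbf").fit(X, labels)
print(clf.predict(X))
```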

Despite the various efforts to develop automated adventitious lung sound detection algorithms, their usefulness in identifying RDs is still limited. Recent studies have shown that the presence of abnormal respiratory sounds is not a distinctive characteristic of impaired respiratory function [3], [25]. For example, atypical respiratory sounds might not reflect impaired breathing patterns, and abnormalities do not always translate into audible sounds. These findings highlight the need for computer-aided tools capable of identifying RDs directly from lung sound signals, regardless of the presence of adventitious sounds. In this regard, a recent subclass of studies in the literature focused on exploring different machine learning and deep learning techniques to perform binary (normal vs. pathological) [26], [27], ternary (normal vs. chronic vs. non-chronic) [28], or multi-class [28], [29] classification of RDs. Investigated disease conditions included respiratory tract infections, pneumonia, bronchiectasis, bronchiolitis, asthma, and COPD. These studies reported accuracies of up to 93.3%, 99%, and 98% for binary, ternary, and multi-class classification, respectively.

It is worth noting that most of the satisfactory results achieved in the context of multi-class classification were based on hybrid deep learning approaches. Although these approaches exhibit a highly promising performance without the need for sophisticated feature engineering, training a reliable deep network architecture can be time-consuming and requires significant computational resources. Besides, the training process is iterative and involves multiple model parameters and enormous datasets. In clinical contexts, the lack of sufficient high-quality, diverse, and annotated training data is considered among the main limitations. To address the issue of data availability, previous studies have employed several data augmentation techniques to over-sample minority classes, such as variational autoencoders, adaptive synthetic sampling, and synthetic minority oversampling. However, minority oversampling techniques can introduce data leakage during the validation process, and the obtained results might thus be biased towards artificially high accuracies. In fact, most data augmentation approaches are mainly employed to improve classification performance, without addressing the imperative requirement of using fine-grained datasets that represent the studied population.
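The leakage concern can be made concrete: if synthetic minority samples are generated before the data are partitioned, near-duplicates of training samples may appear in the validation fold. The following Python sketch (using imbalanced-learn's SMOTE on a synthetic dataset as a stand-in for lung sound features) illustrates the safer ordering, in which oversampling is applied to the training fold only.

```python
# Illustrative sketch of avoiding oversampling-induced leakage:
# split the data first, then oversample the training fold only.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from imblearn.over_sampling import SMOTE

# Hypothetical imbalanced dataset standing in for the extracted features.
X, y = make_classification(n_samples=600, n_features=10,
                           weights=[0.9, 0.1], random_state=0)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# Correct ordering: synthetic samples are generated from the training fold only,
# so the held-out fold never influences (or duplicates) them.
X_res, y_res = SMOTE(random_state=0).fit_resample(X_tr, y_tr)
clf = RandomForestClassifier(random_state=0).fit(X_res, y_res)
print(clf.score(X_te, y_te))
```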

In this study, we carried out a multi-class RD classification task considering six different conditions, namely normal, asthma, pneumonia, heart failure, bronchiectasis and bronchitis (BRON disorders), and chronic obstructive pulmonary disease (COPD). To this end, a novel stethoscopic lung sound dataset was collected locally at King Abdullah University Hospital, Jordan University of Science and Technology, Irbid, Jordan. This dataset was complemented by the publicly available ICBHI Challenge database to obtain a more balanced distribution among the respiratory disease classes. In terms of validity for clinical applications, we believe that this data fusion approach provides a better alternative to the data augmentation techniques employed in the literature. We propose to tackle the classification problem through a simple yet effective framework utilizing entropy features along with homogeneous ensemble classification methods. Subsequently, a comparative investigation is carried out to compare the proposed ensemble models to several baseline machine learning classifiers frequently employed in previous works.
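The entropy-based representation can be sketched as follows; the Python code below uses common definitions of Shannon entropy, logarithmic energy entropy, and spectrogram-based spectral entropy on a placeholder signal, and the exact formulation and parameters of the original MATLAB implementation may differ.

```python
# Minimal sketch of the three entropy features used as the signal representation
# (common textbook definitions; the study's exact implementation may differ
# in normalization and binning choices).
import numpy as np
from scipy.signal import spectrogram

def shannon_entropy(x, n_bins=64):
    """Shannon entropy of the amplitude distribution (histogram estimate)."""
    counts, _ = np.histogram(x, bins=n_bins)
    p = counts / counts.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def log_energy_entropy(x, eps=1e-12):
    """Logarithmic energy entropy: sum of the log of per-sample energies."""
    return np.sum(np.log(x ** 2 + eps))

def spectral_entropy(x, fs):
    """Spectral entropy averaged over spectrogram time frames (normalized to [0, 1])."""
    f, t, sxx = spectrogram(x, fs=fs, nperseg=256)
    p = sxx / (sxx.sum(axis=0, keepdims=True) + 1e-12)   # normalize each frame
    h = -np.sum(p * np.log2(p + 1e-12), axis=0) / np.log2(p.shape[0])
    return h.mean()

fs = 4000                                   # assumed sampling rate
x = np.random.randn(5 * fs)                 # placeholder lung sound segment
features = [shannon_entropy(x), log_energy_entropy(x), spectral_entropy(x, fs)]
print(features)
```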

The rest of this paper is organized as follows. Section 2 describes the methods used to acquire the lung sound signals along with the mathematical formulation of the entropy features. A brief overview touching on the mathematical groundwork and the implementation of the classification models is also provided. Section 3 presents the experimental results, and Section 4 provides a discussion of these results. Finally, Section 5 concludes this paper.

Section snippets

Materials and methods

As shown in Fig. 1, the adopted methodology consists of the following main phases: data acquisition and preparation, feature extraction, construction and training of the ensemble and baseline classifiers, and finally performance evaluation. These steps are detailed below.
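As a rough sketch of the classifier-construction phase, the following Python snippet builds boosted and bagged decision-tree ensembles and tunes the boosted model through Bayesian optimization; scikit-optimize is used here only as a stand-in for the MATLAB implementation, and the feature matrix, search space, and parameter ranges are illustrative.

```python
# Sketch of the ensemble-construction phase: bagging and boosting with
# decision-tree base learners, with hyperparameters tuned by Bayesian
# optimization (scikit-optimize as a stand-in for MATLAB's bayesopt).
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from skopt import BayesSearchCV

# Placeholder feature matrix standing in for the entropy features (6 classes).
X, y = make_classification(n_samples=900, n_features=15, n_informative=10,
                           n_classes=6, random_state=0)

search_space = {
    "n_estimators": (10, 300),                   # ensemble size
    "learning_rate": (0.01, 1.0, "log-uniform"), # boosting learning rate
}

# Boosted decision trees (sklearn's AdaBoost uses tree stumps as its default
# base learners; the study also built bagged and boosted discriminant ensembles).
opt = BayesSearchCV(AdaBoostClassifier(random_state=0),
                    search_space, n_iter=20, cv=5, random_state=0)
opt.fit(X, y)
print(opt.best_params_, opt.best_score_)

# Bagged decision trees (bootstrap aggregation) with a fixed ensemble size.
bag = BaggingClassifier(n_estimators=100, random_state=0).fit(X, y)
print(bag.score(X, y))
```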

Experimental results

The investigated models were trained and tested using a computer with an Intel Core i7-8750H 2.20 GHz CPU (Intel Corporation, Santa Clara, USA), 16 GB of RAM, and an NVIDIA GeForce GTX 1050 Ti GPU (NVIDIA, California, USA). The training process for all the ensemble models lasted around 15.38 min, while the baseline models needed around 2.02 min to train in total. The complete analysis was performed using MATLAB (R2020a, MathWorks, Natick, Massachusetts, USA).

Discussion

Computer-aided detection of respiratory diseases can expedite diagnostic and treatment decisions and support the study of physiological patterns associated with various respiratory pathologies. In this work, we propose to combine entropy-based features and homogeneous ensemble classifiers to perform multi-class classification of a wide range of respiratory diseases.

As imperative to all machine learning frameworks, the feature extraction stage aims at providing a better representation of the

Conclusion

To sum up, this paper investigated the use of ensemble classifiers with a dataset of lung sounds obtained via a stethoscope to perform multi-class classification. The dataset included a total of 215 subjects with 308 clinically acquired lung sound recordings, in addition to 1176 recordings obtained from the ICBHI Challenge database. The feature representation was based on entropy measures, specifically Shannon entropy, logarithmic energy entropy, and spectrogram-based spectral entropy.

Authors’ contribution

Dr. Luay Fraiwan: project administration and supervision; fund acquisition, Abu Dhabi University Fund; conceptualization; methodology; writing. Eng. Omnia Hassanin: conceptualization, creating training and testing models; software: Matlab programming (ensemble classifier); methodology; formal analysis; writing. Dr. Mohammed Fraiwan: investigation: building the data acquisition system and measurement protocol; formal analysis: analyzing the recorded sound signal and clinical data verification;

Funding

This research was supported by the Deanship of Scientific Research at Jordan University of Science and Technology, Jordan (grant no. 20180356), and the Office of Research and Sponsored Programs (ORSP) at Abu Dhabi University, UAE.

Conflict of interest

The authors declare no conflict of interest associated with this work.

References (53)

  • K. He et al. Selecting the number of bins in a histogram: a decision theoretic approach. J Stat Plan Inference (1997)
  • Y. Freund et al. A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci (1997)
  • J.N.G.K. Bousquet. Global surveillance, prevention and control of chronic respiratory diseases: a comprehensive approach (2007)
  • C.D. Mathers et al. Projections of global mortality and burden of disease from 2002 to 2030. PLOS Med (2006)
  • M. Sarkar et al. Auscultation of the respiratory system. Ann Thorac Med (2015)
  • E. Andrès et al. Respiratory sound analysis in the era of evidence-based medicine and the world of medicine 2.0. J Med Life (2018)
  • R.X.A. Pramono et al. Evaluation of features for classification of wheezes and normal respiratory sounds. PLOS ONE (2019)
  • H. Pasterkamp et al. Respiratory sounds: advances beyond the stethoscope. Am J Respir Crit Care Med (1997)
  • S. Reichert et al. Analysis of respiratory sounds: state of the art. Clin Med Circ Respir Pulmon Med (2008)
  • L. Shi et al. Lung sound recognition algorithm based on VGGish-BiGRU. IEEE Access (2019)
  • J. Acharya et al. Feature extraction techniques for low-power ambulatory wheeze detection wearables
  • N. Gautam et al. Wavelet scalogram analysis of phonopulmonographic signals. Int J Med Eng Inform (2013)
  • Y.P. Kahya et al. Classifying respiratory sounds with different feature sets. 2006 International Conference of the IEEE Engineering in Medicine and Biology Society (2006)
  • A.D. Orjuela-Cañón et al. Artificial neural networks for acoustic lung signals classification. Ibero-American Congress on Pattern Recognition (2014)
  • M. Aykanat et al. Classification of lung sounds using convolutional neural networks. EURASIP J Image Video Process (2017)
  • R. Oweis et al. An alternative respiratory sounds classification system utilizing artificial neural networks. Biomed J (2014)