logo
Ponte Academic Journal
Mar 2020, Volume 76, Issue 3

A COMPREHENSIVE COMPARISON OF MACHINE LEARNING ALGORITHMS ON DIAGNOSING ASTHMA DISEASE AND COPD

Author(s): Erkut Bolat ,Hasan Yildirim, Sedat Altin, Eray Yurtseven

J. Ponte - Mar 2020 - Volume 76 - Issue 3
doi: 10.21506/j.ponte.2020.3.17



Abstract:
The diagnosis and classification of asthma and COPD, which are the most common respiratory diseases, have been vital importance. Recently, machine learning has been great interest for processing health data due to this importance. The focus of previous studies has been to obtain the most predictive algorithm for diagnosing aforementioned diseases. However, variable importance has received little attention. The principal objective of this study is to investigate machine learning algorithms both predictivity and variable importance evaluation abilities. A comprehensive comparison has been carried out by considering different performance criteria including accuracy, precision, recall, Kappa statistics, F-measure, ROC curve and AUC value. The findings have been indicated that Random forest (with CART learner), C5.0 for asthma and SVM (with non-linear kernel) and GBM for COPD are the best algorithms. While the most important variable for asthma is FEV3, MEF50 for COPD. FEV1 and FVC have been common variables among the top five variables. The rank of variables has been supported by statistical significance tests. Additionally, ROC curves and AUC values have provided similar and supportive results as visually. Consequently, this study should be of value to practitioners and researchers studying on expert systems on health sciences and machine learning applications.
Download full text:
Check if you have access through your login credentials or your institution