Machine learning and deep learning approach for medical image analysis: diagnosis to detection

Rana, Meghavi; Bhushan, Megha

doi:10.1007/s11042-022-14305-w

Machine learning and deep learning approach for medical image analysis: diagnosis to detection

Published: 24 December 2022

Volume 82, pages 26731–26769, (2023)
Cite this article

Download PDF

Multimedia Tools and Applications Aims and scope Submit manuscript

Machine learning and deep learning approach for medical image analysis: diagnosis to detection

Download PDF

17k Accesses
32 Citations
1 Altmetric
Explore all metrics

Abstract

Computer-aided detection using Deep Learning (DL) and Machine Learning (ML) shows tremendous growth in the medical field. Medical images are considered as the actual origin of appropriate information required for diagnosis of disease. Detection of disease at the initial stage, using various modalities, is one of the most important factors to decrease mortality rate occurring due to cancer and tumors. Modalities help radiologists and doctors to study the internal structure of the detected disease for retrieving the required features. ML has limitations with the present modalities due to large amounts of data, whereas DL works efficiently with any amount of data. Hence, DL is considered as the enhanced technique of ML where ML uses the learning techniques and DL acquires details on how machines should react around people. DL uses a multilayered neural network to get more information about the used datasets. This study aims to present a systematic literature review related to applications of ML and DL for the detection along with classification of multiple diseases. A detailed analysis of 40 primary studies acquired from the well-known journals and conferences between Jan 2014–2022 was done. It provides an overview of different approaches based on ML and DL for the detection along with the classification of multiple diseases, modalities for medical imaging, tools and techniques used for the evaluation, description of datasets. Further, experiments are performed using MRI dataset to provide a comparative analysis of ML classifiers and DL models. This study will assist the healthcare community by enabling medical practitioners and researchers to choose an appropriate diagnosis technique for a given disease with reduced time and high accuracy.

Medical image analysis based on deep learning approach

Article 06 April 2021

Muralikrishna Puttagunta & S. Ravi

Enhancing Medical Diagnosis Through Deep Learning and Machine Learning Approaches in Image Analysis

Medical Image Analysis Using Machine Learning and Deep Learning: A Comprehensive Review

1 Introduction

The significance of disease classification and prediction can be observed from the previous years. The important properties and features given in a dataset should be well-known to identify the exact cause along with the symptom of the disease. Artificial Intelligence (AI) has shown promising results by classifying and assisting in decision making. Machine Learning (ML), a subset of AI, has accelerated many research related to the medical field. Whereas, Deep Learning (DL) is a subset of ML that deals with neural network layers, analyzing the exact features required for disease detection [34, 71, 94]. The existing studies from 2014 to present, discusses many applications and algorithms developed for enhancing the medical field by providing accurate results for a patient. Using data, ML has driven advanced technologies in many areas including natural language processing, automatic speech recognition, and computer vision to deliver robust systems such as driverless cars, automated translation, etc. Despite all advances, the application of ML in medical care remained affected with hazards. Many of these issues were raised from medical care stating the goal of making accurate predictions using the collected data and managed by the medical system.

AI examines a given dataset using various techniques to get the required features or highlights from a huge amount of data resulting in difficulty for tracking down an ideal arrangement of significant features and excluding repetitive ones. Considering such features is inconvenient and accuracy metrics becomes erroneous. Hence, choosing a small subset from a wide scope of features will upgrade the efficiency of the model. Subsequently, the exclusion of inconvenient and repetitive features will decline the dimensionality of the information, speed up the learned model similar to boosting [37]. From the existing features, the significant features are extracted using practical approaches such as Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA). Particularly, choosing a feature has two essential clashing objectives, first, boosting the presentation of arrangement and second, limiting the count of features to conquer the issue of dimensionality. Hence, selection of features is considered as an essential task for aforementioned objectives. Later, research related to the features improvement was enhanced by using choice-based multi-target strategies. Thus, in this review, strategies to choose efficient features will be focused.

Cancer disease was identified using multiple techniques of image segmentation, feature selection, and regression using Root Mean Square Error (RMSE), with the parameters such as recognizing patterns, detecting objects, and classifying the image [7]. Brain tumor was detected using six classifiers and Transfer Learning (TL) techniques for image segmentation with Magnetic Resonance Imaging (MRI) of the brain [28]. Also, a TL approach was implemented to identify lung cancer and brain disease in [55]. It analyzed MRI and Computer-Tomography (CT) scan images by using supervised learning Support Vector Machine (SVM) classifiers. The image analysis process has been well understood in the existing studies. However, the techniques using ML and DL are continuously being updated. Therefore, it is a complex task for researchers to identify an accurate method for analyzing images and feature selection techniques varying with every method. The key contributions of this study include:

(i)
Classification of diseases after reviewing primary studies,
(ii)
Recognition of various image modalities provided by existing articles,
(iii)
Description of tools along with reliable ML and DL techniques for disease prediction,
(iv)
Dataset description to provide awareness of available sources,
(v)
Experimental results using MRI dataset to compare different ML and DL methods,
(vi)
Selection of suitable features and classifiers to get better accuracy, and.
(vii)
Insights on classification as well as review of the techniques to infer future research.

The significance of this review is to enable physicians or clinicians to use ML or DL techniques for precise and reliable detection, classification and diagnosis of the disease. Also, it will assist clinicians and researchers to avoid misinterpretation of datasets and derive efficient algorithms for disease diagnosis along with information on the multiple modern medical imaging modalities of ML and DL.

The study presented consists of 11 sections. The organization of the section is described as follows: Section 2 discusses the background of study, Section 3 discusses the review techniques, search criteria, source material and the quality assessment. Section 4 summarizes the current techniques and important parameters to acquire good accuracy. Section 5 gives an insight of medical image modalities. Section 6 sums up the tools and techniques being used in ML and DL models. Section 7 discusses the datasets used by the authors previously and gives an insight of data. Section 8 represents the experimental section using ML classifiers and DL models over brain MRI dataset. Section 9 recaps the analytic discussion about the techniques, datasets being used, tools in ML and DL, journals studied for the given article. Discussion, conclusion and future scope is discussed in Sections 10 and 11, respectively.

2 Background

This section discusses the preliminary terms which are required to comprehend this review. Further, it also presents the statistical analysis of ML and DL techniques used for medical image diagnosis.

2.1 Machine learning

ML is a branch of AI where a machine learns from the data by identifying patterns and automates decision-making with minimum human intervention [96, 24, 12]. The most important characteristic of a ML model is to adapt independently, learn from previous calculations and produce reliable results when new datasets are exposed to models repeatedly. The two main aspects include (i) ML techniques help the physicians to interpret medical images using Computer Aided Design (CAD) in a small period of time, and (ii) algorithms used for challenging tasks like segmentation with CT scan [81], breast cancer and mammography, segmenting brain tumors with MRI. Traditional ML models worked on structured datasets where the techniques were predefined for every step, the applied technique fails if any of the steps were missed. The process of evaluating the data quality used by ML and DL algorithms is essential [16–22, 61]. Whereas, new algorithms adapt the omission of data based on the requirement for robustness of the algorithm. Figure 1 illustrates the process used by ML algorithms for the prediction and diagnosis of disease.

2.2 Deep learning

DL models enable machines to achieve the accuracy by advancements in techniques to analyze medical images. In [58], the heart disease was diagnosed using the labelled chest X-Rays, cardiologist reviewed and relabelled all the data while discarding the data other than heart failure and normal images. To extract the exact features from the images, data augmentation and TL were used with 82% accuracy, 74% specificity and 95% sensitivity for heart failure. In [14], an automatic feature selection, using histopathology images with the labelling of positive and negative cancer images, was developed with minimum manual work. Two networks named Deep Neural Network (DNN) 2-F and DNN1-F were used with PCA to reduce features in DNN whereas for unsupervised feature learning a single-layer network of K-means centroids was used. Later, the results of unsupervised (93.56%) and supervised (94.52%) learning were compared. The DL model automates the feature extraction procedure to handle data efficiently [14, 74]. Figure 2 depicts the process used by DL algorithms for the prediction and diagnosis of various diseases.

To process the medical images for better prediction and accuracy, ML and DL techniques were used as shown in Figs. 1 and 2, respectively. As input, medical images from various modalities are taken into consideration, and then algorithms are applied to these images. Further, the input image is segmented based on various factors, these segments were used to extract the essential and maximum features using feature extraction techniques. After the extraction of the required features, they are further refined to obtain actual features used for the identification of diseases [60]. Also, ML approaches were used to denoise the medical images for better prediction and accuracy in [46]. Once the feature selection and noise removal from the data are achieved, the classification of the images according to the disease using classifiers like SVM, Decision Tree (DT), etc. was attained.

ML is the process where computers learn from data and use algorithms to carry out a task without being explicitly programmed. It uses pattern recognition to make predictions with new dataset. Alternatively, DL is modeled according to the human brain including a complex structure of algorithms enabling machines to process images, text and documents. It uses layered-structure algorithms such as Convolutional Neural Network (CNN), Artificial Neural Network (ANN), etc., to analyze the data with logics. Comparatively, DL is more capable of processing huge amount of data than ML models.

3 Review technique

In this section, an overview of the technique used to conduct this review systematically is discussed. It provides the details of the electronic databases used to search, retrieve information, and discuss the research questions framed to execute the review successfully. The systematic review guidelines implemented by [49, 50] were followed for this literature review.

3.1 Research questions

In this review, following review questions will be discussed:

1.
What are the present techniques of ML and DL used for medical imaging?
1. 1.1
  What are the considered parameters while selecting the classifiers?
2. 1.2
  What are the evaluation metrics used to evaluate classification models?
2.
What are various medical image modalities for classifying the diseases?
3.
What are the tools and techniques used for medical imaging?
4.
What are various datasets used by several researchers in the domain of healthcare?
5.
What are the results of comparative analysis of ML classifiers and DL models based on experiments using MRI dataset?

3.2 Source material

The guidelines given in [49, 50] are followed for searching the existing literature related to the area of ML and DL in medical imaging. Following electronic database sources are used for searching:

ScienceDirect (https://www.sciencedirect.com/).
IEEE Xplore (https://ieeexplore.ieee.org/Xplore/home.jsp).
Springer (https://www.springer.com/in).
PubMed (https://pubmed.ncbi.nlm.nih.gov/16495534/).
Wiley Interscience (https://onlinelibrary.wiley.com/).
Google Scholar (https://scholar.google.co.in/).
IOP (https://www.iop.org/#gref).
Oxford Publications (https://india.oup.com/).
Elsevier (Elsevier Books and Journals - Elsevier).
Hindawi (https://www.hindawi.com).
Bentham science (Bentham Science - International Publisher of Journals and Books).

3.3 Search criteria

This review consists of the articles written in English language between the years 2014–2022. The review process can be considered as the filtering process for attaining the quality research articles with the inclusion and exclusion criteria at various stages. The search was based on the keywords as shown in Table 1 to retrieve research articles from various journals, conferences, book chapters, and other sources.

Table 1 Keywords used

Full size table

The journals and conferences included were taken from IEEE, Science Direct, Springer, Oxford Publication, etc. The article selection method is depicted in Fig. 3. As depicted in Fig. 3, the initial search consisted of 16,900 articles which were refined to 250 based on the specific keywords used as shown in Table 1. Then 100 articles were retrieved based on their titles and were reduced to 75 articles based on their abstract and introduction. Finally, 40 articles were selected as primary studies based on the criteria of exclusion and inclusion.

3.4 Quality assessment

The quality of this review was assured after inclusion and exclusion criteria discussed in sub-section 3.3. These primary studies were from various journals, conferences, workshops, and others (manuscripts, online records, and society publications). To retrieve the quality articles, analysis of each article was done to maintain fairness and validation (external and internal) of the results based on the CRD guidelines [50].

Table 2 presents the top 20 highly influential and cited articles related to the classification of diseases, identification of tools and techniques, explanation for the cause of disease, and solutions to the diagnosed disease (source: https://scholar.google.co.in).

Table 2 Top 20 cited articles

Full size table

3.5 Data extraction

Initially, many challenges were faced to extract the relevant data for this review, therefore, some researchers were approached to acquire the necessary information. The method for extracting the required data in this review is as follow:

One of the authors extracted the data after a thorough review of 40 articles.
The acquired results of the review were cross checked by another author to maintain consistency.
During the process of cross checking (in case of a conflict), issues were resolved by meetings between the authors.

4 ML and DL techniques for medical imaging

Research question 1 is answered in this section to provide an overview of the current techniques of ML and DL for medical imaging. Further, followed by various parameters considered for selecting the classifiers and the evaluation metrics used to evaluate classification models. The existing literature review is divided according to the diseases such as breast cancer, brain tumor, lung disease, diabetes, multiple disease detection, etc.

4.1 Breast disease

In this subsection, articles related to breast disease symptoms, detection, classification, prediction and diagnosis using ML and DL methods are discussed. In [33], significant features were identified using BI-RADS (Breast Imaging Reporting and Data System) to develop a CAD system for obtaining breast ultrasound. Also, 10-fold cross validation technique was used upon the benign and malignant lesions. As a result, 77% accuracy was achieved using the SVM classifier. However, some methods with a few algorithms handling the vast variety of data need to be understood and analyzed precisely [84]. CNN was used to train the system with the available clinical data and to comprehend the complex structure. Moreover, it was suggested to study radiomics and expansion of CADx to get the tumor signs using a CAD system. Breast cancer disease was classified using the parameters like Area Under Curve (AUC), sensitivity, and specificity [100]. A CAD system was developed using CNN where a large number of features were required, using multiview features. These features provide the maximum details of the image data to be extracted for the accuracy of detection and classification.

DL was used for analyzing medical images and also, the limitations along with success of DL techniques for medical imaging were discussed in [86]. Recent ML and DL technologies were reviewed for the classification and detection of medical imaging modalities [39]. It provided an insight on the progress of the technology used in the medical field. Various ML techniques used for image processing and DL techniques with the architecture of the algorithm were discussed. To study the technologies, the evaluation of various images such as histological images, thermography images, mammography, ultrasound and MRI using the CAD system was explored. Moreover, the system included ML techniques like SVM, ANN, DT, Naive bayes, K-Nearest Neighbor (KNN), etc.

4.2 Brain disease

The concept of TL was used for image segmentation where the MRI scan of the brain was segmented using voxel wise classification [7]. ML classifiers were applied for the classification of multiple diseases. Later, the results obtained were compared with the existing results to detect the disease.

A brief introduction of DNN in medical image analysis to diagnose the brain tumor using brain tissues is provided in [56]. It indicated the ways for applying DL to the entire process of MRI scanning, image retrieval, segmentation and disease prediction. It also focused on image acquisition to image retrieval, and from feature segmentation to prediction of disease. The entire process was divided into two parts: (i) the signal processing of MRI including the image restoration and image registration, and (ii) usage of DL for disease detection and prediction-based reports in the form of text and images. Also, the influence of DL in medical imaging was discussed in [82]. Image segmentation approaches using DL included tumor segmentation, brain and lung’s structure with bone tissues or cells. Patches were taken as input and 2-Dimensional Convolutional Neural Network (2D-CNN) was used to preprocess these at a later stage.

4.3 Lung disease

DL has the ability to automate the process of image interpretation which enhances the clinical decision making, identifying the disease and predicting the best treatment for the patient by reviewing the pros and cons of the DL techniques [51]. These techniques were used for the cardiovascular medication, following are the steps for implementing DL model: (i) problem identification, (ii) data selection, (iii) hardware and software selection, (iv) data preparation, (v) feature selection, and (vi) splitting of data for training as well as validation process. In [13], a disease was analyzed automatically using labeled data and achieved the accuracy by processing medical images using DL models. The automatic prediction of the disease using ML techniques and the concept of big data was summarized to detect the patterns [23]. The advantages and disadvantages for each algorithm were also discussed.

4.4 Diabetes

A comparative analysis of the classification algorithms based on iris images, using an iridology chart, was done for the diagnosis of diabetes [76]. Type-2 diabetes was detected by identifying the center of the pupil of an eye at the early stage using the I-Scan-2. Also, a filter-based feature selection method was used with the combination of five classifiers namely binary tree, SVM, neural network model, Random Forest (RF) and adaptive boosting model. Later, in [77] a study was compiled using the textural, statistical and various features (62 features of iris) to detect the same disease, however, an iridology chart was not used. ML and DL techniques were used to diagnose the errors in existing diagnostic systems [81]. These techniques were used to analyze the medical images and extract the features which are required for the diagnosis of errors in existing diagnostic systems. Both supervised and unsupervised algorithms were used for the prediction of the disease in specific datasets.

It was observed that DL technique is a way more powerful to investigate medical images [65]. Various techniques such as image classification, object detection, pattern recognition, etc. were used for the proper decision-making. It improved medical treatments by predicting the early symptoms of a disease. Moreover, an overview of ML and DL techniques used in the medical field was given for providing knowledge to the future researchers. In [78], techniques such as rubber sheet normalization, ML classifiers, PCA, etc. were used with self-created data and computed six parameters (i) accuracy, (ii) sensitivity, (iii) specificity, (iv)AUC, (v) precision, and (vi) F-score for accurate prediction of Type-2 diabetes.

4.5 Multiple disease detection

Multiple diseases were identified with different radiology techniques like MRI imaging for breast cancer along with brain tumor, CAD for breast cancer along with skin lesions, and X-Rays for chest analysis [46]. Also, ML techniques were used to attain better accuracy with denoising techniques including homographic wavelet, soft thresholding, non-homomorphic and wavelet thresholding. A CAD system using CNN was proposed to diagnose breast lesions as benign and malignant to assist the radiologists [100]. It was implemented using Inception-v3 architecture to extract the multiview features from Automated Breast Ultrasound (ABUS) images. For the implementation of the model, 316 breast lesions data were trained and evaluated. ML feature extraction scheme was compared with the given method, resulting in 10% increase in AUC value.

A review on image fusion was presented in [42], it reduced the randomness and improved the quality of available images. Various methods and challenges related to image fusion were also summarized. In [44], ML and DL techniques focusing on small labeled dataset were discussed as it was considered one of the important factors in decision making. Further, noisy data in medical images was analyzed with pros and cons of various ML algorithms.

In [4], data augmentation techniques were used to evaluate the dermatology diseases such as acne, atopic dermatitis, impetigo, psoriasis, and rosacea. To diagnose the mentioned diseases, the model was retrained in two phases: (i) with data augmentation, and (ii) without data augmentation using TensorFlow Inception V3. For statistical analysis, both the models were then compared and six parameters namely: (i) Positive Predictive Value (PPV), (ii) Negative Predictive Value (NPV), (iii) Matthew’s Correlation Coefficient (MCC), (iv) sensitivity, (v) specificity, and (vi) F1 score were calculated resulting in an increase of 7.7% average correlation coefficient.

Multiple diseases like diabetes, heart disease, liver disease, dengue and hepatitis were identified by recognizing the pattern in the available data and classifying them further using ML classifiers [29, 27, 47]. It used high-dimensional and multimodal dataset to predict the diseases accurately. The deteriorating condition of a patient was predicted using ML techniques like ML pipelines, classifiers (SVM and 5-fold cross-validation) with the baseline variables from MRI imaging [79]. AI applications in medical imaging, DL tools for the prediction and pattern recognition were described in [87]. In addition, apart from AI techniques, ANN and CNN were also useful for predicting the disease by analyzing the image pattern and classification of the disease can be carried out with the help of classifiers [62, 63].

Various algorithms were reviewed to detect the error in the diagnosis system implying the importance of ML and DL for early diagnosis of the disease [81]. Whereas, [104] discussed the three main challenges: (i) to cope up with image variations, (ii) learning from weak labels, and (iii) interpreting the results with accuracy for the diagnosis of cancer through given medical images. It concluded that TL was used to cope up with image variations. The concept of Multiple Instance Learning (MIL) and weighted TL were used to overcome the weakly labeled data and improve the accuracy of the disease classification for better medical results, respectively. It was suggested to comprehend the relation between image label and image collection instead of learning about the individual instance. The main advantage of the used technique is that it does not require the local manual annotations.

Table 3 represents the current ML and DL techniques for medical imaging, various parameters considered while selecting the classifiers, identified disease and evaluation metrics. Also, early tumor detection can assist clinicians to treat patients timely.

Table 3 Summary of existing works related to ML and DL techniques for medical imaging

Full size table

5 Modalities for medical image

Research question 2 (refer subsection 3.1) is addressed in this section, various medical image modalities (I-Scan-2, CT-Scan, MRI, X-Ray, Mammogram and Electrocardiogram (ECG)) used for classifying the diseases in the primary studies are shown in Table 4. As observed, following modalities were used for the evaluation of medical data using ML and DL techniques.

MRI: It uses magnetic resonance for obtaining electromagnetic signals. These signals are generated from human organs, which further reconstructs information about human organ structure [91]. MRIs with high resolution have more structural details which are required to locate lesions and disease diagnosis.
CT-Scan: It is a technology which generates 3-D images from 2-D X-Ray images using digital geometry [88].
Mammogram: For the effective breast cancer screening and early detection of abnormalities in the body, mammograms are used. Calcifications and masses are considered as the most common abnormalities resulting in breast cancer [5].
ECG: It is used to measure the heart activity electrically and to detect the cardiac problems in humans [8, 9, 105].

Table 4 Modalities for medical imaging and digital signal

Full size table

6 Tools and techniques

This section addresses research question 3 (refer subsection 3.1). After a thorough analysis of primary studies, various techniques (refer Table 6) and tools (refer Fig. 4) related to ML and DL techniques for healthcare were identified [67, 89]. It was observed that techniques have used scanned images with the help of image modalities such as MRI, CT-Scan, X-Rays, and so on. Also, in order to automate the process of image segmentation and classification, programming languages like R, MATLAB and Python were used to obtain accurate results. The subsections 6.1 and 6.2 precisely explain the tools and techniques used in primary studies for medical images, respectively.

6.1 Tools used for medical images

Figure 4 depicts the percentage of various tools (Table 5) used in the primary studies for the implementation of ML and DL models where MATLAB and NumPy have the percentage of 38 and 37, respectively, which signify the popularity of these tools among researchers. R and TensorFlow are the second most used tools with a percentage of 13 and 12, respectively.

Table 5 Tool description

Full size table

6.2 Techniques used for medical images

This subsection includes the description and identification of the most common ML and DL techniques (i) used for disease classification, detection and diagnosis, (ii) based on type of disease, and (iii) used for EEG and MEG data processing.

6.2.1 Description of techniques

CNN: It is a combination of DNNs which comprises three components, used to analyze the images (Refer Fig. 5). The components of CNN are as follow:
1. a.
  Convolutional Layer: It is responsible to apply the filters systematically to create feature maps for summarizing features present in the input image.
2. b.
  Pooling Layer: It is used for ordering the repeated layers in a model. It operates on each feature map, received from the convolutional layer, to produce a new set of feature maps pooled together. Pooling operation is used to reduce the feature map size with required pixels or values in each feature map, hence, reducing the overfitting problem. It consists of two main functions namely, average pooling and maximum pooling.
3. c.
  Fully-Connected Layer: It is simply the feed-forward neural network where input is received from the final pooling layer. Based on the extracted features, a fully connected layer predicts the image class.
ANN: The flowchart shown in Fig. 6 depicts the working of ANN architecture. The model extracts the data required from the input image to further normalize it accordingly. While the images are normalized, random weights are assigned to all the connections present in the network. Furthermore, the dataset is divided in the ratio of 80:20. Then the training algorithm is selected for the error attainment, and if errors are identified then the weights should be recalibrated. At last, the model is tested and validated for further evaluation.
TL: It introduces a concept of transferring selected features from a predefined model to another model for solving the problem. It selects the feature or learns from the previous model and applies the features or learning to the new model to address different issues. ML models are capable of addressing only one specific task, however, TL can be applied to more than one problem making it more reliable and efficient (Refer Fig. 7).
RF: Fig. 8 illustrates the working of RF algorithm, where the algorithm randomly selects the data from a given dataset. Further, the GINI index given in Eq. 1 is applied to select the best possible split of the dataset. The splitting is applied to the dataset until the dataset becomes too small for splitting.

$$\boldsymbol{G}\boldsymbol{I}\boldsymbol{N}\boldsymbol{I} \left(\boldsymbol{T}\right)=1-{\sum }_{j=1}^{n}{p}_{{j}^{2}}$$
(1)
where,
T:

dataset

n:

number of classes

p_j:

relative frequency of class j in T
DT: It is a supervised ML algorithm which divides the problem into small sub problems. It consists of root node, internal node and leaf node. As shown in Fig. 9, root node, internal node, and leaf node depict the optimized version of the best selected feature, new subsets or features and outcome of each internal node, respectively.
SVM: It is a supervised ML algorithm used for the classification and regression problems. It is well known for predicting the class of unknown data. Also, it categorizes the unknown data into one of the two categories based on the labeled dataset (Refer Fig. 10).

6.2.2 ML and DL techniques

Table 6 summarizes ML and DL techniques such as Naïve bayes [43, 69], KNN [6], DTs [36, 48], neural networks, and SVM [59, 73, 90] which are used for medical imaging in primary studies. Here, column 1 represents articles and row 1 represents various techniques. Further, “+” and “-” signify that the article does and does not support the corresponding technique, respectively. The most reliable ML and DL techniques based on the type of disease are shown in Table 7. The most significant ML and DL techniques for EEG and MEG data processing are shown in Table 8.

Table 6 ML and DL techniques used for medical imaging

Full size table

Table 7 ML and DL techniques based on the type of disease

Full size table

Table 8 ML and DL techniques for EEG and MEG data processing

Full size table

7 Dataset description

Following section addresses the research question 4 (refer subsection 3.1) by providing the details of the datasets used in primary studies for implementing ML and DL algorithms. Table 9 summarizes the description of dataset(s) such as MRI, X-Rays, lesion data, infra-red images and CT-Scan. The accessibility to a dataset is divided as (i) public (available at online repositories), and (ii) own created (created by the authors).

Table 9 Dataset description

Full size table

8 Experimental description

Research question 5 (refer subsection 3.1) is addressed in this section. MRI dataset is used for the experiments to show the comparative analysis of ML classifiers and DL models. Dataset¹ description and experimental setup are discussed in subsections 8.1 and 8.2, respectively. Similarly, the methodology and results are discussed in subsections 8.3 and 8.4, respectively.

8.1 Dataset description

The experiments to classify the brain tumor include the publicly available tumor dataset. (https://www.kaggle.com/datasets/masoudnickparvar/brain-tumor-mri-dataset). The MRI dataset contains the 711 images of meningioma tumor and no tumor. Dataset is divided into two parts: testing and training with different image resolutions.

8.2 Experimental setup

The whole series of experiments were performed on a 64-bit computer with an Intel(R) 221 Core(TM) i3-10110U CPU @ 2.10 GHz 2.59 GHz, 8GB RAM. To train and validate the model, code was implemented in python language in Google colab platform.

8.3 Methodology

Figure 11 depicts the methodology used in the experiments for disease classification. It is described as follows:

1.
Import dataset: Dataset¹ is retrieved from the public website which is divided into two categories namely: no tumor and meningioma tumor. The dimensions of images given in the dataset were different from one another, which was further resized to 200 × 200.
2.
Label dataset: Dataset is labeled in the form of 0 and 1, where 0 and 1 indicate the data having no tumor and data having meningioma tumor, respectively.
3.
Split dataset: Further, the dataset is splitted in the ratio of 80:20 for training (80%) and testing (20%) dataset.
4.
Feature scaling and feature selection: ML algorithms work on numbers without knowing what the number represents. Feature scaling helps to resolve the given problem by scaling the features into a specific defined range, so that one feature does not dominate the other one. In this experiment, PCA technique is used to reduce the feature count and select the required features.
5.
Apply ML classifiers: For this experiment, ML classifiers (SVM, RF, DT, LR) and DL models (CNN, ResNet50V2) are used, which further classified the dataset into two categories i.e., 0 and 1.
6.
Prediction and testing the model: The model was tested with testing data (20% of the dataset) and predicted the disease accurately for the given dataset.
7.
Metrics calculation: The prediction for dataset using classifiers is illustrated with the help of a confusion matrix shown in Fig. 13. It calculates the four parameters, TP, TN, FP and FN, along with the accuracy metrics.

8.4 Results

This subsection discusses the results obtained by ML classifiers as shown in Fig. 12; Table 10. In Fig. 12a, b, c, and d illustrate the confusion matrix obtained from SVM, LR, RF, and DT, respectively. Table 10 shows the values of accuracy obtained after implementing the considered ML classifiers and DL models for the MRI dataset. The results show that CNN and RF have better accuracy with 97.6% and 96.93%, respectively.

Table 10 Accuracy results for MRI images using ML classifier/DL model

Full size table

9 Analytical discussion

The primary studies were analyzed based on the publisher citation count, year wise publications, keywords, various diseases, techniques, imaging modalities and type of publication.

9.1 Publisher by citations

A schematic view of the influential publishers in the concerned domain is presented by the citations of the articles published in it. Figure 13 shows all the publishers considered for this review in between 2014 and 2022. Moreover, it depicts the number of citations of ML and DL articles with respect to the publishers in Fig. 14a and b, respectively. Due to many types of indexing procedures along with time, there is a variation in the count of citations in Google Scholar. It was observed that most of the articles for ML and DL were published in ScienceDirect and IEEE publishers with the maximum citation 2425 and 42,866, respectively.

9.2 Scholarly articles published between 2014 and 2022

In this subsection, Fig. 14 depicts that out of 40 primary studies, the most published articles for ML were from the year 2020 with a count of 10, which is equivalent to 25% of the total. Followed by the year 2021 with 8 (20%), 2019 with 6 (15%), and 2022 with 4 (10%). Other years like 2017 and 2018 have the same count of 3 with 7.5%, 2014 and 2016 have the same count of 2 with 5%, and 2015 has the count of 1 with 2.5%. Thus, it can be observed that the maximum number of articles for primary study was considered from the year 2020 and minimum from 2014 to 2017.

9.3 Most commonly used keywords in the primary studies

Word cloud is the simple way to identify the relevant terms and themes utilized in the referenced research articles. Figure 15 depicts the word cloud which represents larger font for the most often used keywords and smaller font for less frequent keywords.

9.4 Disease types

Figure 16 depicts the percentage of multiple diseases diagnosed in the primary studies. As observed, breast disease is the most common disease with the highest percentage (21%) among all. Brain tumor took the second place (18%) followed by diabetes (16%) and lung disease (16%). Also, other diseases such as eye, liver, skin, hepatitis and cancer were diagnosed using various techniques.

9.5 Publication by techniques and statistical analysis of techniques

It was observed that researchers have used multiple techniques to attain better results as shown in Table 5. For classification, ML classifiers like SVM, RF and Naïve bayes were combinedly used for the same. Detection was performed using neural networks such as ANN or CNN, and TL was performed frequently due to its capability of breaking down the large datasets. Figure 17 depicts the percentage of various techniques used in primary studies. It summarizes that SVM (20%) is the most widely used technique for medical image classification.

The statistical analysis of ML and DL techniques for medical diagnosis is represented in Fig. 18.

9.6 Imaging modalities

Figure 19 demonstrates the multiple image modalities used for the evaluation of medical images. However, MRI/X-Ray dominates the subject area with 45%. The second most used modality is CT-Scan (30%), followed by mammogram (10%) and I-Scan-2 (10%). Moreover, to automate the process of retrieving and analyzing the features, computer modalities such as CAD was included for the detection of hepatitis and cancer [55, 60].

9.7 Type of publication

Figure 20 illustrates the distribution of articles according to the type of publications considered for this review. Majority of the articles were considered from journals with 70%, book Chaps. (8%), conference proceeding papers (7%), workshop articles (2%) and others (13%) including the society articles, online database articles, articles from publications like Bentham Science, springer archives and the transcript.

10 Discussion

From this study, it was observed that the variability in the literature occurred due to uncertainty of the evaluated data and models (refer Fig. 21). Data uncertainty was caused due to the multiple sources such as transmission noise, missing values and measurement noise. Whereas, model uncertainty was observed due to the less understanding of architecture and prediction of future data with parameters. The observed uncertainty was helpful to attain different results with various methods. Recently, many advanced technologies were introduced to attain enormous amounts of raw data in different scenarios.

Further, while reviewing the literature, it has been observed that focusing on every aspect of data (noisy or clear) is important as it impacts the results. The utilization of an appropriate algorithm to analyze images can be used for increasing the success ratio. Thus, variation in expected standard results is due to the use of raw data which may incorporate a certain amount of noise (refer Fig. 22). CNN is not much sensitive to the noise due to which it can extract information from noisy data [44]. Moreover, Hermitian basis functions were used for extracting the accumulated data from the ECG data which reduce the effects of Gaussian noise.

Therefore, dealing with the uncertainty of data and models with ML along with DL techniques is the most important issue to be handled by researchers. These techniques are useful for obtaining accurate and better results for decision making in every respective domain [2, 3, 45, 64, 75, 93]. Therefore, there is a need to deal with the variance in ML and DL algorithms such as RF, Rubber Sheet Normalization, DT, bagging-boosting, ANN, CNN, SVM, TL, Bayes Net, and GLCM. Further, such strategies can be used to deal with ambiguity in medical data for achieving high performance. Based on this review, it has been observed that medical professionals may be able to treat tumors promptly if they are identified early.

11 Conclusions and future work

This study provides an overview of various ML and DL approaches for the disease diagnosis along with classification, imaging modalities, tools, techniques, datasets and challenges in the medical domain. MRI and X-Ray scans are the most commonly used modalities for the disease diagnosis. Further, among all the tools and techniques studied, MATLAB and SVM dominated, respectively. It was observed that MRI dataset is widely used by researchers. Also, a series of experiments using MRI dataset has provided a comparative analysis of ML classifiers and DL models where CNN (97.6%) and RF (96.93%) have outperformed other algorithms. This study indicates that there is a need to include denoising techniques with DL models in the healthcare domain. It also concludes that various classical ML and DL techniques are extensively applied to deal with data uncertainty. Due to the superior performance, DL approaches have recently become quite popular among researchers. This review will assist healthcare community, physicians, clinicians and medical practitioners to choose an appropriate ML and DL technique for the diagnosis of disease with reduced time and high accuracy.

Future work will incorporate DL approaches for the diagnosis of all diseases considering noise removal from any given dataset. The additional aspects and properties of DL models for medical images can be explored. To increase the accuracy, enormous amount of data is required, therefore, the potential of the model should be improved to deal with large datasets. Also, different data augmentation techniques along with required features of the dataset can be explored to attain better accuracy.

Data availability

Data sharing is applicable to this article as MRI datasets was analyzed during the current study.

References

Abdulbaqi AS, Younis MT, Younus YT, Obaid AJ (2022) A hybrid technique for EEG signals evaluation and classification as a step towards neurological and cerebral disorders diagnosis. Int J Nonlinear Anal Appl 13(1):773–781
Google Scholar
Aceto G, Persico V, Pescapé A (2018) The role of information and communication technologies in healthcare: taxonomies, perspectives, and challenges. J Netw Comput Appl 107:125–154. https://doi.org/10.1016/j.jnca.2018.02.008
Article Google Scholar
Acharya UR, Fujita H, Oh SL, Hagiwara Y, Tan JH, Adam M (2017) Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. Inf Sci 415:190–198. https://doi.org/10.1016/j.ins.2017.06.027
Article Google Scholar
Aggarwal 1LP (2019) Data augmentation in dermatology image recognition using machine learning. Skin Res Technol 25(6):815–820. https://doi.org/10.1111/srt.12726
Article Google Scholar
Al-Najdawi N, Biltawi M, Tedmori S (2015) Mammogram image visual enhancement, mass segmentation and classification. Appl Soft Comput 35:175–185. https://doi.org/10.1016/j.asoc.2015.06.029
Article Google Scholar
Altan G, Kutlu Y, Allahverdi N (2019) Deep learning on computerized analysis of chronic obstructive pulmonary disease. IEEE J Biomed Health Inf 24(5):1344–1350. https://doi.org/10.1109/JBHI.2019.2931395
Article Google Scholar
Anbeek P, Vincken KL, Van Bochove GS, Van Osch MJ, van der Grond J (2005) Probabilistic segmentation of brain tissue in MR imaging. NeuroImage 27(4):795–804. https://doi.org/10.1109/TMI.2014.2366792
Article Google Scholar
Arya R, Kumar A, Bhushan M (2021) Affect recognition using brain signals: a survey. In: Computational methods and data engineering. Springer, Singapore, pp 529–552. https://doi.org/10.1007/978-981-15-7907-3_40
Arya R, Kumar A, Bhushan M, Samant P (2022) Big five personality traits Prediction using brain signals. Int J Fuzzy Syst Appl (IJFSA) 11(2):1–10. https://doi.org/10.4018/IJFSA.296596
Article Google Scholar
Available at: https://techblog.cdiscount.com/a-brief-overview-of-automatic-machine-learning-solutions-automl/
Available at: Noisy Data in Data Mining | Soft Computing and Intelligent Information Systems (ugr.es)
Beam AL, Kohane IS (2018) Big data and machine learning in health care. JAMA 319(13):1317–1318. https://doi.org/10.1001/jama.2017.18391
Article Google Scholar
Bhatt C, Kumar I, Vijayakumar V, Singh KU, Kumar A (2021) The state of the art of deep learning models in medical science and their challenges. Multimedia Syst 27(4):599–613. https://doi.org/10.1007/s00530-020-00694-1
Article Google Scholar
Bhattacharya P, Tanwar S, Bodkhe U, Tyagi S, Kumar N (2019) Bindaas: Blockchain-based deep-learning as-a-service in healthcare 4.0 applications. IEEE Trans Netw Sci Eng 8(2):1242–1255
Article Google Scholar
Bhattacharyya A, Pachori RB, Upadhyay A, Acharya UR (2017) Tunable-Q wavelet transform based multiscale entropy measure for automated classification of epileptic EEG signals. Appl Sci 7(4):385
Article Google Scholar
Bhushan M, Goel S (2016) Improving software product line using an ontological approach. Sādhanā 41(12):1381–1391
Article MathSciNet MATH Google Scholar
Bhushan M, Goel S, Kumar A, Negi A (2017) Managing software product line using an ontological rule-based framework. In: 2017 International Conference on Infocom Technologies and Unmanned Systems (Trends and Future Directions)(ICTUS). IEEE, pp 376–382
Bhushan M, Goel S, Kaur K (2018) Analyzing inconsistencies in software product lines using an ontological rule-based approach. J Syst Softw 137:605–617
Article Google Scholar
Bhushan M, Goel S, Kumar A (2018) Improving quality of software product line by analysing inconsistencies in feature models using an ontological rule-based approach. Expert Syst 35(3):e12256
Article Google Scholar
Bhushan M, Negi A, Samant P, Goel S, Kumar A (2020) A classification and systematic review of product line feature model defects. Software Qual J 28(4):1507–1550
Article Google Scholar
Bhushan M, Kumar A, Samant P, Bansal S, Tiwari S, Negi A (2021) Identifying quality attributes of FODA and DSSA methods in domain analysis using a case study. In: 2021 10th international conference on System Modeling & Advancement in Research Trends (SMART). IEEE, pp 562–567
Bhushan M, Duarte JÁG, Samant P, Kumar A, Negi A (2021) Classifying and resolving software product line redundancies using an ontological first-order logic rule based method. Expert Syst Appl 168:114167
Article Google Scholar
Caballé-Cervigón N, Castillo-Sequera JL, Gómez-Pulido JA, Gómez-Pulido JM, Polo-Luque ML (2020) Machine learning applied to diagnosis of human diseases: a systematic review. Appl Sci 10(15):5135. https://doi.org/10.3390/app10155135
Article Google Scholar
Cabitza F, Rasoini R, Gensini GF (2017) Unintended consequences of machine learning in medicine. JAMA 318(6):517–518. https://doi.org/10.1001/jama.2017.7797
Article Google Scholar
Caliskan A, Badem H, Basturk A, YUKSEL M (2017) Diagnosis of the parkinson disease by using deep neural network classifiers. IU-J Electr Electron Eng 17(2):3311–3318
Google Scholar
Chaganti SY, Nanda I, Pandi KR, Prudhvith TG, Kumar N (2020) Image classification using SVM and CNN. In: 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA). IEEE, pp 1–5
Currie G, Hawk KE, Rohren E, Vial A, Klein R (2019) Machine learning and deep learning in medical imaging: intelligent imaging. J Med imaging radiation Sci 50(4):477–487. https://doi.org/10.1016/j.jmir.2019.09.005
Article Google Scholar
De Bruijne M (2016) Machine learning approaches in medical image analysis: from detection to diagnosis. Med Image Anal 33:94–97. https://doi.org/10.1016/j.media.2016.06.032
Article Google Scholar
Fatima M, Pasha M (2017) Survey of machine learning algorithms for disease diagnostic. J Intell Learn Syst Appl 9(01):1. https://doi.org/10.4236/jilsa.2017.91001
Article Google Scholar
Feng Y, Xiao W, Wu T, Zhang J, Xiang J, Guo H (2021) An automatic identification method for the blink artifacts in the magnetoencephalography with machine learning. Appl Sci 11(5):2415
George ST, Subathra MSP, Sairamya NJ, Susmitha L, Premkumar MJ (2020) Classification of epileptic EEG signals using PSO based artificial neural network and tunable-Q wavelet transform. Biocybern Biomed Eng 40(2):709–728
Article Google Scholar
Ghassemi M, Naumann T, Schulam P, Beam AL, Chen IY, Ranganath R (2020) A review of challenges and opportunities in machine learning for health. AMIA Summits on Translational Science Proceedings, 2020, 191. Available at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233077/
Giger ML (2018) Machine learning in medical imaging. J Am Coll Radiol 15(3):512–520. https://doi.org/10.1016/j.jacr.2017.12.028
Article Google Scholar
Greenspan H, Van Ginneken B, Summers RM (2016) Guest editorial deep learning in medical imaging: overview and future promise of an exciting new technique. IEEE Trans Med Imaging 35(5):1153–1159. https://doi.org/10.1109/TMI.2016.2553401
Article Google Scholar
Grover S, Bhartia S, Yadav A, Seeja KR (2018) Predicting severity of Parkinson’s disease using deep learning. Procedia Comput Sci 132:1788–1794
Article Google Scholar
Gupta R, Pandey G, Chaudhary P, Pal SK (2020) Machine learning models for government to predict COVID-19 outbreak. Digit Government: Res Pract 1(4):1–6. https://doi.org/10.1038/s41746-020-00372-6
Article Google Scholar
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3(Mar):1157–1182
MATH Google Scholar
Hoque MZ, Keskinarkaus A, Nyberg P, Seppänen T (2021) Retinex model based stain normalization technique for whole slide image analysis. Comput Med Imaging Graph 90:101901. https://doi.org/10.1016/j.compmedimag.2021.101901
Article Google Scholar
Houssein EH, Emam MM, Ali AA, Suganthan PN (2021) Deep and machine learning techniques for medical imaging-based breast cancer: a comprehensive review. Expert Syst Appl 167:114161. https://doi.org/10.1016/j.eswa.2020.114161
Article Google Scholar
Itälinna V (2021) Detecting mild traumatic brain injury from MEG data using normative modelling and machine learning
Jaiswal AK, Banka H (2018) Epileptic seizure detection in EEG signal using machine learning techniques. Australas Phys Eng Sci Med 41(1):81–94
Article Google Scholar
James AP, Thiruvenkadam S, Paul JS, Braun M (2014) Special issue on medical image computing and systems. Inf Fusion. https://doi.org/10.1016/j.inffus.2013.12.002
Article Google Scholar
Kaplan A, Cao H, FitzGerald JM, Iannotti N, Yang E, Kocks JW … Mastoridis P (2021) Artificial intelligence/machine learning in respiratory medicine and potential role in asthma and COPD diagnosis. J Allergy Clin Immunol Pract 9(6):2255–2261. https://doi.org/10.1016/j.jaip.2021.02.014
Karimi D, Dou H, Warfield SK, Gholipour A (2020) Deep learning with noisy labels: exploring techniques and remedies in medical image analysis. Med Image Anal 65:101759. https://doi.org/10.1016/j.media.2020.101759
Article Google Scholar
Kashani MH, Madanipour M, Nikravan M, Asghari P, Mahdipour E (2021) A systematic review of IoT in healthcare: applications, techniques, and trends. J Netw Comput Appl 192:103164. https://doi.org/10.1016/j.jnca.2021.103164
Article Google Scholar
Kaur P, Singh G, Kaur P (2018) A review of denoising medical images using machine learning approaches. Curr Med imaging 14(5):675–685. https://doi.org/10.2174/1573405613666170428154156
Article Google Scholar
Kedia S, Bhushan M (2022) Prediction of mortality from heart failure using machine learning. In: Proceedings of the 2nd International Conference on Emerging Frontiers in Electrical and Electronic Technologies (ICEFEET), pp 1–6. https://doi.org/10.1109/ICEFEET51821.2022.9848348
Khan SI, Qaisar SM, Pachori RB (2022) Automated classification of valvular heart diseases using FBSE-EWT and PSR based geometrical features. Biomed Signal Process Control 73:103445. https://doi.org/10.1016/j.bspc.2021.103445
Article Google Scholar
Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering
Kitchenham B, Brereton OP, Budgen D, Turner M, Bailey J, Linkman S (2009) Systematic literature reviews in software engineering–a systematic literature review. ‎Inf Softw Technol 51(1):7–15
Article Google Scholar
Krittanawong C, Johnson KW, Rosenson RS, Wang Z, Aydar M, Baber U … Narayan SM (2019) Deep learning for cardiovascular medicine: a practical primer. Eur Heart J 40(25):2058–2073. https://doi.org/10.1093/eurheartj/ehz056
Kostas D, Pang EW, Rudzicz F (2019) Machine learning for MEG during speech tasks. Sci Rep 9(1):1–13
Article Google Scholar
Kwekha-Rashid AS, Abduljabbar HN, Alhayani B (2021) Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Appl Nanosci:1–13. https://doi.org/10.1007/s13204-021-01868-7
Lahmiri S, Shmuel A (2019) Accurate classification of seizure and seizure-free intervals of intracranial EEG signals from epileptic patients. IEEE Trans Instrum Meas 68(3):791–796
Article Google Scholar
Latif J, Xiao C, Imran A, Tu S (2019) Medical imaging using machine learning and deep learning algorithms: a review. In: 2019 2nd International conference on computing, mathematics and engineering technologies (iCoMET). IEEE, pp 1–5. https://doi.org/10.1109/ICOMET.2019.8673502
Lee JG, Jun S, Cho YW, Lee H, Kim GB, Seo JB, Kim N (2017) Deep learning in medical imaging: general overview. Korean J Radiol 18(4):570–584. https://doi.org/10.3348/kjr.2017.18.4.570
Article Google Scholar
Ma Y, Xu X, Yu Q, Zhang Y, Li Y, Zhao J, Wang G (2019) LungBRN: a smart digital stethoscope for detecting respiratory disease using bi-resnet deep learning algorithm. In: 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS). IEEE, pp 1–4. https://doi.org/10.1109/BIOCAS.2019.8919021
Matsumoto T, Kodera S, Shinohara H, Ieki H, Yamaguchi T, Higashikuni Y … Komuro I (2020) Diagnosing heart failure from chest X-ray images using deep learning. Int Heart J 61(4):781–786. https://doi.org/10.1536/ihj.19-714
Nadakinamani RG, Reyana A, Kautish S, Vibith AS, Gupta Y, Abdelwahab SF, Mohamed AW (2022) Clinical data analysis for prediction of cardiovascular disease using machine learning techniques. Comput Intell Neurosci 2022:1–13. https://doi.org/10.1155/2022/2973324
Nagarajan SM, Muthukumaran V, Murugesan R, Joseph RB, Munirathanam M (2021) Feature selection model for healthcare analysis and classification using classifier ensemble technique. Int J Syst Assur Eng Manag:1–12. https://doi.org/10.1007/s13198-021-01126-7
Negi A, Kaur K (2017) Method to resolve software product line errors. In: International conference on information, communication and computing technology. Springer, Singapore, pp 258–268
Oh SL, Ng EY, Tan S, Acharya UR (2018) Automated diagnosis of arrhythmia using a combination of CNN and LSTM techniques with variable length heart beats. Comput Biol Med 102:278–287. https://doi.org/10.1016/j.compbiomed.2018.06.002
Article Google Scholar
Oh SL, Ng EY, Tan S, Acharya UR (2019) Automated beat-wise arrhythmia diagnosis using modified U-net on extended electrocardiographic recordings with heterogeneous arrhythmia types. Comput Biol Med 105:92–101. https://doi.org/10.1016/j.compbiomed.2018.12.012
Article Google Scholar
Pal S, Hitchens M, Varadharajan V, Rabehaja T (2019) Policy-based access control for constrained healthcare resources in the context of the internet of things. J Netw Comput Appl 139:57–74. https://doi.org/10.1016/j.jnca.2019.04.013
Article Google Scholar
Pal S, Mishra N, Bhushan M, Kholiya PS, Rana M, Negi A (2022) Deep learning techniques for prediction and diagnosis of diabetes mellitus. In: 2022 International Mobile and Embedded Technology Conference (MECON). IEEE, pp 588–593. https://doi.org/10.1109/MECON53876.2022.9752176
Pantazis D, Adler A (2021) MEG source localization via deep learning. Sensors 21(13):4278
Pathan S, Bhushan M, Bai A (2020) A study on health care using data mining techniques. J Crit Reviews 7(19):7877–7890. https://doi.org/10.31838/jcr.07.19.896
Patidar S, Panigrahi T (2017) Detection of epileptic seizure using Kraskov entropy applied on tunable-Q wavelet transform of EEG signals. Biomed Signal Process Control 34:74–80
Article Google Scholar
Pham L, McLoughlin I, Phan H, Tran M, Nguyen T, Palaniappan R (2020) Robust deep learning framework for predicting respiratory anomalies and diseases. In: 2020 42nd annual international conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, pp 164–167. https://doi.org/10.1109/EMBC44109.2020.9175704
Pirrone D, Weitschek E, Di Paolo P, De Salvo S, De Cola MC (2022) EEG Signal Processing and supervised machine learning to early diagnose Alzheimer’s Disease. Appl Sci 12(11):5413
Article Google Scholar
Plis SM, Hjelm DR, Salakhutdinov R, Allen EA, Bockholt HJ, Long JD, Calhoun VD (2014) Deep learning for neuroimaging: a validation study. Front NeuroSci 8:229. https://doi.org/10.3389/fnins.2014.00229. (516)
Article Google Scholar
Poongodi M, Hamdi M, Malviya M, Sharma A, Dhiman G, Vimal S (2022) Diagnosis and combating COVID-19 using wearable Oura smart ring with deep learning methods. Personal Uniquit Comput 26(1):25–35. https://doi.org/10.1007/s00779-021-01541-4
Article Google Scholar
Rai P, Kumar BK, Deekshit VK, Karunasagar I, Karunasagar I (2021) Detection technologies and recent developments in the diagnosis of COVID-19 infection. Appl Microbiol Biotechnol 105(2):441–455. https://doi.org/10.1007/s00253-020-11061-5
Article Google Scholar
Rana M, Bhushan M (2022) Advancements in healthcare services using deep learning techniques. In: 2022 International Mobile and Embedded Technology Conference (MECON). IEEE, pp 157–161. https://doi.org/10.1109/MECON53876.2022.9752020
Ray PP, Dash D, De D (2019) Edge computing for internet of things: a survey, e-healthcare case study and future direction. J Netw Comput Appl 140:1–22. https://doi.org/10.1016/j.jnca.2019.05.005
Article Google Scholar
Samant P, Agarwal R (2018) Comparative analysis of classification based algorithms for diabetes diagnosis using iris images. J Med Eng Technol 42(1):35–42. https://doi.org/10.1080/03091902.2017.1412521
Article Google Scholar
Samant P, Agarwal R (2018) Machine learning techniques for medical diagnosis of diabetes using iris images. Comput Methods Programs Biomed 157:121–128. https://doi.org/10.1016/j.cmpb.2018.01.004
Article Google Scholar
Samant P, Agarwal R (2019) Analysis of computational techniques for diabetes diagnosis using the combination of iris-based features and physiological parameters. Neural Comput Appl 31(12):8441–8453. https://doi.org/10.1007/s00521-019-04551-9
Article Google Scholar
Samad MD, Wehner GJ, Arbabshirani MR, Jing L, Powell AJ, Geva T … Fornwalt BK (2018) Predicting deterioration of ventricular function in patients with repaired tetralogy of Fallot using machine learning. Eur Heart J Cardiovasc Imaging 19(7):730–738. https://doi.org/10.1093/ehjci/jey003
Saminu S, Xu G, Shuai Z, Abd El Kader I, Jabire AH, Ahmed YK … Ahmad IS (2021) A recent investigation on detection and classification of epileptic seizure techniques using EEG signal. Brain Sci 11(5):668
Schoepf UJ, Zwerner PL, Savino G, Herzog C, Kerl JM, Costello P (2007) Coronary CT angiography. Radiology 244(1):48–63. https://doi.org/10.1148/radiol.2441052145
Article Google Scholar
Selvikvåg Lundervold A, Lundervold A (2018) An overview of deep learning in medical imaging focusing on MRI. arXiv e-prints, arXiv-1811. https://doi.org/10.1016/j.zemedi.2018.11.002
Shakeel PM, Burhanuddin MA, Desa MI (2020) Automatic lung cancer detection from CT image using improved deep neural network and ensemble classifier. Neural Comput Appl:1–14
Shan J, Alam SK, Garra B, Zhang Y, Ahmed T (2016) Computer-aided diagnosis for breast ultrasound using computerized BI-RADS features and machine learning methods. Ultrasound Med Biol 42(4):980–988. https://doi.org/10.1016/j.ultrasmedbio.2015.11.016
Article Google Scholar
Sharma M, Pachori RB (2017) A novel approach to detect epileptic seizures using a combination of tunable-Q wavelet transform and fractal dimension. J Mech Med Biology 17(07):1740003
Article Google Scholar
Shen D, Wu G, Suk HI (2017) Deep learning in medical image analysis. Annu Rev Biomed Eng 19:221–248. https://doi.org/10.1007/978-3-030-33128-3_1
Article Google Scholar
Singh SN, Bhushan M (2022) Smart ECG monitoring and analysis system using machine learning. In: Proceedings of the 2022 IEEE VLSI Device Circuit and System (VLSI DCS), pp 304–309. https://doi.org/10.1109/VLSIDCS53788.2022.9811433
Singh VK, Kolekar MH (2022) Deep learning empowered COVID-19 diagnosis using chest CT scan images for collaborative edge-cloud computing platform. Multimed Tools Appl 81:3–30. https://doi.org/10.1007/s11042-021-11158-7
Article Google Scholar
Singh VJ, Bhushan M, Kumar V, Bansal KL (2015) Optimization of segment size assuring application perceived QoS in healthcare. In: Proceedings of the world congress on engineering, vol 1, pp 1–3
Spadarella G, Perillo T, Ugga L, Cuocolo R (2022) Radiomics in cardiovascular disease imaging: from pixels to the heart of the problem. Curr Cardiovasc Imaging Rep:1–11. https://doi.org/10.1007/s12410-022-09563-z
Strijkers GJ, Drost MR, Nicolay K (2011) Diffusion MRI: theory, methods, and applications. Diffusion imaging in muscle, pp 672–689
Sun J, Cao R, Zhou M, Hussain W, Wang B, Xue J, Xiang J (2021) A hybrid deep neural network for classification of schizophrenia using EEG Data. Sci Rep 11(1):1–16
Google Scholar
Sworna NS, Islam AM, Shatabda S, Islam S (2021) Towards development of IoT-ML driven healthcare systems: a survey. J Netw Comput Appl 196:103244. https://doi.org/10.1016/j.jnca.2021.103244
Article Google Scholar
Tajbakhsh N, Shin JY, Gurudu SR, Hurst RT, Kendall CB, Gotway MB, Liang J (2016) Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging 35(5):1299–1312. https://doi.org/10.1109/TMI.2016.2535302. (2191)
Article Google Scholar
Tzimourta KD, Tzallas AT, Giannakeas N, Astrakas LG, Tsalikakis DG, Angelidis P, Tsipouras MG (2019) A robust methodology for classification of epileptic seizures in EEG signals. Health Technol 9(2):135–142
Article Google Scholar
Vamathevan J, Clark D, Czodrowski P, Dunham I, Ferran E, Lee G … Zhao S (2019) Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 18(6):463–477. https://doi.org/10.1038/s41573-019-0024-5
Virmani J, Agarwal R (2020) Deep feature extraction and classification of breast ultrasound images. Multimedia Tools Appl 79(37):27257–27292. https://doi.org/10.1007/s11042-020-09337-z
Article Google Scholar
Virmani J, Agarwal R (2021) A review of Segmentation Algorithms Applied to B-Mode breast ultrasound images: a characterization Approach. Arch Comput Methods Eng 28(4):2567–2606. https://doi.org/10.1007/s11831-020-09469-3
Article Google Scholar
Wang X, Gong G, Li N (2019) Automated recognition of epileptic EEG states using a combination of symlet wavelet processing, gradient boosting machine, and grid search optimizer. Sensors 19(2):219
Article MathSciNet Google Scholar
Wang Y, Choi EJ, Choi Y, Zhang H, Jin GY, Ko SB (2020) Breast cancer classification in automated breast ultrasound using multiview convolutional neural networks with transfer learning. Ultrasound Med Biol 46(5):1119–1132
Article Google Scholar
Wani SM, Sabut S, Nalbalwar SL (2019) Detection of epileptic seizure using wavelet transform and neural network classifier. In: Computing, communication and signal processing. Springer, Singapore, pp 739–747
Widodo CE, Adi K, Gernowo R (2020) Medical image processing using python and open cv. In: Journal of Physics: Conference Series, vol 1524, no 1. IOP Publishing, pp 012003. https://doi.org/10.1088/1742-6596/1524/1/012003
Willemink MJ, Koszek WA, Hardell C, Wu J, Fleischmann D, Harvey H … Lungren MP (2020) Preparing medical imaging data for machine learning. Radiology 295(1):4–15. https://doi.org/10.1148/radiol.2020192224
Xu Y, Mo T, Feng Q, Zhong P, Lai M, Eric I, Chang C (2014) Deep learning of feature representation with multiple instance learning for medical image analysis. In: 2014 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 1626–1630. https://doi.org/10.1109/ICASSP.2014.6853873
Yanowitz FG (2012) Introduction to ECG interpretation. LDS Hospital and Intermountain Medical Center
Zubarev I, Zetter R, Halme HL, Parkkonen L (2019) Adaptive neural network classifier for decoding MEG signals. NeuroImage 197:425–434
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, DIT University, Dehradun, India
Meghavi Rana & Megha Bhushan

Authors

Meghavi Rana
View author publications
You can also search for this author in PubMed Google Scholar
Megha Bhushan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Megha Bhushan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Rana, M., Bhushan, M. Machine learning and deep learning approach for medical image analysis: diagnosis to detection. Multimed Tools Appl 82, 26731–26769 (2023). https://doi.org/10.1007/s11042-022-14305-w

Download citation

Received: 01 August 2022
Revised: 01 November 2022
Accepted: 10 December 2022
Published: 24 December 2022
Issue Date: July 2023
DOI: https://doi.org/10.1007/s11042-022-14305-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine learning and deep learning approach for medical image analysis: diagnosis to detection

Abstract

Similar content being viewed by others

Medical image analysis based on deep learning approach

Enhancing Medical Diagnosis Through Deep Learning and Machine Learning Approaches in Image Analysis

Medical Image Analysis Using Machine Learning and Deep Learning: A Comprehensive Review

1 Introduction

2 Background

2.1 Machine learning

2.2 Deep learning

3 Review technique

3.1 Research questions

3.2 Source material

3.3 Search criteria

3.4 Quality assessment

3.5 Data extraction

4 ML and DL techniques for medical imaging

4.1 Breast disease

4.2 Brain disease

4.3 Lung disease

4.4 Diabetes

4.5 Multiple disease detection

5 Modalities for medical image

6 Tools and techniques

6.1 Tools used for medical images

6.2 Techniques used for medical images

6.2.1 Description of techniques

6.2.2 ML and DL techniques

7 Dataset description

8 Experimental description

8.1 Dataset description

8.2 Experimental setup

8.3 Methodology

8.4 Results

9 Analytical discussion

9.1 Publisher by citations

9.2 Scholarly articles published between 2014 and 2022

9.3 Most commonly used keywords in the primary studies

9.4 Disease types

9.5 Publication by techniques and statistical analysis of techniques

9.6 Imaging modalities

9.7 Type of publication

10 Discussion

11 Conclusions and future work

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation