Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges

Hassan, Jasmin; Saeed, Safiya Mohammed; Deka, Lipika; Uddin, Md Jasim; Das, Diganta B.

doi:10.3390/pharmaceutics16020260

Open AccessReview

Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges

by

Jasmin Hassan

^1,†

,

Safiya Mohammed Saeed

^1,†

,

Lipika Deka

²

,

Md Jasim Uddin

^3,*

and

Diganta B. Das

^4,*

¹

Drug Delivery & Therapeutics Lab, Dhaka 1212, Bangladesh

²

Faculty of Computing, Engineering and Media, De Montfort University, Leicester LE1 9BH, UK

³

Department of Pharmaceutical Technology, Faculty of Pharmacy, Universiti Malaya, Kuala Lumpur 50603, Malaysia

⁴

Department of Chemical Engineering, Loughborough University, Loughborough LE11 3TU, UK

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Pharmaceutics 2024, 16(2), 260; https://doi.org/10.3390/pharmaceutics16020260

Submission received: 8 December 2023 / Revised: 29 January 2024 / Accepted: 7 February 2024 / Published: 9 February 2024

(This article belongs to the Section Drug Delivery and Controlled Release)

Download

Browse Figures

Versions Notes

Abstract

:

The use of data-driven high-throughput analytical techniques, which has given rise to computational oncology, is undisputed. The widespread use of machine learning (ML) and mathematical modeling (MM)-based techniques is widely acknowledged. These two approaches have fueled the advancement in cancer research and eventually led to the uptake of telemedicine in cancer care. For diagnostic, prognostic, and treatment purposes concerning different types of cancer research, vast databases of varied information with manifold dimensions are required, and indeed, all this information can only be managed by an automated system developed utilizing ML and MM. In addition, MM is being used to probe the relationship between the pharmacokinetics and pharmacodynamics (PK/PD interactions) of anti-cancer substances to improve cancer treatment, and also to refine the quality of existing treatment models by being incorporated at all steps of research and development related to cancer and in routine patient care. This review will serve as a consolidation of the advancement and benefits of ML and MM techniques with a special focus on the area of cancer prognosis and anticancer therapy, leading to the identification of challenges (data quantity, ethical consideration, and data privacy) which are yet to be fully addressed in current studies.

Keywords:

machine learning; mathematical modeling; cancer; computational oncology; tumor; telemedicine; carcinoma

1. Introduction

Over the past several centuries, humans have made remarkable strides in the treatment of different lethal diseases; however, cancer management is still a terrifying challenge worldwide due to the aftermath in both the physical and mental health of the individuals involved [1,2,3]. There were an estimated 18.1 million newly diagnosed cancer patients worldwide in 2018, and nearly 9.6 million people died of cancer in that year [4]. The numbers for newly diagnosed patients and total deaths increased to 19.3 million and 10 million, respectively, in 2020 [5,6]. The diversity in types of cancer is the reason behind this growing rate of newly diagnosed cases and deaths each year, which leads scientists worldwide to adopt varied approaches to cancer treatment [2,7]. Figure 1 shows more detailed information regarding the newly diagnosed cases and death percentages among men and women, separately, in different geographical areas worldwide.

Currently, immunotherapy, radiotherapy, chemotherapy, targeted therapy, and surgery are the main first-line treatments for cancer [8,9]. Among them, radiotherapy, chemotherapy, and surgery fail to treat in cases of metastatic and last-stage tumors [10]. Moreover, immunotherapy has shown very restricted clinical outcomes due to factors such as the heterogeneity of cancer cells, inadequate antigen presentation, and immune checkpoint upregulation, etc. [11,12]. All of these drawbacks make it imperative to come up with a different approach that will save time and improve the effectiveness of the treatment process. Furthermore, when it comes to cancer diagnostic techniques, the visual examination and manual interpretation of biomedical images used to be typically performed before; however, these diagnostic techniques require extended time and are highly susceptible to errors. On top of that, early cancer detection is essential for saving an individual’s life. Considering these points, researchers in the early 1980s started to adopt computer-aided approaches for cancer diagnosis, prognosis, and treatment [13,14,15,16,17].

As per the documentation found, it can be observed that the conceptualization of machine learning (ML) algorithms was observed around the year 1958 [18]. Since then, ML has been revolutionized substantially from only a subject of curiosity in the research laboratory to an empirical technology that is being used worldwide for practical purposes these days [19]. However, the use of mathematical modeling (MM) in biological sciences is far older than the advent of ML. This technique was first applied by Daniel Bernoulli, who formulated MM to evaluate the effectiveness of the variolation of healthy people with the smallpox virus in 1760 [20]. After that, thousands of articles that use MM have been published, and approximately 97% have been published since 1990 [21]. Since biomedical problems can be explained through MM to evaluate and predict future medical conditions, this method has gradually gained popularity because of its potential [22]. Figure 2 describes the history behind the evolution of MMs in the study of biomedical problems.

The history behind the association of ML with the field of biomedical science is long and intricate [23]. Currently, ML has created a broad wave throughout the healthcare system by assisting physicians worldwide in making quicker and more accurate clinical decisions [24,25,26,27]. Continuous technological evolution in Artificial Intelligence (AI) and ML has led the direction towards self-determining disease diagnosis tools by using big datasets to measure the future obstacles in detecting human diseases at a primal stage, especially in cancer [28,29].

Some studies show that sometimes patients are not bothered about specific diagnostic information. Rather, they are more interested in discussing prognosis, the time duration of their diseased state, the side effects of treatment, life span, the impact of cancer on their daily life, and the possibility of a cure, alongside financial liability. This puts pressure on physicians to make more accurate decisions regarding treatment and cure [30,31,32,33,34]. In addition, biomedical researchers are in search of a technological tool that can find and detect different patterns in a complex dataset and also uncover the relationships among them, and, at the same time, efficiently prognosticate the treatment outcome of a particular type of cancer, which has made them happily accept ML technology in cancer research [15].

1.1. General Concepts of Machine Learning (ML) and Mathematical Modeling (MM)

1.1.1. ML

ML has transpired as a subset of the artificial intelligence (AI) discipline as the method of choice for establishing empirical computer software for the purpose of natural language processing, visual and speech recognition, robot control, and many other applications [19,28]. In a broader sense, ML refers to the category of algorithms/models which help to develop tools and methods for pattern identification within datasets. The identified patterns can be used afterwards: (i) as an aid to enhance the existing knowledge regarding the current situation of the world, for example—in the biomedical field, for the risk factor identification of an infectious diseases, and (ii) predicting the future, for example—predicting the possibility of getting infected for an individual [23,35,36]. In short, the task of ML is to utilize algorithms to parse existing real-world datasets, assimilate the knowledge as an ML model, and subsequently utilize the developed model in conjunction with new data to perform the task intended, which may be classification or forecasting, etc. Some popular ML methods are decision trees (DT), the support-vector machines (SVM) algorithm, the naïve bayes (NB) algorithm, the k-nearest neighbor (KNN) algorithm, the random forest algorithm (RF), and neural networks (NN), etc. A DT is a visual representation that shows decisions and their outcomes in the shape of a tree. The graph’s edges reflect the decision, whereas the nodes indicate an event or option, regulations, or requirements [37]. Using an NB classifier, we suppose that the presence of a particular feature in a class is independent of any additional feature. NB focuses mostly on the text categorization sector. The principal applications are clustering, and the conditional probability determines the classification’s purpose of occurrence. SVMs in ML are supervised learning models with related learning algorithms that examine data used for regression and classification analyses. SVMs may effectively perform non-linear classification in addition to linear classification by implicitly translating their inputs into high-dimensional feature spaces. This technique is known as the kernel trick. In essence, it draws lines between the classes. The margins are designed to have the longest possible distance between them and the classes, which minimizes classification error. KNN is a supervised ML technique that can be used to tackle classification and regression issues. It is simple to use and comprehend, but it has the critical problem of becoming noticeably slower as the amount of data in use increases [38].

It is said that the ML process comprises no less than 80% of data processing along with cleaning and 20% of application of algorithms. Hence, the predictive accuracy of any ML approach depends upon the availability of a notable amount of high-quality data. [39,40]. Furthermore, to obtain an accurate outcome, an ML-based tool is required to be trained by all possible types of data generated from diverse clinical activities (e.g., diagnostic images, test results, chemical screening, identification of diseases, diagnostic errors, treatment, after treatment outcome, and unwanted events, etc.) [24,26,41,42,43,44]. Figure 3 describes the basic working objective of one of the popular ML methods, the NN algorithm.

Neural network (NN) algorithms [28], one of the most popular ML approaches, mimics the human brain in order to come up with an end result. Deep learning (DL) is another subfield of ML which is considered to be an advanced and sophisticated form of NN suitable for the identification of objects and images, the processing of languages, improvement in disease diagnosis, drug discovery, and precision medicines, and aiding humans in making clinical decisions. Another significance of DL is that, from its prior experiences, it can propose an output as well [28,45,46]. Moreover, with an artificial neural network (ANN), it can also analyze data that contain medical images mimicking the neuronal architecture of humans and consists of an input, output, and variety of hidden multi-layer networks to improve the ML processing ability [47,48].

Real-world data are mostly inconsistent. Since AI algorithms generally deal with large datasets in the biomedical field, it becomes essential to shape these data into being submission worthy through data pre-processing (DP), so that it can be inputted into the desired ML algorithm for further processing and predictive results [49,50]. The DP procedure involves three segments, which are: (i) data reduction, (ii) data projection, and (iii) missing data treatment, and these segments involve multiple methods as well [51]. To understand these methods, we also need to understand the following terms along with them.

A dataset is basically a set of information that is provided to train the ML tool for predicting future events [52]. It forms the foundation for training and analyzing ML models. Also, it is engaged in the fundamental role of further developing the process. Moreover, it gives information regarding the issues of a particular field and the approach methods to develop algorithms out of the process of data collection, construction, and allocation [53].

Principal component analysis (PCA) holds the most importance, being a classical tool for the analysis of datasets. It can be implemented on a given raw dataset without any prior training. This trait of PCA enables computers with lower specifications to perform better [54]. In PCA, the equation for a measured matrix is

M \in ℝ^{(n_{1} \times n_{2})}

. Here, n₁ signifies the number of sample dataset and n₂ signifies the number of variables. PCA has two more variants, such as robust principal component analysis (RPCA) and kernel principal component analysis (KPCA) [55]. Recently, Kang et al. proposed one more PCA method called self-paced principal component analysis (SPCA) [56]. These PCA methods are discussed briefly in this paper for the completeness of the discussion.

RPCA: This is the refined version of PCA by the decomposition of a measured matrix [57,58]. For instance, if a measured matrix is M,

M \in ℝ^{(n_{1} \times n_{2})}

(1)

RPCA decomposes Equation (1) into a lower-grade matrix L and a sparse matrix S. Hence,

L \in ℝ^{(n_{1} \times n_{2})}

(2)

S \in ℝ^{(n_{1} \times n_{2})}

(3)

By decoding the principal component pursuit (convex program) [55,59]:

m i n_{L, S} ‖ L ‖_{*} + λ ‖ S ‖_{1 subject to} M = L + S

(4)

Here, ‖⋅‖_∗ = norm of the matrix

‖⋅‖₁ = L₁-norm of a matrix

and λ = tuning parameter, as interpreted in [55,58].

We can evaluate the value of λ using the following equation:

λ = \frac{1}{\sqrt{m a x (n_{1}, n_{2})}}

(5)

From the estimated value, we can further fine tune it [55].

KPCA: This is the refined version of PCA as well. Using a kernel function, it maps raw data into a new (feature) space F and, subsequently, the classical PCA algorithm is applied in F [60,61]. Typically, PCA performs well when it is linear variation of the data, whereas its performance is bad with non-linear variation of the data. As per the Cover’s theorem, a non-linear dataset sample obtains a linear form after it is mapped to the feature space F, which can be delineated by a kernel function. KPCA can handle both linear and non-linear data [62].

The data collection process is costly. Moreover, due to the carelessness of the associated teams, missing data seem to appear quite frequently in a dataset [63,64,65]. Missing data have a notable impact on the performance of ML algorithms [66]. On account of this, it is required to implement the missing data treatment (MDT), which involves the deletion of missing data or replacement with their estimates [63,67,68,69]. The MDT can be executed by applying two strategies, such as (a) the deletion method and b. imputation methods. The deletion method has two types, which are (i) listwise deletion and (ii) pairwise deletion, and the imputation method includes many types such as regression imputation, mean imputation, hot-deck imputation, and cold-deck imputation, etc. [51,64,69,70,71].

Scaling is a vital step for ML techniques, which refers to the transformation of feature data, as per defined commands, to the same scaled data that have similar level of impact. Therefore, the technique is correctly encrypted to the selected units [51,72]. Generally, two intervals are used as scaling targets, which are [0, 1] and [−1, 1], which are interpreted as follows,

[0, 1] interval = \frac{actual Value - \min (all Values)}{\max (all Values) - \min (all Values)}

(6)

[- 1, 1] interval = \frac{actual Value - (\max (all Values) + \min (all Values)) / 2}{(\max (all Values) - \min (all Values)) / 2}

(7)

It was observed from a survey conducted by Huang et al. that the [0, 1] interval was considered to be used for scaling by default in most of the studies [51].

Despite the fact that ML is useful for intensive data analysis, this study shows that the use of high-dimensional data, also known as multivariate data, for training and analyzing decreases its reliability and performance. Multivariate data are common in datasets and make it hard to contrive information from the analyzed data for ML algorithms [73,74,75]. Trying to solve the complex problems of the real world using these multivariate data without an increase in sample size causes blind spots (continuous sections of feature space without any scrutiny), which creates challenges while developing the model. This lack of ability of ML to manage multivariate data is familiar, as the curse of dimensionality is one of the major drawbacks of ML [76,77]. For instance, the Watson supercomputer was trained using a small sample size of 106 ovarian cancer cases and 635 lung cancer cases. A small multivariate data sample would be highly inclined to cause a blind spot here. Now, if data from this blind spot come across after deployment, an inappropriate treatment recommendation can be produced, which is not detected while developing the model [76].

A dataset contains data points (objective) which are described through a number of features (variables, where the number of variables is fixed). These features can be of two types: (a) continuous (continuous numerical values) and (b) categorical (only discrete values). Mostly, this categorical feature is merely binary and functions by true or false, in binary, 1 or 0, respectively [78]. Raw data can act discontinuously during the computation of continuous sensors (see Figure 4) because of the diverse range of parameter variations and high dimensionality because of multi-sensor computation.

Dimensionality reduction is necessary to avoid the misestimation of original data via ML algorithms. Datasets must be pre-processed for the prognosis of any human disease. At this point, dimensionality reduction has an important role in multivariate data’s dimension reduction. It requires the mapping of higher-dimensional inputs into lower dimensions so that almost identical points in the input space are mapped into neighboring points on the manifold learning, which is the procedure of non-linear dimensionality reduction, and another is linear dimensionality reduction [79,80]. Apart from that, there are two ways to reduce dimensionality, which are, (a) feature selection and (b) feature extraction.

Mathematically, for instance, there is an n-dimensional vector denoted as X.

X = [x₁, x₂, …. X_n]^T

(8)

Now, X is mapped to an m-dimensional vector denoted as Y through a map f, where,

Y = [y₁, y₂, …. Y_m]^T

(9)

and, m < < n

(10)

The condition is m-dimensional vector Y, which should contain the principal features of n-dimensional vector X.

So, the mapping function can be expressed as:

Y = f (X)

(11)

This is the fundamental mathematical process of feature selection (FS) and feature extraction (FE) [81,82]. Here, mapping f is the algorithm that needs to be found for feature reduction, and the choice differs depending on the pending real-world problems [83].

FS is a process of feature subset selection that is implemented on the construction of a model [84]. FS differs from dimensionality reduction, as FS does not change the original features but simply selects a subset, whereas dimensionality reduction could comprise creating new features to preserve all features. The prior condition of using FS is to delete the redundant features of data without losing the necessary information. The purposes of using FS are, (a) model simplification to make it user-friendly to interpret, (b) cutback on run time, (c) subside curses of dimensionality, and (d) variance reduction [83]. On the other hand, FE produces (map) new features of data from the original ones. Its benefit is the efficient compression of the mapped features, and its drawback is the mapped feature set might lose meaning, although the original one has a clear structural meaning. FE executes two functions that are (i) necessary detail separation from redundant data, and (ii) classifier performance reduction via decreasing the dimension [85,86]. Some commonly used FS/FE techniques are PCA, linear discernment analysis (LDA), discrete Fourier transform (DFT), factor analysis (FA), independent component analysis (ICA), and an autoencoder [79,81,84,87].

Classification is the activity of predicting the dependent variables by analyzing the parameters and values of different features in a set of independent variables (Figure 5). A variety of parameters is learned by a classifier from a training dataset. A distinct dataset is returned by a classifier afterwards. If the values of this dataset can be made to be mutually exclusive, in that case, they are termed as class, and they do not need to be mutually exclusive, in that case, they are called label. As an example, the residue of a protein is supposed to be in only one of the multiple secondary structure classes. However, at the same time, it could be assigned to the non-exclusive labels of being transmembrane and ⍺-helical. They are generally denoted by an encoding. A trained classifier is basically a representation of the interconnection between the voxels (raw/test data) and the class label from a training dataset. Mathematically, if the voxel denoted as x and classifier denoted as function f that predicts the class label denoted as y, then this can be demonstrated as: y = f(x) [78,88,89]. There are a variety of classifiers in ML that are used in cancer research which will be mentioned later on.

From the aforementioned sequence of discussion regarding the techniques or terms used in the ML algorithms throughout Section 1.1.1, we can obtain an idea of their working processes.

1.1.2. MM

MM can be defined as the art of representing real-world issues through mathematical terms to predict a probable future or provide an insight [90]. It is customarily a schematization of the real-world situation and, therefore, based on the objectives and the available variables, it is possible to develop different MMs for the same incident [91,92]. MM is not new in the field of cancer research (Figure 6). There are many hypotheses with respect to cancer which are required to be tested in the laboratory and run in in vivo experiments. However, this is time-consuming, costly, and sometimes not possible, also due to the lack of appropriate technology or the potential involvement of humans. In such conditions, MM plays an alternate role in checking the potential of these hypotheses. If the hypotheses fail to exhibit the claims, then they must be revised before proceeding further towards the lab [93,94].

One of the initial MMs in cancer research is that of Armitage and Doll’s multistage theory, which explains the series of mutations a cell undergoes before becoming cancerous. It also mentions that the risk of cancer development grows with the power of a person’s age. Supposing the power of age is five, then a cell has to go through four stages to become cancerous [95,96,97,98]. The mathematical illustration of this model is as follows [99]:

1.

I = \frac{N_{p_{1,} p_{2,} p_{3,} p_{4,} \dots \dots \dots p_{r}}}{r - 1!} t^{r - 1}

(12)

2.

I = N_{p_{1}} (1 - e^{p \frac{2}{k} (e^{k t} - 1)})

(13)

Here, k = a constant

N = Number of cells at risk

I = Cancer incidence

p = Probability of change in any cell at any age

r = Number of changes [99].

Even though this theory provides an outstanding demonstration of the cancer incidence rate in terms of stomach, pancreas, and colon cancer, it is unfit to demonstrate others, which include prostate and breast cancer. Moreover, it does not present any mechanistic insight regarding the bio-functional changes accountable for the progression of cancer [100,101]. This Armitage and Doll’s theory was motivated by the study of the mortality statistics of cancer, which was proposed by Nordling in 1953 [98].

Discussing historical models, the linear-quadratic model is one of the classical MMs in radio biology, which gives us insight into the relationship between cell survival probability and a single dose of radiation. Around 50 years have passed since this model was first proposed, and eventually, it has become the method of choice for both researchers and clinicians for characterizing the effects of radiation on cells [102]. It can be represented mathematically as follows:

S = e^{- α D - β D^{2}}

(14)

Here, S = A single dose of radiation

α = Linear parameter for radio-sensitivity of a cell at a low dose

β = Quadric parameter for radio-sensitivity of a cell at a high dose

D = Dose at which cells are vulnerable [102].

The same as Armitage and Doll’s theory, this linear-quadratic model also has limitations. In an article published in 2020, Loap and co-workers mentioned the linear-quadratic model as being questionable at the tumor level, since this model talks about the probability of surviving a single dose of radiation therapy for a given single cell type. This is a fact, because when it comes to tumor cells, the real scenario is far more complex than we can even think, due to the complexity of the micro-environment a single cell carries [103].

These early models seem quite simple compared to the MM models being developed to analyze solid tumor growth at present [104,105]. We will discuss the newly formulated approaches with their applications in the study of cancer prognosis and treatment later.

In this review article, we will discuss both the ML and MM approaches recently undertaken to improve cancer prognosis and treatment procedures and their related challenges.

2. Paradigms of ML

Supervised and unsupervised approaches are the two main paradigms of ML [15,23]. In supervised learning, a trained dataset is applied to labeled data with the purpose of formulating a system that is capable of accurately predicting the type of raw dataset based on available features. It is also applied to predict the categorical (class/label) characteristic, which is known as discriminant analysis, or the continuous characteristic, which is known as regression analysis. Contrarily, in supervised learning, no class/labeled data are provided while mapping the algorithm. Hence, it is capable of predicting the patterns of non-labeled data without being trained by a predefined class/label dataset [15,23,106,107]. Another type of ML is semi-supervised learning, which is the intersection of supervised and unsupervised learning and learns from both the labeled and unlabeled types of data [108]. Figure 7 provides an overview of these three ML categories with their further classification.

2.1. Supervised ML

In supervised learning, the classifier returns a value of doubt or an outlier. Doubt indicates when the decision making is unclear. For example, it is unsure of the new data and which class/label they would be fitted to assign. An outlier indicates the unlikeliness of new data compared to any of the previously observed data, which makes their possibility to be predicted as something certain questionable [23].

Classification supervised learning identifies certain entities and examines them to determine how to categorize them when labeled, whereas regression is also a kind of supervised learning that gains knowledge from labeled datasets to forecast continuous results for various inputs in an algorithm. To comprehend the connection between reliable and independent variables, regression is employed. It is frequently employed in situations where the output must be a finite value, such as when determining a person’s height or weight, etc. Further subcategories include:

For classification: (i) DT; (ii) SVM; (iii) NB; (iv) NN; (v) KNN; (vi) RF; and (vii) linear classifiers; and for regression: (i) simple linear; (ii) multiple linear; (iii) polynomial; (iv) SVM regression; and (v) logistic [109]. Supervised ML learning in biological and healthcare innovations demands knowledge-based health management systems, quality data consciousness, and expertise. There is a known workflow for supervised ML learning methods in the field of healthcare, established by Roy and co-workers. They illustrated the following workflow: (a) data collection (structured/non-structured data); (b) data processing (implement, foresee, understand, and take appropriate action); (c) outcome (achievement rate and its emerging issues); (d) evaluation (analysis, execution, and its subsequent response); and (e) validation (thorough correction and verification in light of further data) [110].

2.2. Unsupervised ML

In unsupervised learning, the purpose is to inspect the data and identify similarities among them. These similarities characterize a group of data, which is referred to as a cluster. To be precise, it is used for the revelation of naturally occurring groupings in the data. So, in this case, no data are labeled, and the learning procedure comprises characterizing the nonlabeled data by matching the raw data with them [23,111].

For evaluating high-dimensional data, such as those from transcriptomic, metabolomic, and proteomic research, clustering is frequently utilized. The key influences on the readouts and modules with a high degree of coregulation are often identified using hierarchical clustering. Non-hierarchical clustering is used in single-cell sequencing to recognize the different cell types present in the sample. In order to find associations between individuals, tissues, illnesses, or even disease symptoms, clustering is also employed. In order to direct drug discovery, drug compounds may also be grouped based on the characteristics of target proteins, such as sensitivity and gene expression [112,113,114,115,116]. In transcriptomic and other -omics investigations, dimensionality reduction is frequently utilized to find outliers and possible batch implications. Dimensionality reduction may also be employed as a pre-processing stage to improve the algorithmic efficiency of an ML model or to comprehend the high-dimensional chemical space [117,118].

2.3. Reinforced Learning

Reinforcement learning is a semi-supervised ML technique that focuses on engaging with its surroundings by taking action, learning from its mistakes, and identifying patterns. It tries several behaviors to see the ones that maximize the cumulative reward in an environment instead of being given instructions on what to do, whereas supervised ML employs a collection of input and output data pairs for training. Q-learning, hierarchical reinforcement learning algorithms, temporal difference learning, and policy gradient algorithms are a few examples of well-known reinforcement learning algorithms. The summary of long texts is one real-world business use of reinforcement learning [111,119].

Although reinforcement learning has been around for a while, its applications in healthcare are just beginning to shine. Recently, a sensor-assisted pump was shown to be outperformed by a closed-loop control system created by combining an ML-type control algorithm with structural PK/PD models that are already in use and are well-known to pharmacometricians [120]. Another study by Popova et al. [121] focused on using a variety of ML methods to combine novel molecules via traditional reinforcement learning, of which 95% of the molecules obtained were practical [121]. One study employed reinforcement learning to tailor anemia therapy via pharmacological means [122]. Reinforcement learning was employed by Turki and Taguchi [123] to speed up the process of finding potentially helpful medications. They showed that the process of identifying drugs lasted only 46 days, which is a significant reduction from the time required by traditional approaches. They concluded that various algorithm coding changes still need to be performed to assure that synthesized chemicals have distinct formulae from items currently on the market [123].

3. ML and MM Approaches in Healthcare

The purpose of ML and MM is to create predictive outcomes of a certain phenomenon [124]. ML utilizes data and algorithms shaped by using MMs to simulate how people learn. Over time, as more good data are submitted to ML approaches, their accuracy also increases. By utilizing data collected from patients and minimizing human engagement in analysis, ML is utilized in healthcare to enhance the efficacy and overall quality of treatment. A useful feature of ML is that, once it understands the method to solve a given problem, it provides the solution at a faster rate, as its algorithm has the information implanted, with zero chances of error. These features strengthen the base for ML to facilitate healthcare services and may assist in varied clinical capabilities, from giving medical assistance to the overall automatization of the entire clinical system for the ease of task management, in order to minimize human intervention [125,126,127].

ML and MM can operate healthcare management and services with full efficacy, as there is a specific goal, and its utilization will provide the quickest but most efficient way to come up with a solution, thus reducing human time and effort for menial tasks. By removing the human’s role from the analyzing system, these technologies can decrease mistakes and execute repeated tasks with more efficiency than manual efforts [128,129].

The same ML approach can be used to solve varied problems. According to Goecks et al., in a clinical context, ML will be used to analyze high-fidelity imaging and molecular tests so that medical practitioners can find notable biomarkers to obtain a final diagnosis [130]. Multiscale modeling and automated search results for comparable patient conditions will be used to help make treatment decisions for an unknown disease [131]. Following diagnosis and treatment, health management recommences with continuing personal health monitoring, and for that, an ML system must achieve many objectives, which includes tracking the patient’s response to therapy, keeping an eye out for potential negative effects, tracking general health, and tracking the changes from the starting point that are unrelated to the course of treatment [130,132,133].

3.1. Discovery of New Drug Molecule

Introducing a new drug molecule to the pharma market is a vast area of research in the field of both biomedical and pharmaceutical sciences. There are obstacles in the way of drug discovery and development, for example, high expenditure, consumption of time, off-target delivery, lower efficiency, complex omics data, and lengthy clinical trial phases. Nowadays, new drug molecules are being introduced to the market due to the successful applications of ML and MM in varied phases of drug discovery and development by researchers (see Figure 8). The advancement of ML algorithms makes the whole process rationale cost effective and, all in all, more beneficial to humankind [134].

Target identification and priority setting are the first steps in the conventional target discovery process. This involves the discovery of a target with a causal relationship with some component of a pathophysiology and a convincing argument for the idea that modulating this target will modulate the illness itself [135]. Target identification is, without a doubt, an important step along this road, even if evidence of a successful treatment approach will initially come from in vivo drug response studies followed by demonstrating efficacy in a randomized clinical trial [136]. The discovery and verification of chemically active substances, target detection, protein production, the assessment of medicinal contaminants and physicochemical characteristics, medicinal surveillance, the assessment of medicinal effectiveness and efficacy, and drug reposition are all made possible by computational modeling built around ML principles [137].

For well-defined problems with a lot of useful data, ML techniques offer a set of tools that can enhance discovery and decision-making processes. ML applications are possible at every level of the drug development process. Examples include the discovery of prognostic biomarkers, target validation, and the evaluation of digital pathology data in drug trials. The creation and implementation of ML algorithms and software have begun at all stages of drug discovery and development, such as clinical trials, the identification of novel targets, strengthening proof for target–disease interactions, enhancing small-molecule compound design and optimization, increasing understanding of disease mechanisms, raising understanding of disease and non-disease phenotypes, constructing new biomarkers for prognosis, progression, and drug efficacy, and improving the analytical methods for biometric and other data from patient monitoring and wearable devices, improving digital pathology imaging as well as extracting high-content information from images at all levels of resolution [138,139,140,141,142,143]. Table 1 describes the features of ML tools used in drug discovery and design.

3.2. Prediction and Management of Global Pandemic

Pandemic management is way more costly than the cost of avoiding them. For instance, the global cost of the severe acute respiratory syndrome (SARS) outbreak was an estimation from USD 13 billion to 50 billion for the 2003′s single outbreak [150,151]. The use of MM and ML can aid at this point and, for this reason, in the recent COVID-19 pandemic, researchers applied varied approaches using ML and MM for prediction and management to minimize costs [152,153,154,155,156,157].

Several papers on accurate and early identification related to COVID-19 using ML along with MM have been published. Accumulated case reports, medical pictures, management techniques, personnel in the healthcare industry, demographics, and migration during the outbreak helped in preparing some of the datasets that are now available. ML and MM demonstrated themselves to be effective tools in the battle against this epidemic. Additionally, a number of COVID-19-related datasets have been gathered and made available as open source. Both ML and MM have exhibited some working models for the prediction and management of the COVID-19 pandemic. In the literature, several research projects were produced for the simulation of COVID-19 behavior and dissemination. For MM, the Susceptible–Exposed–Infected–Removed (SEIR) model and the Susceptible–Infected–Recovered (SIR) model were the main models upon which the majority of these were built. In the past, these models were heavily utilized to explore how epidemics spread across different types of transmission networks [158,159,160,161]. For ML, Naive Bayes (NB), Convolutional Neural Networks (CNN), Linear Discriminant Analysis (LDA), Logistic Regression (LR), Random Forest (RF), Support Vector Machines (SVM), and Decision Trees (DT) were a few of the supervised learning algorithms used to identify COVID-19. However, there is still more to be conducted to diversify these databases [162]. Table 2 summarizes the MM models studied for the prediction and management of COVID-19, whereas Table 3 shows the different ML algorithms employed for COVID-19, along with their accuracy.

3.3. Epigenomics

The study of genetics has made substantial use of ML techniques (Table 4) [197,198]. According to the Encyclopaedia of DNA Elements (ENCODE), a genomic sequence which either encodes a specific product (such as a protein or noncoding RNA) or has a recurrent biochemical attribute (such as being attached to a protein or bearing a specific biochemical mark) is referred to as a functional element. The second stage of the ENCODE project, which covers the whole genome, was explained by De Souza [199]. The majority of genomics databases, most notably those produced via sequencing, are usually made available to the public for free. The majority of genomics journals need public access identification for each dataset connected to a publication, serving as proof of this practice. This widespread embrace of data openness may be a reflection of how the field of genetics has developed over time [200]. A few databases and tools have been developed to make genomic analysis feasible, such as StemBase [201], PluriNetWork [202], FunGenES [203], Plurinet [204], iScMiD [205], and SyStemmCell [206]. But, generally, all these databases have substantial amounts of information about transcriptome measurements and leave other essential information unnoticed. Thus, there are calls for further data integration, which brings us to the Embryonic Stem Cell Atlas from Pluripotency Evidence (ESCAPE). This is a database that accommodates a great deal of diverse data, ranging from transcriptomics and epigenetics to proteomics and phosphor proteomics. These datasets are organized as protein–protein and gene–gene interactions, gene lists, and data tables. Hence, they can be easily downloaded and utilized as per the user’s desire [207]. Classifications of metastatic brain tumors, prostate cancer, coronary heart disease, neurodevelopmental disorders, and tumors of the central nervous system are a few examples for the utilization of epigenetic data by ML [208,209,210].

ML algorithms are applied to epigenetic data because of their characteristics, and support vector machines (SVM), random forests (RF), LASSO regression, non-metric multidimensional scaling, logistic regression, convolutional neural network, and stacked denoising autoencoders are some examples [211]. Additionally, numerous samples are made available through large-scale, data-rich archives like The Cancer Genome Atlas (TCGA), ENCODE, and the BLUEPRINT project, so that thorough, high-throughput statistical studies can be performed [212,213]. Such repositories might offer an ML algorithm’s training data or an independent test set to assess the algorithm’s external validity and eventual clinical applicability [214,215]. The creation of these databanks is a crucial step, since ML algorithms need an extensive amount of data to generate accurate predictions. The majority of datasets are made up of DNA methylation profiles that were obtained from peripheral blood; therefore, patients just need to give a tiny quantity of blood. The utilization of peripheral blood as a gauge of DNA methylation may be less beneficial in diseases like certain malignancies, with higher clinical relevance in conditions like obesity [216]. DNA methylation patterns that are tissue-specific should also be highlighted [217,218].

Table 4. ML models with different epigenomic purposes along with key challenges.

Area of Purpose	ML Tools	Prediction Details	Challenges	Reference(s)
Protein sequencing	ResNets, 2D convolutional neural networks (CNNs)	Structure	Data accessibility is tough, and leakage of these data make the evaluation tougher	[219]
	Multilayer perceptrons with windowing	Function		[220]
	Transformers	Protein–protein interaction		[221]
Gene sequencing	1D CNNs	Accessibility of genome	Genome contains repetition of codes	[222]
	Recurrent neural networks (RNNs)	Arrangement of 3D genome	Missing data of interest	[223]
	Transformers	Interactions between enhancer and promoter	Lengthy sequences	[224]
Genetic expression	Clustering	Intergenic interactions or co-expression	Link between function and co-expression is not clear	[225]
	CNNs	Intergenic interactions or co-expression	Multidimensional	[225]
	Autoencoders	Organizing transcription machinery	Loud noise	[226]
Interactions between proteins	GCNs	Side effects of poly-pharmacology	Networks for interactions can be incomplete	[227]
	Graph embedding	Protein function	Protein’s interaction depends on cellular location	[228]
	Graph embedding	Protein function	Number of possible combinations is higher	[228]

3.4. Protein Engineering

The goal of protein engineering is to create or find proteins with features that can be applied in technical, scientific, or medical fields, whereas the basic objective of ML-guided protein engineering is to locate variants that differ from wild-type sequencing in terms of a certain feature. The amino acid sequence of a protein affects factors associated with its function, including its expression level and catalytic activity. This link is reversed in protein engineering to identify a sequence that fulfils a certain function. However, the functional levels of closely related proteins cannot be distinguished using existing biophysical prediction techniques. Furthermore, the number of potential proteins is too huge for a thorough natural laboratory or computer search [229,230,231,232].

The phases of protein sequences and the corresponding functional measurements of the proteins are used to train ML models. What the model can learn depends on the examples that were used to develop the model. It is possible to choose the initial set of variations to screen randomly from a library [233] or to gather as much information as possible about potential mutations [234,235]. The most straightforward approach is typically to choose variations at random. However, for low-throughput screens, it might be crucial to optimize the information gained from expensive tests, because doing so would increase the model accuracy for undiscovered sequences. Maximizing the variety in the training sequences is approximately similar to maximizing knowledge about the rest of the library. The user must choose the sort of ML model to employ, train the data in a suitable way, and then train the model after gathering the first training data.

Decision trees (classification and regression trees), RFs, SVMs, and Gaussian process models are implemented in protein engineering [229]. Due to the promising outcome of ML in protein engineering, new ML models and techniques have been developed quickly, necessitating the creation of strict performance standards and criteria for model comparison. Comparing ML modeling techniques is challenging, since different datasets, train-test splits, and evaluation criteria have different effects on model performance [236]. Designing clever combinatorial libraries for controlled protein evolution is also an attractive use of ML in protein engineering [237].

4. ML Algorithms in Specific Types of Cancer

4.1. Lung Cancer

Lung cancer causes the maximum number of cancer-related fatalities. Its percentage in men is 29.2% for those aged from 45 to 64 and 22.8% for those aged 65 and above, and in women, 17.9% for those aged from 45 to 64 and 13.7% for 65 and older [238]. The diagnosis of lung cancer includes chest computed tomography, bronchoscopy, sputum cytology, video-assisted thoracoscopy, and lymph node biopsy, etc. [239]. Imaging tests that are computed tomography scans (CT scans), magnetic resonance imaging (MRI scans), and positron emission tomography scans (PET scans) have specific limitations. With imaging tests like CT and PET scans, the absence of repeatability is a well-known issue in radiomics. This is mostly due to the lack of defined procedures and settings throughout the workflow [240,241]. Numerous studies also experience significant constraints at the stage of validation. Inadequate statistical evaluation (such as failing to adjust the p-value across several tests) and/or the lack of a standalone validation dataset to support the findings are two examples. This might easily result in skewed discovery rates and increase type-I mistakes [242]. Publication bias also tends to exaggerate good outcomes in contrast to negative ones [243,244]. Expense and accessibility are two obstacles that discourage the general public from implementing CT scans. However, the chance of developing lung, breast, or thyroid cancer over time is also increased by low-dose radiation exposure, particularly in patients who have a history of having several CT scans. False-positive results from low-dose CT scans can force patients to undergo more intrusive procedures like biopsies and surgery to remove the abnormality, which comes with extra intra- and post-operative risks and problems [245,246]. Even sputum analysis shows similar limitations, which are: (a) inadequate sensitivity or accuracy during analysis and (b) failure to detect small-diameter carcinomas. However, these particular limitations could be overcome by immunostaining [247].

ML helps to minimize or overcome these limitations and go above and beyond to save time and minimize errors by increasing accuracy and sensitivity to the diagnosis. For early lung cancer diagnosis, some methods use the clinical data from the patient along with the observed texture properties of specific nodules in CT images as input features to train ML classifiers, such as Logistic Regression (LR) or Linear Discriminant Analysis (LDA), for estimating the likelihood of malignancy. Clinical factors include the patient’s age, gender, specimen collection timing, family history of lung cancer, and smoking exposure, etc. Usually, these metrics include nodule size, type, location, count, border, and emphysema information in CT scans [248,249,250]. One study focused on using six different ML classifiers (Support Vector Machine (SVM), Naïve Bayes, Random Forest, K-nearest neighbor, Neural Network with 10-cross fold methodology, and AdaBoost) to determine the best lung tumor forecast depending on metabolomic biomarkers features. They gathered a total of 110 lung cancer patients, as well as 43 individuals in good health. For early lung tumor prediction, Naive Bayes is advised as a useful method [251]. Large pan-cancer sequencing datasets from earlier ML research have demonstrated the effectiveness and progress of early detection and cancer type classification, which may support lung cancer diagnosis [252,253,254]. Cancer cells exhibit a wide range of genetic abnormalities, and the aggregation of these differences can serve as markers to identify the mutational patterns of various cancer types. To improve the accuracy of ML models, recent research has focused on obtaining better genetic signatures as input features. Blood-based liquid biopsy (cell-free DNA fragments, circulating tumor DNA, microRNA, methylation, exosomes, and circulating tumor cells) is regarded as a reliable technique for early identification to examine prospective circulating tumor markers [255,256]. Even Gould et al. performed a study on AUC, sensitivity, and diagnostic odds ratio, which are used as performance indicators in developing a model to predict a future diagnosis of lung cancer based on routine clinical and laboratory data. The model is then compared to traditional risk screening and eligibility criteria screening [257]. Li et al. also illustrated various ML applications for lung cancer, specifically in (a) early detection as well as diagnosis, (b) immunotherapy, and (c) treatment and survival prediction [258]. Currently, a Support Vector Machine (SVM) is working on a trending classifier. For one study, lung cancer patients were identified utilizing an SVM classifier according to their symptoms, and the Python programming language was also used to advance the model’s execution. The performance of the SVM model was assessed utilizing a number of different metrics. The suggested model was contrasted with the SVM and SMOTE techniques currently in use. Compared to the new approach, the current ones yield a 98.8% accuracy rate [259]. Some ML algorithms were employed for lung cancer via feature selection. Whitney et al. and Liang et al. chose the best indicators for model training using the least absolute shrinkage and selection operator (LASSO) technique [260,261]. To obtain better findings, a few investigations [262,263] integrated the -omics signatures with clinical markers. Many algorithms, including K-nearest neighbors (KNN), Naïve Bayes, SVMs, decision trees (DT), logistic regression, random forest, linear discriminant analysis, gradient boosting, and neural networks, have shown their capacity to successfully identify and classify various lung cancer patterns utilizing these specific types of tumor -omics signals. Li et al. even tabulated the studies performed by different researchers using different ML models and classifiers for the early detection and diagnosis of cancer [258]. Figure 9 summarizes all the applications of ML algorithms to date in lung cancer management.

4.2. Colon Cancer

The primary method for diagnosing colon cancer is colonoscopy, which is a surgical procedure, and diagnostic resources are typically limited. Additionally, making a diagnosis is a difficult procedure that involves a chain of interactions between the patient, the original consulting doctor, and the available healthcare technologies [264]. There are a number of datasets that are currently used in colon cancer diagnosis. To prevent overfitting, a technique for minimizing the number of features (genes) unrelated to the target illness is required. The likelihood of recovery from cancer is increased with detection at an early stage. In a study, Loey et al. chose the vital features from the provided data patterns using information gain (IG) and incorporated them into an ML system. The grey wolf optimization technique is then used to minimize the number of chosen features. A vital performance metric in illness diagnosis, classification accuracy, was used to assess it. In order to classify cancer types, an SVM classifier was used [265]. Figure 10 mentions the commonly used ML classifiers in colon cancer.

4.3. Pancreatic Cancer

There are certain instances in which it is challenging to separate the diagnostic results of pancreatic cancer from other benign pancreatic disorders. Given the varied treatment consequences, accurate diagnosis in these situations is essential [267]. The capability of ML models like K-nearest neighbor (k-NN), artificial neural networks (ANNs), and SVMs to extract distinctive signatures from medical scans that could potentially be utilized in the diagnosis of pancreatic cancer has been studied [268,269]. Some widely used classifiers for the diagnosis of pancreatic cancer include LR, RF, and SVM. The radiomics and RF classifier in research using CT scans from 190 patients with pancreatic ductal adenocarcinoma and 190 healthy subjects obtained 99.2% correctness in binary categorization [270]. Schultz and co-workers used microRNA panel recognition as a classifier for pancreatic cancer, utilizing two brand-new microRNA panels that combine 4 or 10 microRNAs from whole blood to detect pancreatic cancer [271].

4.4. Glioma

Glioma research using ML techniques has grown significantly in recent years. In one study, three datasets with three MRI sequences that are T1-weighted (T1W), T2-weighted (T2W), and fluid-attenuated inversion recovery (FLAIR) were created to improve the distinguishability between low-grade and high-grade gliomas. The same assessment values were obtained with six identical chosen features using the RF and LASSO methods [272]. In another study, a novel deep feature fusion-based multiclass brain tumor classification system was proposed by Kibriya and the team. As a preprocessing phase, a min–max normalization algorithm with data augmentation was applied. From transfer learning architectures like AlexNet, GoogleNet, and ResNet18, deep CNN features were taken out and combined to form one feature vector. On this feature vector, a classifier using SVM and KNN models was employed. The proposed framework was trained and assessed using a dataset of 15,320 MR images. The study’s findings show that the fused feature vector performed better than the component vectors. The new method also performed better than the existing methods, obtaining 99.7% accuracy [273]. Takahashi et al. attempted to gain preoperative (before surgical resection or biopsy) MR images from 951 adult diffuse glioma patients from 10 facilities in Japan. Of the 951 patients, 673 cases had at least one series of preoperative digital MR pictures, and 544 instances matched their criterion. The tumor tissues taken from these individuals were subjected to sequencing analysis after thorough clinical data collection. These pictures are referred to as the Japanese Cohort (JC) dataset. This is the largest glioma imaging collection with access to clinical data as well as genetic/epigenetic profiles after the further division of the 544 subjects, which were classified into three groups. Compared to the Multimodal Brain Tumor Image Segmentation Benchmark (BraTS) dataset, the number of patients included in the JC dataset was 1.6 times higher [274].

4.5. Skin Cancer

The difficulty of making an early diagnosis of melanoma, even by professionals, is a serious issue. As a result, doctors may find it useful to use a procedure that simplifies its diagnosis [275]. The use of processing images and machine vision technologies for various medical imaging applications has been growing rapidly in the last ten years. Currently, non-melanoma and melanoma skin cancer diagnosis and surgical planning employ supplementary imaging technologies most commonly [276].

A group of researchers created the PH² dataset to aid in the study of classification as well as segmentation techniques [277]. PH² is commonly employed as a dataset for evaluating skin disease detection algorithms. To dynamically diagnose and segment the dermoscopic pictures in PH², for instance, the SegNet framework was employed, and the classification accuracy was ultimately 94% [278]. It includes 200 color dermoscopy pictures (768 × 560 pixels in size) of three different skin conditions, namely melanomas, atypical nevi, and common nevi. Additionally, it has comprehensive medical annotations, including pathological diagnosis and the findings of lesion segmentation. BCN20000 is used to examine skin cancer lesions in challenging-to-diagnose areas, such as the mucous membranes and nails. In total, 5583 skin lesions and 19,424 dermoscopic pictures captured using high-resolution dermoscopy make up the BCN200005 dataset. From 2010 until 2016, they were all united together. The collector also used a number of computer vision algorithms to clean up the photographs of background noise along with other distractions [279]. Algorithms based on ML for skin cancer diagnosis are rapidly being developed using publicly available datasets of skin imaging data, such as those maintained by the International Skin Imaging Collaboration (ISIC) archive [280,281]. ML algorithms, nevertheless, are prone to overfitting with training data from restricted populations that are frequently curated retroactively, and their generalizability is significantly impacted by the participants and training pictures that are subject to selection bias [282]. Neural networks are one of the most widely utilized methods for image analysis. Deep neural networks, such as convolutional neural networks (CNNs), are often utilized in ML for healthcare purposes [283]. It is recommended to use a variety of ML approaches in a computerized decision framework to recognize melanoma skin lesions accurately and automatically. In the current study, the Clinical Proteomic Tumour Analysis Consortium Cutaneous Melanoma (CPTAC-CM) dataset and the International Skin Imaging Collaboration (ISIC 2019) dataset are both used [284,285].

4.6. Oral Cancer

There are several behavioral characteristics that oral cancer might exhibit. Early detection and precise prognostic prediction are crucial for correct and efficient oral cancer care. To achieve this, ML, a branch of AI, has been hailed for its capacity to change cancer management through enhanced diagnostic accuracy and the forecasting of outcomes [286]. The three factors crucial for early diagnosis and prognosis were discovered to be advantageous in the ML technique. These are more accurate forecasts of cancer susceptibility, recurrence, and survival [287], which increase survival rates by effectively managing patients [288,289,290]. Due to its viability and numerous benefits, the ML technique will continue to be used to identify oral cancer [291], forecast oral cancer recurrence, identify occult node metastases [292,293], and estimate oral cancer survival rates [294].

A study developed a 3DCNNs-based image processing system and compared it to a 2DCNNs-based algorithm for the early detection of oral malignancies. The same hierarchical structure was used to build the 3D and 2D CNNs to categorize oral cancers as benign or malignant. The results indicated that, for the discriminating of oral cancer lesions, 3DCNNs with dynamic features of the enhancement rate picture outperformed 2DCNNS with a single enhancing sequence. These findings suggest that the spatial dynamics and characteristics collected from 3DCNNs may guide the design of a future CT-assisted diagnostic system [295]. A research team created an ANN model that uses information on risk factors, overall health, and clinic-pathological characteristics to forecast a person’s likelihood of acquiring oral cancer. The ML model for forecasting was created using the well-known data mining technique, ANN. The model was constructed utilizing a total of 29 variables related to the patients. The training dataset had 54 (75%) instances, whereas the testing dataset contained 19 (25%) cases, and the prediction accuracy for oral cancer was 78.95%. Based on the datasets, ML techniques may aid in the detection and diagnosis of oral cancer [296].

According to López-Cortés et al., SVMs, ANNs, and LR were the primary algorithms used in the context of medical applications for oral cancer, making up 87.71% of all studies published on this subject. SVMs are one of the ML algorithms with the broadest range of usage. A total of 45.45% of all studies for diagnosis and prevention and 63.63% of all studies for malignant oral lesions (precancer) are focused on SVMs. Additionally, with 43.75% of the prognostic workload, LR was the method most commonly utilized. With algorithms like SVMs, ANNs, LR, and CNNs, ML is a potent technique that can accurately forecast outcomes to assist in diagnosis and prevention, prognosis, possibly malignant oral lesions (pre-cancer), treatment, and quality of life. It is essential to remember that not all existing algorithms are instinctual. The outputs produced by ANNs and SVMs are nonlinear and perplexing. In this way, unlike DT, which reveals the set of rules behind the classification, medical professionals frequently lack confidence in the outputs of clinical decision support systems because it is unclear how the algorithm generates the classification result [297].

5. MM Techniques in Specific Types of Cancer

5.1. Tumor Growth

The early spatio-temporal modeling of avascular tumor maturation discusses the changes in the structure and size of 3D multicellular spheroids with regard to manipulation in the ambience of culture [298,299]. As we have mentioned before, MMs in the present day are more complex, and most of them are a continuum of the early established MMs [104,105]. Let us assume that spheroids are radially symmetric and their development is modulated by only a diffusible growth factor, which is either externally supplied, e.g., oxygen, or internally produced, e.g., tumor necrosis factor (TNF). The dispersal of growth factor in the spheroid modulates its functional activity if the cell does not reach the point of regression, death, or else. With the integration of these contributory activities on the tumor volume, the following equation can be written. The equation connects the evolution time of the tumor radius, denoted as R(t), to the growth factor distribution in the spheroid, denoted as c(r,t), as follows:

\frac{d R}{d t} = \frac{1}{R^{2}} \int_{r = 0}^{R} F (c) r^{2} d r

(15)

Here, c = growth factor

F (c)

= The influence of c on the net cell growth rate at each point in the spheroid.

Suppose c is oxygen or glucose. So, it can be assumed that F increases along with c, and it approaches a max value for greater values of c. Hence, the equation for the spatial distribution of c is as follows:

\frac{\partial c}{\partial t} = \frac{D}{r^{2}} \frac{\partial}{\partial r} (r^{2} \frac{\partial c}{\partial r}) - g (c, R)

(16)

Here, D = Diffusion coefficient of c

g(c,R) = Local consumption rate of c

These equations can also be used to predict the spheroid structure. The function of F(c) is basically cell-growth-specific. It might be dependent upon cell proliferation, quiescence, or death [299,300].

In 1987, after identifying the retinoblastoma 1 (Rb1; a tumor suppressor gene), the two-hit hypothesis by Knudson was confirmed. This theorem has helped researchers to characterize the inactivation of other tumor suppressor genes, for instance, the adenomatous polyposis coli (APC) gene in colon cancer and tumor protein 53 (TP53), which has been mutated in more than 50% of human tumors [301]. Currently, the ‘hallmarks of cancer’ model [29] is being used to sequence the mutation and observe its evolution timing, along with the influence of environmental factors on the progression of tumors [302,303,304].

There comes another MM model that regards tumor angiogenesis, which was proposed by Balding and McElwain [305,306,307]. This model focuses on the tumor angiogenesis factor along with blood capillary end and vessels that are denoted as a, n, and b, respectively. To represent a one-dimensional model, where x represents the distance between the focal point of a tumor and the vasculature, the equation for a(x,t) is as follows:

\frac{\partial a}{\partial t} = D_{a} \frac{\partial^{2} a}{\partial x} - µ_{a} a

(17)

Here,

D_{a}

= Presumed constant tumor angiogenesis factor diffusion coefficient

µ_{a}

= Natural decomposition rate of tumor cell

Since it has already been presumed that a tumor will produce angiogenesis factor at a constant rate, the concentration of angiogenesis factor at the outer edge of the tumor is also maintained at a constant value. The end of blood capillary is supposed to emerge from existing blood vessels and tips at a rate that increases proportionately with the level of tumor angiogenesis factor to start chemotaxis using the upper spatial gradients of angiogenesis factor, and, thus, create end-to-end anastomosis. By combining these interactions, the equations for n(x,t) is as follows:

\frac{\partial n}{\partial t} = - χ \frac{\partial}{\partial x} (n \frac{\partial a}{\partial x}) + (λ_{n b} b + λ_{n n} n) (\frac{a}{k_{\partial} + a}) - μ_{n} n - ϑ_{n} n^{2}

(18)

Here, χ = Chemotaxis coefficient

λ_{n b}

= The rate at which blood capillary emerges from existing blood vessels

λ_{n n}

= The rate at which an end of blood capillary emerges from existing blood vessels

μ_{n}

= Net rate at which blood capillary tips expire

ϑ_{n}

= The rate at which capillaries form end-to-end anastomoses [307,308].

After the migration of capillary vessels near the tumor, the edge of the capillary extends towards it to connect to new vessels and leave the residue behind. Now, as per this speculation, the equation for the blood vessels b(x,t) is as follows:

\frac{\partial b}{\partial t} = χ^{n} \frac{\partial a}{\partial x} - μ_{b} b

(19)

Here,

μ_{b}

= the rate at which blood vessels revert

Mathematical simulations of Balding and McElwain’s model and its ensuing additions reveal varied architectural attributes of tumor angiogenesis that include the acceleration of vasculature development in the direction of the tumor and peak density of the capillary tips, foregoing the peak density of blood vessels. By using this model, antiangiogenic treatment for counterpoising tumor angiogenesis factor was compared to other treatments that inhibit endothelial cell proliferation or the chemotaxis of cells [305,309,310]. Furthermore, intricate MMs of the genetic regulatory network (GRN) and principal signaling pathways are widely being applied in the study of angiogenesis [311,312]. In 2009, Wu and team applied a compartment model with the purpose of mathematically proving that an innately derived, soluble vascular endothelial growth factor receptor 1 (VEGFR1) fails to notably block the VEGF signaling pathway, and came up with a conclusion that if any signal limiting effect is observed, then the heterogeneous blood flow might be the reason [313].

Swanson’s reaction–diffusion model gives us insight into the growth and invasion of glioma [314,315,316]. Tracqui and co-workers proposed a spatio-temporal model which tells us about the uncontrolled proliferation of cancer cells, along with their capability for metastasis [317]. Harpold et al. simplified this spatio-temporal model based on the fact that glioma cells never take part in metastasis outside the brain. Suppose, as per mm³, that the concentration of tumor cells is c = c(x,t) at time t and location x, where brain domain B is enclosed. So now, as per the model, the tumor cell proliferation and net diffusion can be described as follows [318]:

Rate of change in concentration of glioma cells = Net glioma cell dispersion + Net glioma cell proliferation

(20)

Hence, Equation (20) can be mathematically written as follows:

\frac{\partial c}{\partial t} = \nabla \cdot (D (x) \nabla c) + ρ c

(21)

⇨ x \in B, t \geq 0, c (x, 0) = c_{0} ⇨ n \cdot \nabla c = 0 o n \partial B

(22)

Here, D = Spatially resolved diffusion coefficient at mm²/year

c_{0}

= Initial distribution of tumor cells

ρ

= Net rate of proliferation per year

n \cdot \nabla c

= Zero flux boundary condition

\partial B

= Boundary of brain domain

Equation (22) describes that, at the boundary of the brain domain, a zero-flux boundary condition (n bullet dell c = 0) inhibits tumor cells from leaving the brain domain [318,319,320,321].

5.2. Treatment

MM is also being utilized in optimizing therapeutic protocols that involve combining chemotherapy, radiotherapy, and surgery. Moreover, it is used in advanced cancer treatment development [322]. In 1982, a model was proposed by Barendsen based on a linear-quadratic model for the total dose calculation of a treatment regimen, which is presently known as the biologically effective dose (BED) [323]. Previously, this formula used to be known as the extrapolated response dose, which was changed by Fowler later on in 1989 [324]. It is widely applied to calculate fractions (doses) of external radiation therapy in the treatment of cancer [325,326,327]. The equation for radiational dose calculation is as follows:

B E D = - \frac{\ln (σ)}{α} = D (1 + \frac{d}{α / β}) = n d_{1}, d_{2}, d_{3} \dots \dots \dots d_{n} (1 + \frac{d}{α / β})

(23)

Here, n = Number of fractions

d = Radiational dose per fraction

D = Total dose delivered

α/β = Fractionation sensitivity

Hence, we can say that BED = total physical dose × relative effectiveness [325,328]. Generally, the value of α/β is tumor-site-dependent [329,330,331], because it is a vital factor in determining the radiobiological properties of cells [332,333].

In 1988, Jain and his colleague proposed a model to find out the reason behind the lower delivery of anti-cancer drugs in the case of vascular tumors. Their model concluded that the uneven regional perfusion and blood flow and the higher pressure of interstitial fluid prevent the delivery of drugs to vascular tumors, which was later confirmed in 2004 by Owen et al. [334,335]. Rockne et al. combined the aforementioned linear-quadratic model with Swanson’s reaction–diffusion model to compare the effectiveness of radiational dose distribution and different dose timings [321]. Another mathematical approach was conducted by Siegmund and co-workers, who analyzed DNA methylation patterns so that genetic evolution could be blocked at several sites in the tumor cells of colorectal cancer [336].

In another work, Swanson and team implemented the reaction–diffusion model on the magnetic resonance imaging (MRI) data of glioma patients to predict the possibility of glioma regeneration after surgery [337].

MMs are also being used in cell signaling pathways, drug designing to find new targets, and determining the pharmacokinetic–pharmacodynamic effects of a new anti-cancer molecules [311,313,338]. Talking about targeted drug delivery in cancer, another MM was developed by Panetta that demonstrates the impact of “cell-cycle-specific” anti-cancer drugs (paclitaxel was used here) on both cancerous cells and normal tissues. To obtain this, this model considered two types of cells: (a) proliferating cells (these are sensitive to the paclitaxel) and (b) quiescent cells (these are resistant to paclitaxel) [339].

In 2005, Basse et al. developed an MM for a human tumor cell line that has not been perturbed by anti-cancer treatment [340]. The modeling was performed on the basis of the different phases of the cell cycle, that is, the gap 1/growth phase 1 (G₁ phase), gap 2/growth phase 2 (G₂ phase), synthesis (S), and mitosis (M). In this model, Basse and team assumed that the cell’s dynamics are regulated by the system of partial differential equations (with t, τS > 0 and 0 < x < X) as follows:

\frac{\partial G_{1}}{\partial t} (x, t) = 4 b M (2 x, t) - k_{1} G_{1} (x, t)

(24)

\frac{\partial \bar{S}}{\partial τ_{s}} (x, t; τ_{s}) + \frac{\partial \bar{S}}{\partial t} (x, t; τ_{s}) = D \frac{\partial^{2} \bar{S}}{\partial x^{2}} (x, t; τ_{s}) - g_{S} \frac{\partial \bar{S}}{\partial x} (x, t; τ_{s})

(25)

Here, 0 < x < X = Relative DNA content

t = Time

τ_{s}

= Amount of time cells spend in the synthesis phase

\bar{S} (x, t; τ_{s})

= Number density of cells that have been in the synthesis phase for τ_S hours at time t

G₁(x, t) = Number density of cells in gap 1 phase

D = Dispersion coefficient

g_{S}

= Average growth rate of DNA in the synthesis phase

There are two transitions in the cell cycle. One is the transition between gap 1 and the synthesis phase, and the hourly rate of this transition between these two phases can be denoted as k₁. Another one is the transition between gap 2 and the mitosis phase, and the hourly rate of this transition between these two phases can be denoted as k₂. If from the mitosis phase to the gap 1 phase, cells get divided with a rate b per hour, then, as per the prediction that cells spend a fixed time (can be denoted as T_S h) in the synthesis phase, the equation will be as follows,

\frac{\partial G_{2}}{\partial t} (x, t) = \bar{S} (x, t; T_{S}) - k_{2} G_{2} (x, t)

(26)

\frac{\partial M}{\partial t} (x, t) = k_{2} G_{2} (x, t) - b M (x, t)

(27)

Here, b = Division rate

G₂(x, t) = Number density of cells in gap 2 phase

M (x, t) = Number density of cells in the mitosis phase

After staying in the synthesis phase for

τ_{s}

= T_S hours, cells move to the gap 2 phase. Primary dispersals at time t = 0 in the gap 1, synthesis, gap 2, and mitosis phases are the capricious positive functions G₁(x, 0) = G₁₀(x),

\bar{S}

(x, 0; τ_S) =

\bar{S}

₀(x,

τ_{s}

), G₂(x, 0) = G₂₀(x), and M(x, 0) = M₀(x), respectively (where, 0 < x < X). Now, at the zero-flux boundary conditions in the synthesis phase, Equation (24) will be as follows:

D \frac{\partial \bar{S}}{\partial x} (0, t; τ_{s}) - g_{S} \bar{S} (0, t; τ_{s}) = 0 t, τ_{S} > 0

(28)

D \frac{\partial \bar{S}}{\partial x} (X, t; τ_{S}) - g_{S} \bar{S} (X, t; τ_{S}) = 0 t, τ_{S} > 0

(29)

The primary stage (τ_S = 0) for Equation (25), which represents those cells coming from the gap 1 phase (where, t > 0, 0 < x < X), will be as follows:

\bar{S} (x, t; τ_{s} = 0) = k_{1} G_{1} (x, t)

(30)

Hence, for any given time t₀, the distribution of the DNA of the cells can be attained by adding the profiles in each phase =

G_{1} (x, t_{0}) + \int_{0}^{T_{S}} \bar{S} (x, t_{0}; τ_{S}) d τ_{S} + G_{2} (x, t_{0}) + M (x, t_{0})

[340]. DNA profiles acquired by using these MMs on a human tumor cell line, which have not been perturbed by anti-cancer therapy, show a phenomenon called steady DNA distributions (SDD), regardless of the primary dispersal, which is the feature of another MM mentioned by Arino [341].

5.3. Interconnection between ML and MM

When it comes to any advanced technology, ML and MM are intertwined [342,343]. Machines learn from trained datasets with the help of ML. This learning process is attained through varied mathematical symbols, expressions, and logical equations. Hence, this fact self-describes the relationship between ML and MM. MM supplements an established ML model to provide better outcomes, however, sometimes, the opposite also happens [344,345,346]. For example, during the recent COVID-19 outbreak, when researchers were busy producing different strategies in every possible area of science to assist the worldwide situation, Liu and colleagues used a neural network (ML) to solve the limitation of the SEIR model (MM). Based on the assumption of the SEIR’s inflexion point, they used a neural network to acquire an accurate prediction and further refined the fitting at certain time points [347]. In 2014, Bezak et al. published an article where they used MM to map a robotic hand to overcome the shortcomings of their applied ML model [348]. Additive manufacturing (AM) is the process of adding layers upon layers to form a 3D structure [349]. Powder bed fusion (PBF) is one type of additive manufacturing process. Baturynska et al. used a combination of MM and ML optimization methods for the purpose of the optimization and evaluation of the parameters of this PBFAM process [350]. Similarly, numerous approaches have combined these techniques to achieve a more accurate result in the field of cancer research. Figure 11 shows that researchers achieved better therapeutic results when they used a combination of ML and MM, which is, most importantly, cost effective.

In terms of the benefits of combining ML and MM, telemedicine is a good case in point. Telemedicine can make the healthcare system more available and safe in case of infectious diseases, for example, COVID-19 [352,353]. Sirintrapun and Lopez stated teleoncology as a strategy for improved accessible care with lower treatment and maintenance costs [354]. Moreover, in the case of rural populations, oncologists are not required to travel to the location to provide care [355]. Doolittle et al. [356,357] and, recently, other researchers [358,359,360], also demonstrated the patient satisfaction, improved clinical efficacy, and cost-effectiveness of this telemedicinal approach from their experience in cancer patient care [354]. Shalowitz and Moore mentioned in their recently published article that telemedicines have the potential to reduce or eliminate geographical barriers in providing improved-quality care to cancer patients. They also mentioned their existing applications in cancer care, which are, (a) pre-diagnosis, (b) pre-treatment, (c) treatment, and (d) post-treatment monitoring, at different phases [361]. A study was conducted by Yunus et al. on 217 patients to evaluate the safety, quality, and cost for cancer patients receiving rural telehealthcare (134 patients) in comparison to “routine” urban healthcare (83 patients). The resulting data showed that the telemedicine visits were as good as or better compared to in-person visits, with a 95% patient satisfaction rate [358].

Wearable technology and at-home smart electronics offer a new way to control health outside hospital settings [362,363,364]. These gadgets have the capacity to gather substantial volumes of meticulous data on patient health conditions, which are utilized by ML algorithms to recommend one-time actions, adjustments to daily routines, or referrals to a doctor for evaluation and testing. Sensors for mobility, pulse, respiration rate, body temperature, blood pressure, oxygen levels, and other biometrics are becoming a common feature of wearable technology [365,366,367,368,369]. In the future, wearable and sound sensor data will probably be utilized to find novel biomarkers, possibly by merging data from various types of devices [130,370,371]. The continuous monitoring of a person’s behavior and bodily functions via wearable and home gadgets, along with readouts from routine blood tests, are features of health management. By modifying individual-level models with information gathered for each individual, a customized model of essential functions and activities will be created. The ability to construct individualized outlines and detect their shifts, which may signify a change in health condition, is a major benefit of this technique. ML-based applications will monitor people using customized models for any deviations from the normal and alert them when a change requires consulting a medical specialist. In order to create models using wearable data, both conventional supervised learning and deep learning are likely to be included [372,373,374,375,376].

6. Challenges of ML and MM Approaches in Cancer Prognosis and Therapy

Over the decades, a wide range of FS algorithms have been extensively used in the prognosis and prediction of ailments. The majority of published work discusses the use of ML techniques to simulate the development of cancer and find relevant aspects that are then used in a categorization system [15]. The major challenge in developing an ML tool starts from the very beginning during the training dataset preparation. Missing data are a common phenomenon in this case. The factor that usually leads to missing data is a participant not answering all sections in a questionnaire, maybe due to, (a) time shortage, (b) insufficient knowledge to respond to a particular question, (c) failure to understand the questions, (d) not feeling the desire to respond, and (e) finding certain questions embarrassing to answer. In addition, the quantity of missing data increases with the inclusion of multivariate data in a dataset. The productivity of ML tools is severely influenced by such missingness in datasets. Researchers always try to reduce missing values because of their unavoidable existence, which is a concern for empirical scientists in varied disciplines [377,378,379,380,381].

Another issue is that patient selection bias in ML models might result in subpar performance and inaccurate predictions in future unanticipated circumstances, because a large proportion of AI models are trained using retrospective, observational data [382]. A patient’s desire to maintain the privacy of their health and social details should be respected, so patients’ information must thus be meticulously gathered to obtain the data required to create models [383]. ML-based diagnostic systems could also be biased and prone to mistakes. Due to this, the outcomes from these models cannot be relied upon blindly, since they could potentially harm patients if there is any incorrect information. Health practitioners need to have a thorough grasp of the process of reviewing data sources, model construction, and algorithm formation in order to build intended ML models to predict outcomes. To successfully implement and utilize ML-based prediction models, cooperation between ML specialists and health professionals is required [384]. Figure 12 illustrates an in-depth understanding of ML challenges and categorizes them into different sectors. As mentioned before, these challenges could be sidestepped with the consultation of ML specialists to improve ML algorithms by frequent modification and upgradation. Ethical considerations while developing ML algorithms should be taken into account as well to overcome the issue of patient privacy [385].

MM coupled with well-designed in vitro experiments provides a way of cancer treatment investigation that may cause a reduction in the number of animal studies. We can now understand cancer from several angles with the assistance of MM. The processes involved in creating an MM are: (a) selecting a real-world issue; (b) simplifying biological phenomena; (c) creating a mathematical quantification; and (d) running numerical simulations [386]. The accuracy of these models is discerned only when the results are within the parameters. A significant difficulty for computational oncology is obtaining physiological-based findings for combination chemotherapy and antiangiogenic treatment [387].

6.1. Data Quantity

Any data-driven project must consider the quality and quantity of the data required for ML models. Data scientists may accurately assess the project’s scope, schedule, and viability by knowing the minimum dataset size needed. The kind of issue being handled, the model complexity, the quality and correctness of the data, and the accessibility of labeled data are all considered when estimating the volume of data required for an ML model [388,389,390,391]. Data amount estimation can be handled via statistical methods for large datasets and the rule of thumb approach for smaller datasets [392,393]. Applying the 10 times rule is the most typical technique to determine whether a data collection is enough. According to this principle, a model should include 10 times more input data (i.e., instances) than degrees of freedom. Degrees of freedom often refer to variables in the specified data collection. This could also mean that input data need at least 10 times as many data points (rows) as there are features (columns) in the dataset when using ML [394].

For an ML model to be adequately trained for a medical technology solution, data accessibility by itself is frequently insufficient. In healthcare initiatives, the accuracy of the data is crucial. Research in this area is challenging because of heterogeneous data formats. The varied forms of data from laboratory tests, medical pictures, vital signs, and genomes make it difficult to apply ML algorithms to all data concurrently. The widespread availability of medical datasets is another problem in making a dataset with good-quality data [395,396,397,398].

Applications for ML in healthcare span from completely autonomous AI for cancer detection to non-autonomous mortality estimates to help allocate resources [399]. It may be quite challenging to strike a balance between information quantity and contemporaneity. Modifications are frequently made to the information gathered and the testing procedures. This implies that the models that use data gathered over a lengthy period of time need a method to handle these modifications and to compare normal values and ranges. A model should be receptive rather than generalizable, recognizing incoming data that differ from the training or test set. A model with suitable governance of versions and data needs that can be modified and revalidated might prove more beneficial [400]. The fact that even DL, which is a subset of ML, is data greedy constitutes one of the biggest obstacles to its implementation. DL uses a lot of data to forecast a group of unobserved data and understand how features behave during training [401]. In order to avoid overfitting, modern deep neural networks generally feature numerous parameters that can be trained and need a corresponding amount of labeled pictures during training [402].

6.2. Ethical Consideration

Even though the methodology of how ML and MM work is known, ethical considerations need to be put into play when working with them. AI algorithms have special ethical and legal issues that restrict their widespread use and reproducibility, particularly their natural bias when developed on datasets that preferentially omit underrepresented people. Furthermore, algorithms must show dependability, validity, and transparency [403]. The security and confidentiality of medical information, biases in the data employed when constructing the model, trustee connection, dispute, a lack of duty or liability, relationships between doctors and patients that might alter the autonomy of patients, and a violation of the independence of patients, along with criticism by peers were among the ethical issues covered in research that attempts to offer a systematic evaluation of the moral and societal effects of ML models on the therapy of oral cancer [404]. Despite its promise, utilizing ML for forecasting mortality in the pediatric ICU raises ethical questions about bias, trust, and the effect on treatment. Forecasts are affected by a dataset’s lack of variety or correctness. ML is dependent on “learning” from large datasets. Simulations that depend on data regarding ICU fatalities, nonetheless, do not take into account the possibility of death following hospitalization, leading to a sort of reverse survivorship bias. A prejudiced algorithm may exist itself. In the event that providers grow reliant on AI, backup procedures must be in place in case of system failure or during routine upgrades. The black box phenomenon refers to some algorithms for forecasting that are so complicated that it is impossible to understand how judgments are made. If the models are not carefully examined for their safety and efficacy, this paucity of transparency can result in skepticism and may impair physician and patient acceptability, as well as the usage of such technologies [405]. How doctors and healthcare institutions use the data produced by risk estimation algorithms raises a separate ethical issue. A risk calculator would produce patient-specific risk data in a perfect world, enabling informed permission and collaborative decision making. In a reverse scenario, the dependency on risk calculators may reduce the practice of asking patients about their values, past experiences, and motives when deciding whether or not to implement a future medical intervention. Additionally, doctors may restrict the options offered to patients and their families when the dangers are significant, prioritizing paternalism above regard for autonomy. Additionally, there is a risk that healthcare programs or insurance providers will use risk calculators to select which patients are suitable candidates for particular initiatives, completely avoiding patient–physician decision making or penalizing doctors who function above the limit of an “acceptable” risk spectrum [406].

Ongoing discussion evokes the question of whether ML fits within existing legal categories or whether a new category with its special features and implications should be developed. Although the use of ML in clinical settings holds great promise for enhancing healthcare, it also raises ethical concerns that we need to consider. Four significant ethical concerns need to be resolved for ML in healthcare to realize its potential completely. Important things to take into account include: (a) informed permission to utilize data, (b) safety and transparency, (c) fairness through algorithms and biases, and (d) data privacy. The question of whether ML systems are lawful is controversial politically as well as legally [407,408]. The goal is to support policymakers so they may take prompt action to address the morally challenging circumstances in which mandating ML in medical facilities improves [408,409]. Many legal conversations on ML have been influenced by the limits of algorithmic openness. ML design and governance must now be more responsible, egalitarian, and transparent as ML is used more frequently in high-risk circumstances. The two most crucial components of transparency are information accessibility and understandability. Learning about an algorithm’s operation is usually made difficult on purpose. With machines that function by ML, in case of any issues, the manufacturer or operator will be held responsible for any harm caused [399,408,410,411].

Even though wearable technologies have been used in the medical industry to improve people’s health, they might also pose ethical issues [412,413,414]. While the algorithm and the guidelines are given by humans, the patient information is gathered in accordance with the requirements of the algorithm design. ML imitates how the human brain makes decisions; it is not as intelligent as a real person, which raises moral limitations as well. A significant issue in the medical internet of things (IoT) is the anti-interference capability and security hazards of transmission technologies [415,416,417]. Our understanding of privacy varies depending on the context, and the ethical analysis will differ depending on the person accessing a particular type of data for a certain purpose [418,419].

Regulations play a crucial role in the steady advancement of these technologies. It is necessary to address the problem of relevant regulatory policies by the algorithm developers from all research fields [420,421].

6.3. Data Privacy

Data privacy is a concern during ML algorithm data storage and use. Data might be stored in multiple locations to avoid data loss, and thus, there can be risks of data breaches or hacking [422]. Even though people are made aware that a company is gathering information about them, it is not always feasible to refuse to collect such information. Furthermore, privacy safeguards that apply to identifiable personal data in one area may not always be applicable to other places. Such regulatory differences allow public and private organizations to get around data protection laws that they find overly constricting. Therefore, it is even more critical to reconsider the legal conceptions of privacy and property regarding the data practices that enable algorithmic regulation [423,424,425,426]. The primary moral concern is the confidentiality of medical data. A lot of patient medical information is used to build ML models. As a result, the security and privacy of patients are at risk. To solve this problem, it is necessary to notify patients or participants in other studies about the gathering and utilization of their data in order to obtain their informed approval, avoid unlawful proprietary usage of their information, and safeguard their privacy. Approaches for AI-based illness diagnosis have significant challenges in the areas of data protection and privacy: (a) data leaks: AI-based ailments diagnostic technologies are vulnerable to hackers since they contain a lot of private patient information. Unapproved access to or disclosure of patient records might happen as a consequence of an information breach, which could seriously violate privacy rights; (b) sharing of data: AI-based illness diagnostic systems frequently divulge patient information with third parties, including healthcare professionals and research institutes. This may lead to questions about the safety and confidentiality of data as well as possible data misuse; (c) creating data anonymity: Anonymized data may be used by AI-based illness diagnostic systems to safeguard the confidentiality of patients. However, anonymous data continue to be utilized to re-create patients’ identities, which means there is a chance that the information will be abused; (d) storing of data: Systems for AI-based illness diagnosis keep a lot of patient data. The information in question may be kept in several different places and may be subject to security breaches, computer hacking, and data loss; (e) transparency is lacking: Patients may find it challenging to comprehend how their data are utilized and to govern accessibility to their data, since AI-based disease diagnosis methods may not be transparent in how they gather, store, and utilize information concerning patients; and (f) prejudice and bias: Bias and prejudice can have an impact on AI models, which might provide erroneous or unreliable findings, specifically for a particular demographic group [422]. By combining the health insurance portability and accountability act (HIPAA) with the general data protection regulation (GDPR) of Europe [427,428] and the California consumer privacy act (CCPA) [429,430], Bari and O’Neill, 2019, proposed a framework for reconsidering patient data privacy in the era of digital health [431]. One study suggests a brand-new, non-invasive, secure cancer diagnosis technique employing DL. In order to avoid data theft, the information gathered is encrypted before being transmitted across the channel. Correlation, entropy, contrast, structural content, and energy are some of the security metrics that are used to evaluate the effectiveness of the suggested encryption technology. A picture encryption method that uses chaos, discrete wavelet transforms, and bit-plane extraction to transfer data without being modified by hackers or unauthorized access protects the private medical photographs of patients used for cancer detection. The information is received in an encrypted format and then decrypted before being employed for the diagnosis of cancer [432].

Patients must be informed of their privacy rights, the types of PHI that will be shared with third parties, and the purposes for which such disclosures will be made before they are admitted to a healthcare facility. HIPAA now mandates that all patients, regardless of age or gender, receive notice of privacy practices. The patient must sign this form, and a single copy must be stored in the hospital records, which also acts as evidence that the patient received privacy notice. If the patient is unable to sign for whatever reason, the situation must be noted and witnessed. If someone else signs the paperwork, the justification for the signature must be recorded. The healthcare facility is not obligated to continuously inquire about the patient for disclosure of PHI during routine care once they have signed a notice of privacy practice. The note must be updated if the patient’s medical state changes or if they develop new privacy concerns. The patient may request that no friends or family members be allowed to collect their drugs or that the healthcare workers refrain from discussing the patient’s medical condition with them [430,433,434]. Effective data anonymity and security precautions must be implemented to address concerns with data privacy, protection, and governance [435].

7. Further Discussion and Future Directions

ML has made life easier due to its quick learning capabilities with almost zero chance of error so that we can focus on other tasks. The objective of ML and MM-driven strategy in healthcare is not only to save time, money, and resources, but also to provide medications to patients more quickly. The main objective is to deliver the same healthcare facilities to people from all socioeconomic categories regarding any ailments, diseases, and accidental cases.

To find causality, we must accurately forecast system dynamics, which motivates the integration of multiscale modeling with ML and MM for biological, biomedical, and behavioral systems. The fundamental question is whether we will eventually be able to use today’s models to detect appropriate biological traits and investigate their interplay in real time. If the progression of disease biomarkers is uncovered and processes are understood from vast datasets, for example, early biomarkers of cancer, it is a highly applicable example of direct translational usefulness.

Developing data- and theory-driven ways to formulate a mechanistic understanding of the genesis of biological function to explain occurrences at higher levels as a result of collective action on lower scales is the ultimate challenge, to put it more abstractly. In this review paper, we have seen quite a few successful ML, MM, and sometimes combined strategies in cancer prognosis and treatment. These days, the use of MM-driven ML tools has started to revolutionize the cancer management system. There are now several wearable technologies available for tracking physical health. In recent years, the smartwatch market has experienced rapid expansion. The sharp increase in wearable gadget sales indicates customers’ interest in daily monitoring activities. In both clinical settings and laboratory experiments, wearable technology can track physical activity and notify of unpredictable changes in the human body. It is now feasible to remotely track the physiological characteristics of a large number of cancer patients in real time thanks to cutting-edge communications technologies that enable immediate and enormous multidirectional data transfer. Physicians can then obtain these real-time data, subsequently facilitating prompt action. Electronic biosensors have been gradually reduced in size over the past few decades, enabling wearable devices to continuously monitor physiological parameters like the skin temperature, heart rate, respiration rate, oxygen saturation, perspiration, and activity of ambulatory subjects on a 24/7 basis.

For the successful and uninterrupted implementation of these ML-based tools in cancer management, they should be under the systematic vigilance of ML specialists. Systematic flaws in ML-based systems may be found by developers, who may then make appropriate revisions to the model creation process, such as eliminating the offending predictor or using an alternative model. Therefore, vigilance is a must, because ML interpretability might be incorrect or meaningless, especially when the disease is multidimensional, like cancer. Consequently, it may be preferable that a particular ML model be restricted if it is accountable for high risks in cancer patient management.

We have seen how ML and MM are used in various cancer treatment and management forms. This, however, is far from exhibiting their growing potential in the current world. We believe the opportunities for ML and MM are just hitting the tip of the iceberg and future developments are vast. To manage the distribution of healthcare resources, ML healthcare employs a range of techniques, from completely autonomous AI for cancer detection to non-autonomous death estimations. Treatment options using AI and ML are expanding, and they range from robots that are located in communities to virtual psychotherapists.

High-performance computing is an aspect of computing that can process massive amounts of data, and it has recently become widely employed in many different industries. Medical investigators can learn more about probable root causes and therapies for illnesses by using large-scale data analysis techniques based on high-performance computing. Researchers in medicine can learn the processes of cancer incidence and progression and, hence, create more efficient cancer therapies by examining large-scale genomic datasets. They are able to analyze and interpret picture data more quickly due to high-performance computing, which also increases the precision and efficiency of medical diagnoses. The functioning of human organs, along with disease processes, may be simulated and examined by medical scientists with the use of high-performance computing. Medical professionals may more correctly investigate human physiology and disease processes and create more efficient therapies by using MM based on high-performance computers.

With the help of virtual reality (VR) innovations, patients may gradually adjust to and recover from their psychiatric illnesses in a secure environment. Although the combination of MM with VR technology is not quite there yet, it has demonstrated an extensive spectrum of possible applications. With the use of MM and VR devices, medical professionals will be able to improve how they diagnose and treat patients in the future. They can be utilized to alleviate cancer-treatment-related side effects such as persistent pain and post-surgical discomforts.

One study stipulated that ML does not anticipate displacing radiologists any time soon. Instead, these methods are anticipated to assist radiologists, streamline radiology operations, and raise the diagnostic reliability of radiologists. The use of ML techniques may make it easier to spot linkages and patterns that would often escape human observation. Many AI programs are currently being built on straightforward tasks that pose no difficulty for humans. If efforts are concentrated on jobs that are difficult for radiologists to perform, these AI tools may be of more benefit. Perhaps the earliest medical specialties to apply ML algorithms may be imaging for diagnostic purposes, but other disciplines, including pathology, cardiology, dermatology, and gastrointestinal, all have potential applications [436].

An area of biotechnology known as “gene editing” uses instruments like CRISPR/Cas9 to alter gene sequences accurately. With the use of this technology, people with genetic disorders may be able to regain normal function by targeting and altering certain genes. Additionally, technology for editing genes can be utilized to treat conditions affecting the immune system, cancer, and cardiovascular disease. The majority of present therapies are created based on typical results and cannot be specifically adjusted to each patient’s particular circumstance. It would be a significant advancement if personalized treatment regimens could be developed using ML and MM algorithms to take into account each patient’s genetic makeup, medical background, and other clinical data.

8. Conclusions

Applications of ML and MM in the overall management of different cancers are undeniably powerful these days, and trendy as well. First, advancements in the different cancer biology research require initial MMs to investigate the potential of hypotheses. If researchers attain a positive outcome, they proceed to further investigation via ML models, where again, MMs are also sometimes required to make the experimental outcome more accurate. If researchers obtain a positive outcome from this experimental stage, they proceed further to laboratory-based experiments. By using MM and ML techniques, researchers can rapidly predict cancer susceptibility, recurrence, and survival rate along with the best possible treatment combination as well. Before scientists used to directly execute laboratory experiments, however, this was a time-consuming, costly, and tiring. This is why, before heading to the lab, multiple computational steps are now performed first. This combinatorial approach of ML and MM makes cancer management feasible, cost-effective, and more accurate than before.

Author Contributions

Conceptualization, J.H., D.B.D. and L.D.; writing—original draft preparation, J.H. and S.M.S.; writing—review and editing, J.H., D.B.D. and L.D.; supervision, M.J.U., D.B.D. and L.D.; project administration, M.J.U., D.B.D. and L.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data will be made available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Xue, C.; Chu, Q.; Zheng, Q.; Jiang, S.; Bao, Z.; Su, Y.; Lu, J.; Li, L. Role of Main RNA Modifications in Cancer: N6-Methyladenosine, 5-Methylcytosine, and Pseudouridine. Signal Transduct. Target. Ther. 2022, 7, 142. [Google Scholar] [CrossRef]
Wang, Y.; Han, Y.; Jin, Y.; He, Q.; Wang, Z. The Advances in Epigenetics for Cancer Radiotherapy. Int. J. Mol. Sci. 2022, 23, 5654. [Google Scholar] [CrossRef]
Jiang, W.; Liang, M.; Lei, Q.; Li, G.; Wu, S. The Current Status of Photodynamic Therapy in Cancer Treatment. Cancers 2023, 15, 585. [Google Scholar] [CrossRef]
Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global Cancer Statistics 2018: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA. Cancer J. Clin. 2018, 68, 394–424. [Google Scholar] [CrossRef]
Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA. Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef] [PubMed]
Li, J.; Wang, S.; Fontana, F.; Tapeinos, C.; Shahbazi, M.A.; Han, H.; Santos, H.A. Nanoparticles-Based Phototherapy Systems for Cancer Treatment: Current Status and Clinical Potential. Bioact. Mater. 2023, 23, 471–507. [Google Scholar] [CrossRef] [PubMed]
Baskar, R.; Lee, K.A.; Yeo, R.; Yeoh, K.W. Cancer and Radiation Therapy: Current Advances and Future Directions. Int. J. Med. Sci. 2012, 9, 193. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Yang, H.; Xiong, J.; Zhao, J.; Guo, M.; Chen, J.; Zhao, X.; Chen, C.; He, Z.; Zhou, Y.; et al. Icariin as an Emerging Candidate Drug for Anticancer Treatment: Current Status and Perspective. Biomed. Pharmacother. 2023, 157, 113991. [Google Scholar] [CrossRef] [PubMed]
Chen, J.; Cong, X. Surface-Engineered Nanoparticles in Cancer Immune Response and Immunotherapy: Current Status and Future Prospects. Biomed. Pharmacother. 2023, 157, 113998. [Google Scholar] [CrossRef] [PubMed]
Mohapatra, A.; Sathiyamoorthy, P.; Park, I.K. Metallic Nanoparticle-Mediated Immune Cell Regulation and Advanced Cancer Immunotherapy. Pharmaceutics 2021, 13, 1867. [Google Scholar] [CrossRef] [PubMed]
Tie, Y.; Tang, F.; Wei, Y.Q.; Wei, X.W. Immunosuppressive Cells in Cancer: Mechanisms and Potential Therapeutic Targets. J. Hematol. Oncol. 2022, 15, 61. [Google Scholar] [CrossRef]
Helissey, C.; Vicier, C.; Champiat, S. The Development of Immunotherapy in Older Adults: New Treatments, New Toxicities? J. Geriatr. Oncol. 2016, 7, 325–333. [Google Scholar] [CrossRef]
Doi, K. Computer-Aided Diagnosis in Medical Imaging: Historical Review, Current Status and Future Potential. Comput. Med. Imaging Graph. 2007, 31, 198–211. [Google Scholar] [CrossRef]
Wong, D.; Yip, S. Machine Learning Classifies Cancer. Nature 2018, 555, 446–447. [Google Scholar] [CrossRef]
Kourou, K.; Exarchos, T.P.; Exarchos, K.P.; Karamouzis, M.V.; Fotiadis, D.I. Machine Learning Applications in Cancer Prognosis and Prediction. Comput. Struct. Biotechnol. J. 2015, 13, 8–17. [Google Scholar] [CrossRef]
Cruz, J.A.; Wishart, D.S. Applications of Machine Learning in Cancer Prediction and Prognosis. Cancer Inform. 2006, 2, 59–78. [Google Scholar] [CrossRef]
Munir, K.; Elahi, H.; Ayub, A.; Frezza, F.; Rizzi, A. Cancer Diagnosis Using Deep Learning: A Bibliographic Review. Cancers 2019, 11, 1235. [Google Scholar] [CrossRef] [PubMed]
Rosenblatt, F. The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychol. Rev. 1958, 65, 386–408. [Google Scholar] [CrossRef] [PubMed]
Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, Perspectives, and Prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef] [PubMed]
Hethcote, H.W. The Mathematics of Infectious Diseases. Soc. Ind. Appl. Math. Rev. 2000, 42, 599–653. [Google Scholar] [CrossRef]
Chambers, R.B. The Role of Mathematical Modeling in Medical Research: “Research Without Patients?”. Ochsner J. 2000, 2, 218. [Google Scholar]
Liu, Y.; Wu, R.; Yang, A. Research on Medical Problems Based on Mathematical Models. Mathematics 2023, 11, 2842. [Google Scholar] [CrossRef]
Tarca, A.L.; Carey, V.J.; Chen, X.W.; Romero, R.; Drǎghici, S. Machine Learning and Its Applications to Biology. PLoS Comput. Biol. 2007, 3, e116. [Google Scholar] [CrossRef]
Jiang, F.; Jiang, Y.; Zhi, H.; Dong, Y.; Li, H.; Ma, S.; Wang, Y.; Dong, Q.; Shen, H.; Wang, Y. Artificial Intelligence in Healthcare: Past, Present and Future. Stroke Vasc. Neurol. 2017, 2, 230–243. [Google Scholar] [CrossRef] [PubMed]
Dilsizian, S.E.; Siegel, E.L. Artificial Intelligence in Medicine and Cardiac Imaging: Harnessing Big Data and Advanced Computing to Provide Personalized Medical Diagnosis and Treatment. Curr. Cardiol. Rep. 2014, 16, 441. [Google Scholar] [CrossRef] [PubMed]
Murdoch, T.B.; Detsky, A.S. The Inevitable Application of Big Data to Health Care. JAMA 2013, 309, 1351–1352. [Google Scholar] [CrossRef] [PubMed]
Kononenko, I. Machine Learning for Medical Diagnosis: History, State of the Art and Perspective. Artif. Intell. Med. 2001, 23, 89–109. [Google Scholar] [CrossRef] [PubMed]
Iqbal, M.J.; Javed, Z.; Sadia, H.; Qureshi, I.A.; Irshad, A.; Ahmed, R.; Malik, K.; Raza, S.; Abbas, A.; Pezzani, R.; et al. Clinical Applications of Artificial Intelligence and Machine Learning in Cancer Diagnosis: Looking into the Future. Cancer Cell Int. 2021, 21, 270. [Google Scholar] [CrossRef] [PubMed]
Hanahan, D.; Weinberg, R.A. Hallmarks of Cancer: The Next Generation. Cell 2011, 144, 646–674. [Google Scholar] [CrossRef] [PubMed]
Friis, L.S.; Elverdam, B.; Schmidt, K.G. The Patient’s Perspective: A Qualitative Study of Acute Myeloid Leukaemia Patients’ Need for Information and Their Information-Seeking Behaviour. Support. Care Cancer 2003, 11, 162–170. [Google Scholar] [CrossRef]
Kaplowitz, S.A.; Campo, S.; Chiu, W.T. Cancer Patients’ Desires for Communication of Prognosis Information. Health Commun. 2009, 14, 221–241. [Google Scholar] [CrossRef] [PubMed]
Jenkins, V.; Fallowfield, L.; Saul, J. Information Needs of Patients with Cancer: Results from a Large Study in UK Cancer Centres. Br. J. Cancer 2001, 84, 48–51. [Google Scholar] [CrossRef] [PubMed]
Butow, P.N.; Maclean, M.; Dunn, S.M.; Tattersall, M.H.N.; Boyer, M.J. The Dynamics of Change: Cancer Patients’ Preferences for Information, Involvement and Support. Ann. Oncol. 1997, 8, 857–863. [Google Scholar] [CrossRef] [PubMed]
Lobb, E.A.; Kenny, D.T.; Butow, P.N.; Tattersall, M.H.N. Women’s Preferences for Discussion of Prognosis in Early Breast Cancer. Health Expect. 2001, 4, 48–57. [Google Scholar] [CrossRef] [PubMed]
Wiens, J.; Shenoy, E.S. Machine Learning for Healthcare: On the Verge of a Major Shift in Healthcare Epidemiology. Clin. Infect. Dis. 2018, 66, 149–153. [Google Scholar] [CrossRef] [PubMed]
Rácz, A.; Bajusz, D.; Héberger, K. Multi-Level Comparison of Machine Learning Classifiers and Their Performance Metrics. Molecules 2019, 24, 2811. [Google Scholar] [CrossRef]
Uddin, S.; Khan, A.; Hossain, M.E.; Moni, M.A. Comparing Different Supervised Machine Learning Algorithms for Disease Prediction. BMC Med. Inform. Decis. Mak. 2019, 19, 281. [Google Scholar] [CrossRef] [PubMed]
Mahesh, B. Machine Learning Algorithms—A Review. Int. J. Sci. Res. 2018, 9, 381–386. [Google Scholar] [CrossRef]
Vamathevan, J.; Clark, D.; Czodrowski, P.; Dunham, I.; Ferran, E.; Lee, G.; Li, B.; Madabhushi, A.; Shah, P.; Spitzer, M.; et al. Applications of Machine Learning in Drug Discovery and Development. Nat. Rev. Drug Discov. 2019, 18, 463–477. [Google Scholar] [CrossRef]
Althnian, A.; AlSaeed, D.; Al-Baity, H.; Samha, A.; Dris, A.B.; Alzakari, N.; Abou Elwafa, A.; Kurdi, H. Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain. Appl. Sci. 2021, 11, 796. [Google Scholar] [CrossRef]
Patel, V.L.; Shortliffe, E.H.; Stefanelli, M.; Szolovits, P.; Berthold, M.R.; Bellazzi, R.; Abu-Hanna, A. The Coming of Age of Artificial Intelligence in Medicine. Artif. Intell. Med. 2009, 46, 5–17. [Google Scholar] [CrossRef]
Graber, M.L.; Franklin, N.; Gordon, R. Diagnostic Error in Internal Medicine. Arch. Intern. Med. 2005, 165, 1493–1499. [Google Scholar] [CrossRef]
Weingart, S.N.; Wilson, R.M.L.; Gibberd, R.W.; Harrison, B. Epidemiology of Medical Error. BMJ 2000, 320, 774–777. [Google Scholar] [CrossRef]
Winters, B.; Custer, J.; Galvagno, S.M.; Colantuoni, E.; Kapoor, S.G.; Lee, H.W.; Goode, V.; Robinson, K.; Nakhasi, A.; Pronovost, P.; et al. Diagnostic Errors in the Intensive Care Unit: A Systematic Review of Autopsy Studies. BMJ Qual. Saf. 2012, 21, 894–902. [Google Scholar] [CrossRef]
Lee, E.J.; Kim, Y.H.; Kim, N.; Kang, D.W. Deep into the Brain: Artificial Intelligence in Stroke Imaging. J. Stroke 2017, 19, 277. [Google Scholar] [CrossRef]
Sun, J.Y.; Shen, H.; Qu, Q.; Sun, W.; Kong, X.Q. The Application of Deep Learning in Electrocardiogram: Where We Came from and Where We Should Go? Int. J. Cardiol. 2021, 337, 71–78. [Google Scholar] [CrossRef]
Lecun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
Deng, L.; Yu, D. Deep Learning: Methods and Applications. Found. Trends® Signal Process. 2014, 7, 197–387. [Google Scholar] [CrossRef]
Cohen, S. Dealing with Data: Strategies of Preprocessing Data. In Artificial Intelligence and Deep Learning in Pathology; Elsevier: Amsterdam, The Netherlands, 2021; pp. 77–92. [Google Scholar] [CrossRef]
Ahsan, M.M.; Mahmud, M.A.P.; Saha, P.K.; Gupta, K.D.; Siddique, Z. Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies 2021, 9, 52. [Google Scholar] [CrossRef]
Huang, J.; Li, Y.F.; Xie, M. An Empirical Analysis of Data Preprocessing for Machine Learning-Based Software Cost Estimation. Inf. Softw. Technol. 2015, 67, 108–127. [Google Scholar] [CrossRef]
López, J.A.H.; Cánovas Izquierdo, J.L.; Cuadrado, J.S. ModelSet: A Dataset for Machine Learning in Model-Driven Engineering. Softw. Syst. Model. 2022, 21, 967–986. [Google Scholar] [CrossRef]
Paullada, A.; Raji, I.D.; Bender, E.M.; Denton, E.; Hanna, A. Data and Its (Dis)Contents: A Survey of Dataset Development and Use in Machine Learning Research. Patterns 2021, 2, 100336. [Google Scholar] [CrossRef]
Kabir, M.F.; Chen, T.; Ludwig, S.A. A Performance Analysis of Dimensionality Reduction Algorithms in Machine Learning Models for Cancer Prediction. Healthcare Anal. 2023, 3, 100125. [Google Scholar] [CrossRef]
Lin, Y.; Zhu, X.; Zheng, Z.; Dou, Z.; Zhou, R. The Individual Identification Method of Wireless Device Based on Dimensionality Reduction and Machine Learning. J. Supercomput. 2019, 75, 3010–3027. [Google Scholar] [CrossRef]
Kang, Z.; Liu, H.; Li, J.; Zhu, X.; Tian, L. Self-Paced Principal Component Analysis. Pattern Recognit. 2023, 142, 109692. [Google Scholar] [CrossRef]
Candès, E.J.; Li, X.; Ma, Y.; Wright, J. Robust Principal Component Analysis? J. ACM 2011, 58, 11. [Google Scholar] [CrossRef]
Campbell-Washburn, A.E.; Atkinson, D.; Nagy, Z.; Chan, R.W.; Josephs, O.; Lythgoe, M.F.; Ordidge, R.J.; Thomas, D.L. Using the Robust Principal Component Analysis Algorithm to Remove RF Spike Artifacts from MR Images. Magn. Reson. Med. 2016, 75, 2517–2525. [Google Scholar] [CrossRef] [PubMed]
Tang, G.; Nehorai, A. Constrained Cramér-Rao Bound on Robust Principal Component Analysis. IEEE Trans. Signal Process. 2011, 59, 5070–5076. [Google Scholar] [CrossRef]
Schölkopf, B.; Smola, A.; Müller, K.R. Kernel Principal Component Analysis. In Proceedings of the 7th International Conference on Artificial Neural Networks (ICANN’97), Lausanne, Switzerland, 8–10 October 1997; Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer: Berlin/Heidelberg, Germany, 1997; Volume 1327, pp. 583–588. [Google Scholar]
Kim, K.I.; Jung, K.; Kim, H.J. Face Recognition Using Kernel Principal Component Analysis. IEEE Signal Process. Lett. 2002, 9, 40–42. [Google Scholar] [CrossRef]
Lee, J.M.; Yoo, C.K.; Choi, S.W.; Vanrolleghem, P.A.; Lee, I.B. Nonlinear Process Monitoring Using Kernel Principal Component Analysis. Chem. Eng. Sci. 2004, 59, 223–234. [Google Scholar] [CrossRef]
Kocaguneli, E.; Menzies, T.; Keung, J.W. Kernel Methods for Software Effort Estimation: Effects of Different Kernel Functions and Bandwidths on Estimation Accuracy. Empir. Softw. Eng. 2013, 18, 1–24. [Google Scholar] [CrossRef]
Myrtveit, I.; Stensrud, E.; Olsson, U.H. Analyzing Data Sets with Missing Data: An Empirical Evaluation of Imputation Methods and Likelihood-Based Methods. IEEE Trans. Softw. Eng. 2001, 27, 999–1013. [Google Scholar] [CrossRef]
Sentas, P.; Angelis, L. Categorical Missing Data Imputation for Software Cost Estimation by Multinomial Logistic Regression. J. Syst. Softw. 2006, 79, 404–414. [Google Scholar] [CrossRef]
Twala, B.; Cartwright, M. Ensemble Missing Data Techniques for Software Effort Prediction. Intell. Data Anal. 2010, 14, 299–331. [Google Scholar] [CrossRef]
Azzeh, M.; Neagu, D.; Cowling, P.I. Analogy-Based Software Effort Estimation Using Fuzzy Numbers. J. Syst. Softw. 2011, 84, 270–284. [Google Scholar] [CrossRef]
Huang, S.J.; Chiu, N.H. Optimization of Analogy Weights by Genetic Algorithm for Software Effort Estimation. Inf. Softw. Technol. 2006, 48, 1034–1045. [Google Scholar] [CrossRef]
Li, J.; Ruhe, G.; Al-Emran, A.; Richter, M.M. A Flexible Method for Software Effort Estimation by Analogy. Empir. Softw. Eng. 2007, 12, 65–106. [Google Scholar] [CrossRef]
Rodríguez, D.; Sicilia, M.A.; García, E.; Harrison, R. Empirical Findings on Team Size and Productivity in Software Development. J. Syst. Softw. 2012, 85, 562–570. [Google Scholar] [CrossRef]
Strike, K.; El Emam, K.; Madhavji, N. Software Cost Estimation with Incomplete Data. IEEE Trans. Softw. Eng. 2001, 27, 890–908. [Google Scholar] [CrossRef]
Angelis, L.; Stamelos, I. A Simulation Tool for Efficient Analogy Based Cost Estimation. Empir. Softw. Eng. 2000, 5, 35–68. [Google Scholar] [CrossRef]
Bzdok, D.; Krzywinski, M.; Altman, N. Points of Significance: Machine Learning: Supervised Methods. Nat. Methods 2018, 15, 5–6. [Google Scholar] [CrossRef]
Russell, S.J.; Norvig, P.; Davis, E.; Edwards, D.D.; Forsyth, D.; Hay, N.J.; Malik, J.M.; Mittal, V.; Sahami, M.; Thrun, S. Artificial Intelligence A Modern Approach, 3rd ed.; Prentice Hall: Saddle River, NJ, USA, 2016. [Google Scholar]
Zhang, Y.; Yu, C.; Wang, R.; Liu, X. Visual Dimension Analysis Based on Dimension Subdivision. J. Vis. 2021, 24, 117–131. [Google Scholar] [CrossRef]
Berisha, V.; Krantsevich, C.; Hahn, P.R.; Hahn, S.; Dasarathy, G.; Turaga, P.; Liss, J. Digital Medicine and the Curse of Dimensionality. NPJ Digit. Med. 2021, 4, 5061815. [Google Scholar] [CrossRef] [PubMed]
Aremu, O.O.; Hyland-Wood, D.; McAree, P.R. A Machine Learning Approach to Circumventing the Curse of Dimensionality in Discontinuous Time Series Machine Data. Reliab. Eng. Syst. Saf. 2020, 195, 106706. [Google Scholar] [CrossRef]
Greener, J.G.; Kandathil, S.M.; Moffat, L.; Jones, D.T. A Guide to Machine Learning for Biologists. Nat. Rev. Mol. Cell Biol. 2021, 23, 40–55. [Google Scholar] [CrossRef] [PubMed]
Reddy, G.T.; Reddy, M.P.K.; Lakshmanna, K.; Kaluri, R.; Rajput, D.S.; Srivastava, G.; Baker, T. Analysis of Dimensionality Reduction Techniques on Big Data. IEEE Access 2020, 8, 54776–54788. [Google Scholar] [CrossRef]
Tsai, F.S. Dimensionality Reduction Techniques for Blog Visualization. Expert Syst. Appl. 2011, 38, 2766–2773. [Google Scholar] [CrossRef]
Aziz, R.; Verma, C.K.; Srivastava, N. Artificial Neural Network Classification of High Dimensional Data with Novel Optimization Approach of Dimension Reduction. Ann. Data Sci. 2018, 5, 615–635. [Google Scholar] [CrossRef]
Zebari, R.R.; Mohsin Abdulazeez, A.; Zeebaree, D.Q.; Zebari, D.A.; Saeed, J.N. A Comprehensive Review of Dimensionality Reduction Techniques for Feature Selection and Feature Extraction. J. Appl. Sci. Technol. Trends 2020, 1, 56–70. [Google Scholar] [CrossRef]
Jia, W.; Sun, M.; Lian, J.; Hou, S. Feature Dimensionality Reduction: A Review. Complex Intell. Syst. 2022, 8, 2663–2693. [Google Scholar] [CrossRef]
Chandrashekar, G.; Sahin, F. A Survey on Feature Selection Methods. Comput. Electr. Eng. 2014, 40, 16–28. [Google Scholar] [CrossRef]
Li, L. Dimension Reduction for High-Dimensional Data. Methods Mol. Biol. 2010, 620, 417–434. [Google Scholar] [CrossRef]
Hira, Z.M.; Gillies, D.F. A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data. Adv. Bioinform. 2015, 2015, 98363. [Google Scholar] [CrossRef]
Abdulhammed, R.; Musafer, H.; Alessa, A.; Faezipour, M.; Abuzneid, A. Features Dimensionality Reduction Approaches for Machine Learning Based Network Intrusion Detection. Electronics 2019, 8, 322. [Google Scholar] [CrossRef]
Pereira, F.; Mitchell, T.; Botvinick, M. Machine Learning Classifiers and FMRI: A Tutorial Overview. Neuroimage 2009, 45, S199–S209. [Google Scholar] [CrossRef] [PubMed]
Penny, W.; Friston, K.; Ashburner, J.; Kiebel, S.; Nichols, T. Statistical Parametric Mapping: The Analysis of Functional Brain Images; Elsevier: Amsterdam, The Netherlands, 2007. [Google Scholar] [CrossRef]
Giersch, C. Mathematical Modelling of Metabolism. Curr. Opin. Plant Biol. 2000, 3, 249–253. [Google Scholar] [CrossRef] [PubMed]
Gombert, A.K.; Nielsen, J. Mathematical Modelling of Metabolism. Curr. Opin. Biotechnol. 2000, 11, 180–186. [Google Scholar] [CrossRef]
Bailey, J.E. Mathematical Modeling and Analysis in Biochemical Engineering: Past Accomplishments and Future Opportunities. Biotechnol. Prog. 2008, 14, 8–20. [Google Scholar] [CrossRef]
Byrne, H.M. Dissecting Cancer through Mathematics: From the Cell to the Animal Model. Nat. Rev. Cancer 2010, 10, 221–230. [Google Scholar] [CrossRef]
Barbolosi, D.; Ciccolini, J.; Lacarelle, B.; Barlési, F.; André, N. Computational Oncology—Mathematical Modelling of Drug Regimens for Precision Medicine. Nat. Rev. Clin. Oncol. 2015, 13, 242–254. [Google Scholar] [CrossRef]
Nordling, C.O. A New Theory on Cancer-Inducing Mechanism. Br. J. Cancer 1953, 7, 68–72. [Google Scholar] [CrossRef]
Moolgavkar, S.H. The Multistage Theory of Carcinogenesis and the Age Distribution of Cancer in Man. JNCI J. Natl. Cancer Inst. 1978, 61, 49–52. [Google Scholar] [CrossRef]
Hornsby, C.; Page, K.M.; Tomlinson, I.P. What Can We Learn from the Population Incidence of Cancer? Armitage and Doll Revisited. Lancet Oncol. 2007, 8, 1030–1038. [Google Scholar] [CrossRef]
Armitage, P.; Doll, R. The Age Distribution of Cancer and a Multi-Stage Theory of Carcinogenesis. Br. J. Cancer 1954, 8, 1983–1989. [Google Scholar] [CrossRef]
Ashley, D.J.B. The Two “Hit” and Multiple “Hit” Theories of Carcinogenesis. Br. J. Cancer 1969, 23, 313–328. [Google Scholar] [CrossRef]
Armitage, P.; Doll, R. The Age Distribution of Cancer and a Multi-Stage Theory of Carcinogenosis. Int. J. Epidemiol. 2004, 33, 1174–1179. [Google Scholar] [CrossRef]
Wilkins, A.; Corbett, R.; Eeles, R. Age Distribution and a Multi-Stage Theory of Carcinogenesis: 70 Years On. Br. J. Cancer 2022, 128, 404–406. [Google Scholar] [CrossRef] [PubMed]
Joseph, S. The Linear Quadratic Model: Usage, Interpretation and Challenges. Phys. Med. Biol. 2019, 64, 01TR01. [Google Scholar] [CrossRef]
Loap, P.; Fourquet, A.; Kirova, Y. The Limits of the Linear Quadratic (LQ) Model for Late Cardiotoxicity Prediction: Example of Hypofractionated Rotational Intensity Modulated Radiation Therapy (IMRT) for Breast Cancer. Int. J. Radiat. Oncol. 2020, 106, 1106–1108. [Google Scholar] [CrossRef] [PubMed]
Wang, Z.; Kerketta, R.; Chuang, Y.L.; Dogra, P.; Butner, J.D.; Brocato, T.A.; Day, A.; Xu, R.; Shen, H.; Simbawa, E.; et al. Theory and Experimental Validation of a Spatio-Temporal Model of Chemotherapy Transport to Enhance Tumor Cell Kill. PLoS Comput. Biol. 2016, 12, e1004969. [Google Scholar] [CrossRef] [PubMed]
Gong, C.; Milberg, O.; Wang, B.; Vicini, P.; Narwal, R.; Roskos, L.; Popel, A.S. A Computational Multiscale Agent-Based Model for Simulating Spatio-Temporal Tumour Immune Response to PD1 and PDL1 Inhibition. J. R. Soc. Interface 2017, 14, 20170320. [Google Scholar] [CrossRef]
Buchan, D.W.A.; Jones, D.T. The PSIPRED Protein Analysis Workbench: 20 Years On. Nucleic Acids Res. 2019, 47, W402–W407. [Google Scholar] [CrossRef] [PubMed]
Altman, N.; Krzywinski, M. Points of Significance: Clustering. Nat. Methods 2017, 14, 545–547. [Google Scholar] [CrossRef]
Ihme, M.; Chung, W.T.; Mishra, A.A. Combustion Machine Learning: Principles, Progress and Prospects. Prog. Energy Combust. Sci. 2022, 91, 101010. [Google Scholar] [CrossRef]
Badillo, S.; Banfai, B.; Birzele, F.; Davydov, I.I.; Hutchinson, L.; Kam-Thong, T.; Siebourg-Polster, J.; Steiert, B.; Zhang, J.D. An Introduction to Machine Learning. Clin. Pharmacol. Ther. 2020, 107, 871–885. [Google Scholar] [CrossRef]
Roy, S.; Meena, T.; Lim, S.J. Demystifying Supervised Learning in Healthcare 4.0: A New Reality of Transforming Diagnostic Medicine. Diagnostics 2022, 12, 2549. [Google Scholar] [CrossRef] [PubMed]
Alloghani, M.; Al-Jumeily, D.; Mustafina, J.; Hussain, A.; Aljaaf, A.J. A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science. In Supervised and Unsupervised Learning for Data Science; Springer: Berlin/Heidelberg, Germany, 2020; pp. 3–21. [Google Scholar] [CrossRef]
Koch, M.A.; Waldmann, H. Protein Structure Similarity Clustering and Natural Product Structure as Guiding Principles in Drug Discovery. Drug Discov. Today 2005, 10, 471–483. [Google Scholar] [CrossRef] [PubMed]
Gemma, A.; Li, C.; Sugiyama, Y.; Matsuda, K.; Seike, Y.; Kosaihira, S.; Minegishi, Y.; Noro, R.; Nara, M.; Seike, M.; et al. Anticancer Drug Clustering in Lung Cancer Based on Gene Expression Profiles and Sensitivity Database. BMC Cancer 2006, 6, 174. [Google Scholar] [CrossRef] [PubMed]
Ardlie, K.G.; DeLuca, D.S.; Segrè, A.V.; Sullivan, T.J.; Young, T.R.; Gelfand, E.T.; Trowbridge, C.A.; Maller, J.B.; Tukiainen, T.; Lek, M.; et al. The Genotype-Tissue Expression (GTEx) Pilot Analysis: Multitissue Gene Regulation in Humans. Science 2015, 348, 648–660. [Google Scholar] [CrossRef]
Walsh, D.; Rybicki, L. Symptom Clustering in Advanced Cancer. Support. Care Cancer 2006, 14, 831–836. [Google Scholar] [CrossRef]
Wang, C.; Machiraju, R.; Huang, K. Breast Cancer Patient Stratification Using a Molecular Regularized Consensus Clustering Method. Methods 2014, 67, 304–312. [Google Scholar] [CrossRef] [PubMed]
Becht, E.; McInnes, L.; Healy, J.; Dutertre, C.A.; Kwok, I.W.H.; Ng, L.G.; Ginhoux, F.; Newell, E.W. Dimensionality Reduction for Visualizing Single-Cell Data Using UMAP. Nat. Biotechnol. 2018, 37, 38–44. [Google Scholar] [CrossRef] [PubMed]
Ezzat, A.; Wu, M.; Li, X.L.; Kwoh, C.K. Drug-Target Interaction Prediction Using Ensemble Learning and Dimensionality Reduction. Methods 2017, 129, 81–88. [Google Scholar] [CrossRef]
Lee, I.; Shin, Y.J. Machine Learning for Enterprises: Applications, Algorithm Selection, and Challenges. Bus. Horiz. 2020, 63, 157–170. [Google Scholar] [CrossRef]
Benhamou, P.Y.; Franc, S.; Reznik, Y.; Thivolet, C.; Schaepelynck, P.; Renard, E.; Guerci, B.; Chaillous, L.; Lukas-Croisier, C.; Jeandidier, N.; et al. Closed-Loop Insulin Delivery in Adults with Type 1 Diabetes in Real-Life Conditions: A 12-Week Multicentre, Open-Label Randomised Controlled Crossover Trial. Lancet Digit. Health 2019, 1, e17–e25. [Google Scholar] [CrossRef] [PubMed]
Popova, M.; Isayev, O.; Tropsha, A. Deep Reinforcement Learning for de Novo Drug Design. Sci. Adv. 2018, 4, eaap7885. [Google Scholar] [CrossRef] [PubMed]
Gaweda, A.E.; Muezzinoglu, M.K.; Aronoff, G.R.; Jacobs, A.A.; Zurada, J.M.; Brier, M.E. Individualization of Pharmacological Anemia Management Using Reinforcement Learning. Neural Netw. 2005, 18, 826–834. [Google Scholar] [CrossRef]
Turki, T.; Taguchi, Y.H. Machine Learning Algorithms for Predicting Drugs–Tissues Relationships. Expert Syst. Appl. 2019, 127, 167–186. [Google Scholar] [CrossRef]
Holzinger, A. Interactive Machine Learning for Health Informatics: When Do We Need the Human-in-the-Loop? Brain Inform. 2016, 3, 119–131. [Google Scholar] [CrossRef]
Miller, D.D.; Brown, E.W. Artificial Intelligence in Medical Practice: The Question to the Answer? Am. J. Med. 2018, 131, 129–133. [Google Scholar] [CrossRef]
Jaber, M.M.; Alameri, T.; Ali, M.H.; Alsyouf, A.; Al-Bsheish, M.; Aldhmadi, B.K.; Ali, S.Y.; Abd, S.K.; Ali, S.M.; Albaker, W.; et al. Remotely Monitoring COVID-19 Patient Health Condition Using Metaheuristics Convolute Networks from IoT-Based Wearable Device Health Data. Sensors 2022, 22, 1205. [Google Scholar] [CrossRef]
Wani, S.U.D.; Khan, N.A.; Thakur, G.; Gautam, S.P.; Ali, M.; Alam, P.; Alshehri, S.; Ghoneim, M.M.; Shakeel, F. Utilization of Artificial Intelligence in Disease Prevention: Diagnosis, Treatment, and Implications for the Healthcare Workforce. Healthcare 2022, 10, 608. [Google Scholar] [CrossRef]
Krishna Bharadwaj, H.; Agarwal, A.; Chamola, V.; Lakkaniga, R.; Hassija, V.; Guizani, M.; Sikdar, B. A Review on the Role of Machine Learning in Enabling IoT Based Healthcare Applications. IEEE Access 2021, 9, 38859–38890. [Google Scholar] [CrossRef]
Dugdale, J.; Moghaddam, M.T.; Alnaim, A.K.; Alwakeel, A.M. Machine-Learning-Based IoT–Edge Computing Healthcare Solutions. Electronics 2023, 12, 1027. [Google Scholar] [CrossRef]
Goecks, J.; Jalili, V.; Heiser, L.M.; Gray, J.W. How Machine Learning Will Transform Biomedicine. Cell 2020, 181, 92. [Google Scholar] [CrossRef] [PubMed]
Alber, M.; Buganza Tepole, A.; Cannon, W.R.; De, S.; Dura-Bernal, S.; Garikipati, K.; Karniadakis, G.; Lytton, W.W.; Perdikaris, P.; Petzold, L.; et al. Integrating Machine Learning and Multiscale Modeling—Perspectives, Challenges, and Opportunities in the Biological, Biomedical, and Behavioral Sciences. NPJ Digit. Med. 2019, 2, 115. [Google Scholar] [CrossRef] [PubMed]
Bote-Curiel, L.; Muñoz-Romero, S.; Gerrero-Curieses, A.; Rojo-Álvarez, J.L. Deep Learning and Big Data in Healthcare: A Double Review for Critical Beginners. Appl. Sci. 2019, 9, 2331. [Google Scholar] [CrossRef]
Abdulmalek, S.; Nasir, A.; Jabbar, W.A.; Almuhaya, M.A.M.; Bairagi, A.K.; Khan, M.A.; Kee, A.-M.; Chen, T.; Abdulmalek, S.; Nasir, A.; et al. IoT-Based Healthcare-Monitoring System towards Improving Quality of Life: A Review. Healthcare 2022, 10, 1993. [Google Scholar] [CrossRef]
Gupta, R.; Srivastava, D.; Sahu, M.; Tiwari, S.; Ambasta, R.K.; Kumar, P. Artificial Intelligence to Deep Learning: Machine Intelligence Approach for Drug Discovery. Mol. Divers. 2021, 25, 1315. [Google Scholar] [CrossRef] [PubMed]
Lavecchia, A. Machine-Learning Approaches in Drug Discovery: Methods and Applications. Drug Discov. Today 2015, 20, 318–331. [Google Scholar] [CrossRef]
Cassidy, J.W.; Cassidy, J.W. Applications of Machine Learning in Drug Discovery I: Target Discovery and Small Molecule Drug Design. In Artificial Intelligence in Oncology Drug Discovery and Development; IntechOpen: London, UK, 2020. [Google Scholar] [CrossRef]
Zhong, F.; Xing, J.; Li, X.; Liu, X.; Fu, Z.; Xiong, Z.; Lu, D.; Wu, X.; Zhao, J.; Tan, X.; et al. Artificial Intelligence in Drug Design. Sci. China Life Sci. 2018, 61, 1191–1204. [Google Scholar] [CrossRef]
Riniker, S.; Wang, Y.; Jenkins, J.L.; Landrum, G.A. Using Information from Historical High-Throughput Screens to Predict Active Compounds. J. Chem. Inf. Model. 2014, 54, 1880–1891. [Google Scholar] [CrossRef]
Jeon, J.; Nim, S.; Teyra, J.; Datti, A.; Wrana, J.L.; Sidhu, S.S.; Moffat, J.; Kim, P.M. A Systematic Approach to Identify Novel Cancer Drug Targets Using Machine Learning, Inhibitor Design and High-Throughput Screening. Genome Med. 2014, 6, 57. [Google Scholar] [CrossRef] [PubMed]
Mamoshina, P.; Volosnikova, M.; Ozerov, I.V.; Putin, E.; Skibina, E.; Cortese, F.; Zhavoronkov, A. Machine Learning on Human Muscle Transcriptomic Data for Biomarker Discovery and Tissue-Specific Drug Target Identification. Front. Genet. 2018, 9, 378508. [Google Scholar] [CrossRef] [PubMed]
Ferrero, E.; Dunham, I.; Sanseau, P. In Silico Prediction of Novel Therapeutic Targets Using Gene-Disease Association Data. J. Transl. Med. 2017, 15, 182. [Google Scholar] [CrossRef] [PubMed]
Godinez, W.J.; Hossain, I.; Lazic, S.E.; Davies, J.W.; Zhang, X. A Multi-Scale Convolutional Neural Network for Phenotyping High-Content Cellular Images. Bioinformatics 2017, 33, 2010–2019. [Google Scholar] [CrossRef] [PubMed]
Olsen, T.; Jackson, B.; Feeser, T.; Kent, M.; Moad, J.; Krishnamurthy, S.; Lunsford, D.; Soans, R. Diagnostic Performance of Deep Learning Algorithms Applied to Three Common Diagnoses in Dermatopathology. J. Pathol. Inform. 2018, 9, 32. [Google Scholar] [CrossRef] [PubMed]
Lagarde, N.; Goldwaser, E.; Pencheva, T.; Jereva, D.; Pajeva, I.; Rey, J.; Tuffery, P.; Villoutreix, B.O.; Miteva, M.A. A Free Web-Based Protocol to Assist Structure-Based Virtual Screening Experiments. Int. J. Mol. Sci. 2019, 20, 4648. [Google Scholar] [CrossRef]
Ha, E.J.; Lwin, C.T.; Durrant, J.D. LigGrep: A Tool for Filtering Docked Poses to Improve Virtual-Screening Hit Rates. J. Cheminform. 2020, 12, 69. [Google Scholar] [CrossRef]
Hu, J.; Liu, Z.; Yu, D.J.; Zhang, Y. LS-Align: An Atom-Level, Flexible Ligand Structural Alignment Algorithm for High-Throughput Virtual Screening. Bioinformatics 2018, 34, 2209. [Google Scholar] [CrossRef]
Seifert, M.H.J. ProPose: Steered Virtual Screening by Simultaneous Protein-Ligand Docking and Ligand-Ligand Alignment. J. Chem. Inf. Model. 2005, 45, 449–460. [Google Scholar] [CrossRef]
Gattani, S.; Mishra, A.; Hoque, M.T. StackCBPred: A Stacking Based Prediction of Protein-Carbohydrate Binding Sites from Sequence. Carbohydr. Res. 2019, 486, 107857. [Google Scholar] [CrossRef] [PubMed]
Schellhammer, I.; Rarey, M. TrixX: Structure-Based Molecule Indexing for Large-Scale Virtual Screening in Sublinear Time. J. Comput. Aided. Mol. Des. 2007, 21, 223–238. [Google Scholar] [CrossRef]
Beutels, P.; Jia, N.; Zhou, Q.Y.; Smith, R.; Cao, W.C.; De Vlas, S.J. The Economic Impact of SARS in Beijing, China. Trop. Med. Int. Health 2009, 14, 85–91. [Google Scholar] [CrossRef] [PubMed]
Castillo-Chavez, C.; Curtiss, R.; Daszak, P.; Levin, S.A.; Patterson-Lomba, O.; Perrings, C.; Poste, G.; Towers, S. Beyond Ebola: Lessons to Mitigate Future Pandemics. Lancet Glob. Health 2015, 3, e354–e355. [Google Scholar] [CrossRef]
Nokes, D.J.; Anderson, R.M. The Use of Mathematical Models in the Epidemiological Study of Infectious Diseases and in the Design of Mass Immunization Programmes. Epidemiol. Infect. 1988, 101, 1–20. [Google Scholar] [CrossRef] [PubMed]
Sharma, M.K.; Dhiman, N.; Vandana; Mishra, V.N. Mediative Fuzzy Logic Mathematical Model: A Contradictory Management Prediction in COVID-19 Pandemic. Appl. Soft Comput. 2021, 105, 107285. [Google Scholar] [CrossRef] [PubMed]
Heidari, A.; Jafari Navimipour, N.; Unal, M.; Toumaj, S. Machine Learning Applications for COVID-19 Outbreak Management. Neural Comput. Appl. 2022, 34, 15313–15348. [Google Scholar] [CrossRef]
Kavadi, D.P.; Patan, R.; Ramachandran, M.; Gandomi, A.H. Partial Derivative Nonlinear Global Pandemic Machine Learning Prediction of COVID 19. Chaos Solitons Fractals 2020, 139, 110056. [Google Scholar] [CrossRef]
Dairi, A.; Harrou, F.; Zeroual, A.; Hittawe, M.M.; Sun, Y. Comparative Study of Machine Learning Methods for COVID-19 Transmission Forecasting. J. Biomed. Inform. 2021, 118, 103791. [Google Scholar] [CrossRef]
Masum, M.; Masud, M.A.; Adnan, M.I.; Shahriar, H.; Kim, S. Comparative Study of a Mathematical Epidemic Model, Statistical Modeling, and Deep Learning for COVID-19 Forecasting and Management. Socioecon. Plann. Sci. 2022, 80, 101249. [Google Scholar] [CrossRef]
Small, M.; Tse, C.K.; Walker, D.M. Super-Spreaders and the Rate of Transmission of the SARS Virus. Phys. D 2006, 215, 146. [Google Scholar] [CrossRef]
Small, M.; Tse, C.K. Clustering Model for Transmission of the SARS Virus: Application to Epidemic Control and Risk Assessment. Phys. A Stat. Mech. Its Appl. 2005, 351, 499–511. [Google Scholar] [CrossRef]
Chakrabarti, D.; Wang, Y.; Wang, C.; Leskovec, J.; Faloutsos, C. Epidemic Thresholds in Real Networks. ACM Trans. Inf. Syst. Secur. 2008, 10, 1. [Google Scholar] [CrossRef]
Hassan, J.; Haigh, C.; Ahmed, T.; Uddin, M.J.; Das, D.B. Potential of Microneedle Systems for COVID-19 Vaccination: Current Trends and Challenges. Pharmaceutics 2022, 14, 1066. [Google Scholar] [CrossRef] [PubMed]
Mohamadou, Y.; Halidou, A.; Kapen, P.T. A Review of Mathematical Modeling, Artificial Intelligence and Datasets Used in the Study, Prediction and Management of COVID-19. Appl. Intell. 2020, 50, 3913–3925. [Google Scholar] [CrossRef] [PubMed]
Gao, W.; Baskonus, H.M.; Shi, L. New Investigation of Bats-Hosts-Reservoir-People Coronavirus Model and Application to 2019-NCoV System. Adv. Differ. Equ. 2020, 2020, 391. [Google Scholar] [CrossRef] [PubMed]
Nazir, G.; Zeb, A.; Shah, K.; Saeed, T.; Khan, R.A.; Ullah Khan, S.I. Study of COVID-19 Mathematical Model of Fractional Order via Modified Euler Method. Alex. Eng. J. 2021, 60, 5287–5296. [Google Scholar] [CrossRef]
Chen, T.M.; Rui, J.; Wang, Q.P.; Zhao, Z.Y.; Cui, J.A.; Yin, L. A Mathematical Model for Simulating the Phase-Based Transmissibility of a Novel Coronavirus. Infect. Dis. Poverty 2020, 9, 24. [Google Scholar] [CrossRef] [PubMed]
Shaikh, A.S.; Shaikh, I.N.; Nisar, K.S. A Mathematical Model of COVID-19 Using Fractional Derivative: Outbreak in India with Dynamics of Transmission and Control. Adv. Differ. Equ. 2020, 2020, 373. [Google Scholar] [CrossRef] [PubMed]
Abdulwasaa, M.A.; Abdo, M.S.; Shah, K.; Nofal, T.A.; Panchal, S.K.; Kawale, S.V.; Abdel-Aty, A.H. Fractal-Fractional Mathematical Modeling and Forecasting of New Cases and Deaths of COVID-19 Epidemic Outbreaks in India. Results Phys. 2021, 20, 103702. [Google Scholar] [CrossRef] [PubMed]
Yaseen, A.S.A. Impact of Social Determinants on COVID-19 Infections: A Comprehensive Study from Saudi Arabia Governorates. Humanit. Soc. Sci. Commun. 2022, 9, 355. [Google Scholar] [CrossRef] [PubMed]
Yeh, S.S. Tourism Recovery Strategy against COVID-19 Pandemic. Tour. Recreat. Res. 2020, 46, 188–194. [Google Scholar] [CrossRef]
Wan, H.; Cui, J.A.; Yang, G.J. Risk Estimation and Prediction of the Transmission of Coronavirus Disease-2019 (COVID-19) in the Mainland of China Excluding Hubei Province. Infect. Dis. Poverty 2020, 9, 116. [Google Scholar] [CrossRef] [PubMed]
Mbuvha, R.; Marwala, T. Bayesian Inference of COVID-19 Spreading Rates in South Africa. PLoS ONE 2020, 15, e0237126. [Google Scholar] [CrossRef]
Poonvoralak, W. Bayesian Markov Chain Monte Carlo for Reparameterized Stochastic Volatility Models Using Asian FX Rates during COVID-19. J. Appl. Stat. 2022, 50, 1853–1875. [Google Scholar] [CrossRef]
Zhicheng, D.; Yuantao, H.; Yongyue, W.; Zhijie, Z.; Sipeng, S.; Yang, Z.; Jinling, T.; Feng, C.; Qingwu, J.; Liming, L. Using Markov Chain Monte Carlo Methods to Estimate the Age-Specific Case Fatality Rate of COVID-19. Chin. J. Epidemiol. 2020, 41, 1777–1781. [Google Scholar] [CrossRef]
Adiga, A.; Dubhashi, D.; Lewis, B.; Marathe, M.; Venkatramanan, S.; Vullikanti, A. Mathematical Models for COVID-19 Pandemic: A Comparative Analysis. J. Indian Inst. Sci. 2020, 100, 793–807. [Google Scholar] [CrossRef]
Humphries, R.; Spillane, M.; Mulchrone, K.; Wieczorek, S.; O’Riordain, M.; Hövel, P. A Metapopulation Network Model for the Spreading of SARS-CoV-2: Case Study for Ireland. Infect. Dis. Model. 2021, 6, 420. [Google Scholar] [CrossRef]
Zhao, Z.Y.; Zhu, Y.Z.; Xu, J.W.; Hu, S.X.; Hu, Q.Q.; Lei, Z.; Rui, J.; Liu, X.C.; Wang, Y.; Yang, M.; et al. A Five-Compartment Model of Age-Specific Transmissibility of SARS-CoV-2. Infect. Dis. Poverty 2020, 9, 35–49. [Google Scholar] [CrossRef]
Chen, M.; Li, M.; Hao, Y.; Liu, Z.; Hu, L.; Wang, L. The Introduction of Population Migration to SEIAR for COVID-19 Epidemic Modeling with an Efficient Intervention Strategy. Inf. Fusion 2020, 64, 252–258. [Google Scholar] [CrossRef] [PubMed]
Youssef, H.; Alghamdi, N.; Ezzat, M.A.; El-Bary, A.A.; Shawky, A.M. Study on the SEIQR Model and Applying the Epidemiological Rates of COVID-19 Epidemic Spread in Saudi Arabia. Infect. Dis. Model. 2021, 6, 678–692. [Google Scholar] [CrossRef] [PubMed]
Vyasarayani, C.P.; Chatterjee, A. New Approximations, and Policy Implications, from a Delayed Dynamic Model of a Fast Pandemic. Phys. D Nonlinear Phenom. 2020, 414, 132701. [Google Scholar] [CrossRef]
He, S.; Peng, Y.; Sun, K. SEIR Modeling of the COVID-19 and Its Dynamics. Nonlinear Dyn. 2020, 101, 1667–1680. [Google Scholar] [CrossRef] [PubMed]
Annas, S.; Isbar Pratama, M.; Rifandi, M.; Sanusi, W.; Side, S. Stability Analysis and Numerical Simulation of SEIR Model for Pandemic COVID-19 Spread in Indonesia. Chaos Solitons Fractals 2020, 139, 110072. [Google Scholar] [CrossRef] [PubMed]
Mwalili, S.; Kimathi, M.; Ojiambo, V.; Gathungu, D.; Mbogo, R. SEIR Model for COVID-19 Dynamics Incorporating the Environment and Social Distancing. BMC Res. Notes 2020, 13, 352. [Google Scholar] [CrossRef]
Reiner, R.C.; Barber, R.M.; Collins, J.K.; Zheng, P.; Adolph, C.; Albright, J.; Antony, C.M.; Aravkin, A.Y.; Bachmeier, S.D.; Bang-Jensen, B.; et al. Modeling COVID-19 Scenarios for the United States. Nat. Med. 2020, 27, 94–105. [Google Scholar] [CrossRef]
Haque, S.E.; Rahman, M. Association between Temperature, Humidity, and COVID-19 Outbreaks in Bangladesh. Environ. Sci. Policy 2020, 114, 253–255. [Google Scholar] [CrossRef]
Prem, K.; Liu, Y.; Russell, T.W.; Kucharski, A.J.; Eggo, R.M.; Davies, N.; Flasche, S.; Clifford, S.; Pearson, C.A.B.; Munday, J.D.; et al. The Effect of Control Strategies to Reduce Social Mixing on Outcomes of the COVID-19 Epidemic in Wuhan, China: A Modelling Study. Lancet Public Health 2020, 5, e261–e270. [Google Scholar] [CrossRef]
Cooper, I.; Mondal, A.; Antonopoulos, C.G. A SIR Model Assumption for the Spread of COVID-19 in Different Communities. Chaos Solitons Fractals 2020, 139, 110057. [Google Scholar] [CrossRef]
Kudryashov, N.A.; Chmykhov, M.A.; Vigdorowitsch, M. Analytical Features of the SIR Model and Their Applications to COVID-19. Appl. Math. Model. 2021, 90, 466–473. [Google Scholar] [CrossRef] [PubMed]
Odagaki, T. Exact Properties of SIQR Model for COVID-19. Phys. A Stat. Mech. Its Appl. 2021, 564, 125564. [Google Scholar] [CrossRef] [PubMed]
Odagaki, T. Analysis of the Outbreak of COVID-19 in Japan by SIQR Model. Infect. Dis. Model. 2020, 5, 691–698. [Google Scholar] [CrossRef] [PubMed]
Apostolopoulos, I.D.; Mpesiana, T.A. COVID-19: Automatic Detection from X-Ray Images Utilizing Transfer Learning with Convolutional Neural Networks. Phys. Eng. Sci. Med. 2020, 43, 635–640. [Google Scholar] [CrossRef]
Shi, F.; Xia, L.; Shan, F.; Song, B.; Wu, D.; Wei, Y.; Yuan, H.; Jiang, H.; He, Y.; Gao, Y.; et al. Large-Scale Screening to Distinguish between COVID-19 and Community-Acquired Pneumonia Using Infection Size-Aware Classification. Phys. Med. Biol. 2021, 66, 065031. [Google Scholar] [CrossRef] [PubMed]
Li, K.; Wu, J.; Wu, F.; Guo, D.; Chen, L.; Fang, Z.; Li, C. The Clinical and Chest CT Features Associated With Severe and Critical COVID-19 Pneumonia. Investig. Radiol. 2020, 55, 327–331. [Google Scholar] [CrossRef]
Wang, C.; Deng, R.; Gou, L.; Fu, Z.; Zhang, X.; Shao, F.; Wang, G.; Fu, W.; Xiao, J.; Ding, X.; et al. Preliminary Study to Identify Severe from Moderate Cases of COVID-19 Using Combined Hematology Parameters. Ann. Transl. Med. 2020, 8, 593. [Google Scholar] [CrossRef]
Narin, A.; Kaya, C.; Pamuk, Z. Automatic Detection of Coronavirus Disease (COVID-19) Using X-Ray Images and Deep Convolutional Neural Networks. Pattern Anal. Appl. 2020, 24, 1207–1220. [Google Scholar] [CrossRef]
Li, L.; Qin, L.; Xu, Z.; Yin, Y.; Wang, X.; Kong, B.; Bai, J.; Lu, Y.; Fang, Z.; Song, Q.; et al. Artificial Intelligence Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT. Radiology 2020, 296, E65–E71. [Google Scholar] [CrossRef]
Wang, L.; Lin, Z.Q.; Wong, A. COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest X-Ray Images. Sci. Rep. 2020, 10, 19549. [Google Scholar] [CrossRef]
Libbrecht, M.W.; Noble, W.S. Machine Learning Applications in Genetics and Genomics. Nat. Rev. Genet. 2015, 16, 321–332. [Google Scholar] [CrossRef] [PubMed]
Telenti, A.; Lippert, C.; Chang, P.C.; DePristo, M. Deep Learning of Genomic Variation and Regulatory Network Data. Hum. Mol. Genet. 2018, 27, R63–R71. [Google Scholar] [CrossRef] [PubMed]
De Souza, N. The ENCODE Project. Nat. Methods 2012, 9, 1046. [Google Scholar] [CrossRef]
Navarro, F.C.P.; Mohsen, H.; Yan, C.; Li, S.; Gu, M.; Meyerson, W.; Gerstein, M. Genomics and Data Science: An Application within an Umbrella. Genome Biol. 2019, 20, 109. [Google Scholar] [CrossRef] [PubMed]
Porter, C.J.; Palidwor, G.A.; Sandie, R.; Krzyzanowski, P.M.; Muro, E.M.; Perez-Iratxeta, C.; Andrade-Navarro, M.A. StemBase: A Resource for the Analysis of Stem Cell Gene Expression Data. Methods Mol. Biol. 2007, 407, 137–148. [Google Scholar] [CrossRef] [PubMed]
Som, A.; Harder, C.; Greber, B.; Siatkowski, M.; Paudel, Y.; Warsow, G.; Cap, C.; ler, H.S.; Fuellen, G. The PluriNetWork: An Electronic Representation of the Network Underlying Pluripotency in Mouse, and Its Applications. PLoS ONE 2010, 5, e15165. [Google Scholar] [CrossRef] [PubMed]
Schulz, H.; Kolde, R.; Adler, P.; Aksoy, I.; Anastassiadis, K.; Bader, M.; Billon, N.; Boeuf, H.; Bourillot, P.Y.; Buchholz, F.; et al. The FunGenES Database: A Genomics Resource for Mouse Embryonic Stem Cell Differentiation. PLoS ONE 2009, 4, 6804. [Google Scholar] [CrossRef] [PubMed]
Müller, F.J.; Laurent, L.C.; Kostka, D.; Ulitsky, I.; Williams, R.; Lu, C.; Park, I.H.; Rao, M.S.; Shamir, R.; Schwartz, P.H.; et al. Regulatory Networks Define Phenotypic Classes of Human Stem Cell Lines. Nature 2008, 455, 401. [Google Scholar] [CrossRef]
Xu, H.; Schaniel, C.; Lemischka, I.R.; Ma’ayan, A. Toward a Complete in Silico, Multi-Layered Embryonic Stem Cell Regulatory Network. Wiley Interdiscip. Rev. Syst. Biol. Med. 2010, 2, 708. [Google Scholar] [CrossRef]
Yu, J.; Xing, X.; Zeng, L.; Sun, J.; Li, W.; Sun, H.; He, Y.; Li, J.; Zhang, G.; Wang, C.; et al. SyStemCell: A Database Populated with Multiple Levels of Experimental Data from Stem Cell Differentiation Research. PLoS ONE 2012, 7, e35230. [Google Scholar] [CrossRef]
Xu, H.; Baroukh, C.; Dannenfelser, R.; Chen, E.Y.; Tan, C.M.; Kou, Y.; Kim, Y.E.; Lemischka, I.R.; Ma’ayan, A. ESCAPE: Database for Integrating High-Content Published Data Collected from Human and Mouse Embryonic Stem Cells. Database J. Biol. Databases Curation 2013, 2013, bat045. [Google Scholar] [CrossRef]
Dogan, M.V.; Grumbach, I.M.; Michaelson, J.J.; Philibert, R.A. Integrated Genetic and Epigenetic Prediction of Coronary Heart Disease in the Framingham Heart Study. PLoS ONE 2018, 13, e0190549. [Google Scholar] [CrossRef]
Capper, D.; Jones, D.T.W.; Sill, M.; Hovestadt, V.; Schrimpf, D.; Sturm, D.; Koelsche, C.; Sahm, F.; Chavez, L.; Reuss, D.E.; et al. DNA Methylation-Based Classification of Central Nervous System Tumours. Nature 2018, 555, 469–474. [Google Scholar] [CrossRef]
Aref-Eshghi, E.; Rodenhiser, D.I.; Schenkel, L.C.; Lin, H.; Skinner, C.; Ainsworth, P.; Paré, G.; Hood, R.L.; Bulman, D.E.; Kernohan, K.D.; et al. Genomic DNA Methylation Signatures Enable Concurrent Diagnosis and Clinical Genetic Variant Classification in Neurodevelopmental Syndromes. Am. J. Hum. Genet. 2018, 102, 156–174. [Google Scholar] [CrossRef]
How Kit, A.; Nielsen, H.M.; Tost, J. DNA Methylation Based Biomarkers: Practical Considerations and Applications. Biochimie 2012, 94, 2314–2337. [Google Scholar] [CrossRef]
Aryee, M.J.; Jaffe, A.E.; Corrada-Bravo, H.; Ladd-Acosta, C.; Feinberg, A.P.; Hansen, K.D.; Irizarry, R.A. Minfi: A Flexible and Comprehensive Bioconductor Package for the Analysis of Infinium DNA Methylation Microarrays. Bioinformatics 2014, 30, 1363–1369. [Google Scholar] [CrossRef] [PubMed]
Silva, T.C.; Colaprico, A.; Olsen, C.; D’Angelo, F.; Bontempi, G.; Ceccarelli, M.; Noushmehr, H. TCGA Workflow: Analyze Cancer Genomics and Epigenomics Data Using Bioconductor Packages. F1000Research 2016, 5, 1542. [Google Scholar] [CrossRef] [PubMed]
Jaffe, A.E.; Murakami, P.; Lee, H.; Leek, J.T.; Fallin, M.D.; Feinberg, A.P.; Irizarry, R.A. Bump Hunting to Identify Differentially Methylated Regions in Epigenetic Epidemiology Studies. Int. J. Epidemiol. 2012, 41, 200–209. [Google Scholar] [CrossRef] [PubMed]
Leung, M.K.K.; Delong, A.; Alipanahi, B.; Frey, B.J. Machine Learning in Genomic Medicine: A Review of Computational Problems and Data Sets. Proc. IEEE 2016, 104, 176–197. [Google Scholar] [CrossRef]
Sina, A.A.I.; Carrascosa, L.G.; Liang, Z.; Grewal, Y.S.; Wardiana, A.; Shiddiky, M.J.A.; Gardiner, R.A.; Samaratunga, H.; Gandhi, M.K.; Scott, R.J.; et al. Epigenetically Reprogrammed Methylation Landscape Drives the DNA Self-Assembly and Serves as a Universal Cancer Biomarker. Nat. Commun. 2018, 9, 4915. [Google Scholar] [CrossRef] [PubMed]
Hewitt, A.W.; Januar, V.; Sexton-Oates, A.; Joo, J.E.; Franchina, M.; Wang, J.J.; Liang, H.; Craig, J.E.; Saffery, R. DNA Methylation Landscape of Ocular Tissue Relative to Matched Peripheral Blood. Sci. Rep. 2017, 7, srep46330. [Google Scholar] [CrossRef]
Huang, Y.T.; Chu, S.; Loucks, E.B.; Lin, C.L.; Eaton, C.B.; Buka, S.L.; Kelsey, K.T. Epigenome-Wide Profiling of DNA Methylation in Paired Samples of Adipose Tissue and Blood. Epigenetics 2016, 11, 227–236. [Google Scholar] [CrossRef] [PubMed]
Senior, A.W.; Evans, R.; Jumper, J.; Kirkpatrick, J.; Sifre, L.; Green, T.; Qin, C.; Žídek, A.; Nelson, A.W.R.; Bridgland, A.; et al. Improved Protein Structure Prediction Using Potentials from Deep Learning. Nature 2020, 577, 706–710. [Google Scholar] [CrossRef] [PubMed]
Pandarinath, C.; O’Shea, D.J.; Collins, J.; Jozefowicz, R.; Stavisky, S.D.; Kao, J.C.; Trautmann, E.M.; Kaufman, M.T.; Ryu, S.I.; Hochberg, L.R.; et al. Inferring Single-Trial Neural Population Dynamics Using Sequential Auto-Encoders. Nat. Methods 2018, 15, 805–815. [Google Scholar] [CrossRef] [PubMed]
Antczak, M.; Michaelis, M.; Wass, M.N. Environmental Conditions Shape the Nature of a Minimal Bacterial Genome. Nat. Commun. 2019, 10, 3100. [Google Scholar] [CrossRef]
Kelley, D.R.; Snoek, J.; Rinn, J.L. Basset: Learning the Regulatory Code of the Accessible Genome with Deep Convolutional Neural Networks. Genome Res. 2016, 26, 990–999. [Google Scholar] [CrossRef]
Fudenberg, G.; Kelley, D.R.; Pollard, K.S. Predicting 3D Genome Folding from DNA Sequence with Akita. Nat. Methods 2020, 17, 1111–1117. [Google Scholar] [CrossRef]
Zeng, W.; Wu, M.; Jiang, R. Prediction of Enhancer-Promoter Interactions via Natural Language Processing. BMC Genom. 2018, 19, 13–22. [Google Scholar] [CrossRef]
Pires, D.E.V.; Ascher, D.B.; Blundell, T.L. DUET: A Server for Predicting Effects of Mutations on Protein Stability Using an Integrated Computational Approach. Nucleic Acids Res. 2014, 42, W314–W319. [Google Scholar] [CrossRef]
Yuan, Y.; Bar-Joseph, Z. Deep Learning for Inferring Gene Relationships from Single-Cell Expression Data. Proc. Natl. Acad. Sci. USA 2019, 116, 27151–27158. [Google Scholar] [CrossRef]
Zitnik, M.; Agrawal, M.; Leskovec, J. Modeling Polypharmacy Side Effects with Graph Convolutional Networks. Bioinformatics 2018, 34, i457–i466. [Google Scholar] [CrossRef] [PubMed]
Das, P.; Sercu, T.; Wadhawan, K.; Padhi, I.; Gehrmann, S.; Cipcigan, F.; Chenthamarakshan, V.; Strobelt, H.; dos Santos, C.; Chen, P.Y.; et al. Accelerated Antimicrobial Discovery via Deep Generative Models and Molecular Dynamics Simulations. Nat. Biomed. Eng. 2021, 5, 613–623. [Google Scholar] [CrossRef]
Yang, K.K.; Wu, Z.; Arnold, F.H. Machine-Learning-Guided Directed Evolution for Protein Engineering. Nat. Methods 2019, 16, 687–694. [Google Scholar] [CrossRef] [PubMed]
Dou, J.; Doyle, L.; Greisen, P.; Schena, A.; Park, H.; Johnsson, K.; Stoddard, B.L.; Baker, D. Sampling and Energy Evaluation Challenges in Ligand Binding Protein Design. Protein Sci. 2017, 26, 2426–2437. [Google Scholar] [CrossRef]
Garcia-Borràs, M.; Houk, K.N.; Jiménez-Osés, G. Computational Design of Protein Function. Comput. Tools Chem. Biol. 2017, 3, 87–107. [Google Scholar] [CrossRef]
Luo, Y.; Jiang, G.; Yu, T.; Liu, Y.; Vo, L.; Ding, H.; Su, Y.; Qian, W.W.; Zhao, H.; Peng, J. ECNet Is an Evolutionary Context-Integrated Deep Learning Framework for Protein Engineering. Nat. Commun. 2021, 12, 5743. [Google Scholar] [CrossRef] [PubMed]
Fox, R.J.; Davis, S.C.; Mundorff, E.C.; Newman, L.M.; Gavrilovic, V.; Ma, S.K.; Chung, L.M.; Ching, C.; Tam, S.; Muley, S.; et al. Improving Catalytic Function by ProSAR-Driven Enzyme Evolution. Nat. Biotechnol. 2007, 25, 338–344. [Google Scholar] [CrossRef]
Musdal, Y.; Govindarajan, S.; Mannervik, B. Exploring Sequence-Function Space of a Poplar Glutathione Transferase Using Designed Information-Rich Gene Variants. Protein Eng. Des. Sel. 2017, 30, 543–549. [Google Scholar] [CrossRef]
Bedbrook, C.N.; Yang, K.K.; Rice, A.J.; Gradinaru, V.; Arnold, F.H. Machine Learning to Design Integral Membrane Channelrhodopsins for Efficient Eukaryotic Expression and Plasma Membrane Localization. PLoS Comput. Biol. 2017, 13, e1005786. [Google Scholar] [CrossRef]
Freschlin, C.R.; Fahlberg, S.A.; Romero, P.A. Machine Learning to Navigate Fitness Landscapes for Protein Engineering. Curr. Opin. Biotechnol. 2022, 75, 102713. [Google Scholar] [CrossRef]
Li, G.; Dong, Y.; Reetz, M.T. Can Machine Learning Revolutionize Directed Evolution of Selective Enzymes? Adv. Synth. Catal. 2019, 361, 2377–2386. [Google Scholar] [CrossRef]
Dyba, T.; Randi, G.; Bray, F.; Martos, C.; Giusti, F.; Nicholson, N.; Gavin, A.; Flego, M.; Neamtiu, L.; Dimitrova, N.; et al. The European Cancer Burden in 2020: Incidence and Mortality Estimates for 40 Countries and 25 Major Cancers. Eur. J. Cancer 2021, 157, 308. [Google Scholar] [CrossRef]
Collins, L.G.; Haines, C.; Perkel, R.; Enck, R.E. Lung Cancer: Diagnosis and Management. Am. Fam. Physician 2007, 75, 56–63. [Google Scholar] [PubMed]
Thawani, R.; McLane, M.; Beig, N.; Ghose, S.; Prasanna, P.; Velcheti, V.; Madabhushi, A. Radiomics and Radiogenomics in Lung Cancer: A Review for the Clinician. Lung Cancer 2018, 115, 34–41. [Google Scholar] [CrossRef] [PubMed]
Bianconi, F.; Fravolini, M.L.; Palumbo, I.; Palumbo, B. Shape and Texture Analysis of Radiomic Data for Computer-Assisted Diagnosis and Prognostication: An Overview. In Design Tools and Methods in Industrial Engineering: Proceedings of the International Conference on Design Tools and Methods in Industrial Engineering (ADM 2019), Modena, Italy, 9–10 September 2019; Lecture Notes in Mechanical Engineering; Springer: Berlin/Heidelberg, Germany, 2020; pp. 3–14. [Google Scholar] [CrossRef]
Chalkidou, A.; O’Doherty, M.J.; Marsden, P.K. False Discovery Rates in PET and CT Studies with Texture Features: A Systematic Review. PLoS ONE 2015, 10, e0124165. [Google Scholar] [CrossRef] [PubMed]
Buvat, I.; Orlhac, F. The Dark Side of Radiomics: On the Paramount Importance of Publishing Negative Results. J. Nucl. Med. 2019, 60, 1543–1544. [Google Scholar] [CrossRef] [PubMed]
Joober, R.; Schmitz, N.; Annable, L.; Boksa, P. Publication Bias: What Are the Challenges and Can They Be Overcome? J. Psychiatry Neurosci. 2012, 37, 149–152. [Google Scholar] [CrossRef] [PubMed]
Sobue, T.; Moriyama, N.; Kaneko, M.; Kusumoto, M.; Kobayashi, T.; Tsuchiya, R.; Kakinuma, R.; Ohmatsu, H.; Nagai, K.; Nishiyama, H.; et al. Screening for Lung Cancer With Low-Dose Helical Computed Tomography: Anti-Lung Cancer Association Project. J. Clin. Oncol. 2016, 20, 911–920. [Google Scholar] [CrossRef] [PubMed]
Toyoda, Y.; Nakayama, T.; Kusunoki, Y.; Iso, H.; Suzuki, T. Sensitivity and Specificity of Lung Cancer Screening Using Chest Low-Dose Computed Tomography. Br. J. Cancer 2008, 98, 1602–1607. [Google Scholar] [CrossRef] [PubMed]
Nooreldeen, R.; Bach, H. Current and Future Development in Lung Cancer Diagnosis. Int. J. Mol. Sci. 2021, 22, 8661. [Google Scholar] [CrossRef]
Van Riel, S.J.; Ciompi, F.; Winkler Wille, M.M.; Dirksen, A.; Lam, S.; Scholten, E.T.; Rossi, S.E.; Sverzellati, N.; Naqibullah, M.; Wittenberg, R.; et al. Malignancy Risk Estimation of Pulmonary Nodules in Screening CTs: Comparison between a Computer Model and Human Observers. PLoS ONE 2017, 12, e0185032. [Google Scholar] [CrossRef] [PubMed]
Winkler Wille, M.M.; van Riel, S.J.; Saghir, Z.; Dirksen, A.; Pedersen, J.H.; Jacobs, C.; Thomsen, L.H.; Scholten, E.T.; Skovgaard, L.T.; van Ginneken, B. Predictive Accuracy of the PanCan Lung Cancer Risk Prediction Model -External Validation Based on CT from the Danish Lung Cancer Screening Trial. Eur. Radiol. 2015, 25, 3093–3099. [Google Scholar] [CrossRef]
Kriegsmann, M.; Casadonte, R.; Kriegsmann, J.; Dienemann, H.; Schirmacher, P.; Kobarg, J.H.; Schwamborn, K.; Stenzinger, A.; Warth, A.; Weichert, W. Reliable Entity Subtyping in Non-Small Cell Lung Cancer by Matrix-Assisted Laser Desorption/Ionization Imaging Mass Spectrometry on Formalin-Fixed Paraffinembedded Tissue Specimens. Mol. Cell. Proteom. 2016, 15, 3081–3089. [Google Scholar] [CrossRef]
Xie, Y.; Meng, W.Y.; Li, R.Z.; Wang, Y.W.; Qian, X.; Chan, C.; Yu, Z.F.; Fan, X.X.; Pan, H.D.; Xie, C.; et al. Early Lung Cancer Diagnostic Biomarker Discovery by Machine Learning Methods. Transl. Oncol. 2021, 14, 100907. [Google Scholar] [CrossRef] [PubMed]
Zeng, Z.; Mao, C.; Vo, A.; Li, X.; Nugent, J.O.; Khan, S.A.; Clare, S.E.; Luo, Y. Deep Learning for Cancer Type Classification and Driver Gene Identification. BMC Bioinform. 2021, 22, 491. [Google Scholar] [CrossRef] [PubMed]
Eraslan, G.; Avsec, Ž.; Gagneur, J.; Theis, F.J. Deep Learning: New Computational Modelling Techniques for Genomics. Nat. Rev. Genet. 2019, 20, 389–403. [Google Scholar] [CrossRef]
Jiao, W.; Atwal, G.; Polak, P.; Karlic, R.; Cuppen, E.; Al-Shahrour, F.; Atwal, G.; Bailey, P.J.; Biankin, A.V.; Boutros, P.C.; et al. A Deep Learning System Accurately Classifies Primary and Metastatic Cancers Using Passenger Mutation Patterns. Nat. Commun. 2020, 11, 728. [Google Scholar] [CrossRef]
Zhang, Y.; Li, Y.; Li, T.; Shen, X.; Zhu, T.; Tao, Y.; Li, X.; Wang, D.; Ma, Q.; Hu, Z.; et al. Genetic Load and Potential Mutational Meltdown in Cancer Cell Populations. Mol. Biol. Evol. 2019, 36, 541–552. [Google Scholar] [CrossRef]
Herath, S.; Sadeghi Rad, H.; Radfar, P.; Ladwa, R.; Warkiani, M.; O’Byrne, K.; Kulasinghe, A. The Role of Circulating Biomarkers in Lung Cancer. Front. Oncol. 2022, 11, 801269. [Google Scholar] [CrossRef]
Gould, M.K.; Huang, B.Z.; Tammemagi, M.C.; Kinar, Y.; Shiff, R. Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data. Am. J. Respir. Crit. Care Med. 2021, 204, 445–453. [Google Scholar] [CrossRef]
Li, Y.; Wu, X.; Yang, P.; Jiang, G.; Luo, Y. Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis. Genom. Proteom. Bioinform. 2022, 20, 850–866. [Google Scholar] [CrossRef]
Anil Kumar, C.; Harish, S.; Ravi, P.; Svn, M.; Kumar, B.P.P.; Mohanavel, V.; Alyami, N.M.; Priya, S.S.; Asfaw, A.K. Lung Cancer Prediction from Text Datasets Using Machine Learning. BioMed Res. Int. 2022, 2022, 6254177. [Google Scholar] [CrossRef]
Liang, W.; Zhao, Y.; Huang, W.; Gao, Y.; Xu, W.; Tao, J.; Yang, M.; Li, L.; Ping, W.; Shen, H.; et al. Non-Invasive Diagnosis of Early-Stage Lung Cancer Using High-Throughput Targeted DNA Methylation Sequencing of Circulating Tumor DNA (CtDNA). Theranostics 2019, 9, 2056–2070. [Google Scholar] [CrossRef]
Whitney, D.H.; Elashoff, M.R.; Porta-Smith, K.; Gower, A.C.; Vachani, A.; Ferguson, J.S.; Silvestri, G.A.; Brody, J.S.; Lenburg, M.E.; Spira, A. Derivation of a Bronchial Genomic Classifier for Lung Cancer in a Prospective Study of Patients Undergoing Diagnostic Bronchoscopy. BMC Med. Genom. 2015, 8, 18. [Google Scholar] [CrossRef]
Raman, L.; Van Der Linden, M.; Van Der Eecken, K.; Vermaelen, K.; Demedts, I.; Surmont, V.; Himpe, U.; Dedeurwaerdere, F.; Ferdinande, L.; Lievens, Y.; et al. Shallow Whole-Genome Sequencing of Plasma Cell-Free DNA Accurately Differentiates Small from Non-Small Cell Lung Carcinoma. Genome Med. 2020, 12, 35. [Google Scholar] [CrossRef] [PubMed]
Choi, Y.; Qu, J.; Wu, S.; Hao, Y.; Zhang, J.; Ning, J.; Yang, X.; Lofaro, L.; Pankratz, D.G.; Babiarz, J.; et al. Improving Lung Cancer Risk Stratification Leveraging Whole Transcriptome RNA Sequencing and Machine Learning across Multiple Cohorts. BMC Med. Genom. 2020, 13, 151. [Google Scholar] [CrossRef] [PubMed]
Vega, P.; Valentín, F.; Cubiella, J. Colorectal Cancer Diagnosis: Pitfalls and Opportunities. World J. Gastrointest. Oncol. 2015, 7, 422–433. [Google Scholar] [CrossRef] [PubMed]
Loey, M.; Jasim, M.W.; EL-Bakry, H.M.; Taha, M.H.N.; Khalifa, N.E.M. Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques. Symmetry 2020, 12, 408. [Google Scholar] [CrossRef]
Zhang, W.; Chen, X.; Wong, K.C. Noninvasive Early Diagnosis of Intestinal Diseases Based on Artificial Intelligence in Genomics and Microbiome. J. Gastroenterol. Hepatol. 2021, 36, 823–831. [Google Scholar] [CrossRef] [PubMed]
Zhang, L.; Sanagapalli, S.; Stoita, A. Challenges in Diagnosis of Pancreatic Cancer. World J. Gastroenterol. 2018, 24, 2047–2060. [Google Scholar] [CrossRef] [PubMed]
Davatzikos, C.; Sotiras, A.; Fan, Y.; Habes, M.; Erus, G.; Rathore, S.; Bakas, S.; Chitalia, R.; Gastounioti, A.; Kontos, D. Precision Diagnostics Based on Machine Learning-Derived Imaging Signatures. Magn. Reson. Imaging 2019, 64, 49. [Google Scholar] [CrossRef]
Chen, W.; Chen, Q.; Parker, R.A.; Zhou, Y.; Lustigova, E.; Wu, B.U. Risk Prediction of Pancreatic Cancer in Patients With Abnormal Morphologic Findings Related to Chronic Pancreatitis: A Machine Learning Approach. Gastro Hep Adv. 2022, 1, 1014–1026. [Google Scholar] [CrossRef]
Chu, L.C.; Park, S.; Kawamoto, S.; Fouladi, D.F.; Shayesteh, S.; Zinreich, E.S.; Graves, J.S.; Horton, K.M.; Hruban, R.H.; Yuille, A.L.; et al. Utility of CT Radiomics Features in Differentiation of Pancreatic Ductal Adenocarcinoma from Normal Pancreatic Tissue. Am. J. Roentgenol. 2019, 213, 349–357. [Google Scholar] [CrossRef] [PubMed]
Schultz, N.A.; Dehlendorff, C.; Jensen, B.V.; Bjerregaard, J.K.; Nielsen, K.R.; Bojesen, S.E.; Calatayud, D.; Nielsen, S.E.; Yilmaz, M.; Holländer, N.H.; et al. MicroRNA Biomarkers in Whole Blood for Detection of Pancreatic Cancer. JAMA 2014, 311, 392–404. [Google Scholar] [CrossRef]
Booth, T.C.; Williams, M.; Luis, A.; Cardoso, J.; Ashkan, K.; Shuaib, H. Machine Learning and Glioma Imaging Biomarkers. Clin. Radiol. 2020, 75, 20–32. [Google Scholar] [CrossRef] [PubMed]
Kibriya, H.; Amin, R.; Alshehri, A.H.; Masood, M.; Alshamrani, S.S.; Alshehri, A. A Novel and Effective Brain Tumor Classification Model Using Deep Feature Fusion and Famous Machine Learning Classifiers. Comput. Intell. Neurosci. 2022, 2022, 7897669. [Google Scholar] [CrossRef]
Takahashi, S.; Takahashi, M.; Kinoshita, M.; Miyake, M.; Kawaguchi, R.; Shinojima, N.; Mukasa, A.; Saito, K.; Nagane, M.; Otani, R.; et al. Fine-Tuning Approach for Segmentation of Gliomas in Brain Magnetic Resonance Images with a Machine Learning Method to Normalize Image Differences among Facilities. Cancers 2021, 13, 1415. [Google Scholar] [CrossRef] [PubMed]
Khodaei, H.; Hajiali, M.; Darvishan, A.; Sepehr, M.; Ghadimi, N. Fuzzy-Based Heat and Power Hub Models for Cost-Emission Operation of an Industrial Consumer Using Compromise Programming. Appl. Therm. Eng. 2018, 137, 395–405. [Google Scholar] [CrossRef]
Mohan, G.; Subashini, M.M. MRI Based Medical Image Analysis: Survey on Brain Tumor Grade Classification. Biomed. Signal Process. Control 2018, 39, 139–161. [Google Scholar] [CrossRef]
Mendonca, T.; Ferreira, P.M.; Marques, J.S.; Marcal, A.R.S.; Rozeira, J. PH²—A Dermoscopic Image Database for Research and Benchmarking. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Osaka, Japan, 3–7 July 2013; pp. 5437–5440. [Google Scholar] [CrossRef]
Brahmbhatt, P. Skin Lesion Segmentation Using SegNet with Binary Cross-Entropy. Int. J. Res. 2019, 10, 22–31. [Google Scholar]
Wu, Y.; Chen, B.; Zeng, A.; Pan, D.; Wang, R.; Zhao, S. Skin Cancer Classification With Deep Learning: A Systematic Review. Front. Oncol. 2022, 12, 893972. [Google Scholar] [CrossRef] [PubMed]
Marchetti, M.A.; Liopyris, K.; Dusza, S.W.; Codella, N.C.F.; Gutman, D.A.; Helba, B.; Kalloo, A.; Halpern, A.C.; Soyer, H.P.; Curiel-Lewandrowski, C.; et al. Computer Algorithms Show Potential for Improving Dermatologists’ Accuracy to Diagnose Cutaneous Melanoma: Results of the International Skin Imaging Collaboration 2017. J. Am. Acad. Dermatol. 2020, 82, 622–627. [Google Scholar] [CrossRef] [PubMed]
Marchetti, M.A.; Codella, N.C.F.; Dusza, S.W.; Gutman, D.A.; Helba, B.; Kalloo, A.; Mishra, N.; Carrera, C.; Celebi, M.E.; DeFazio, J.L.; et al. Results of the 2016 International Skin Imaging Collaboration International Symposium on Biomedical Imaging Challenge: Comparison of the Accuracy of Computer Algorithms to Dermatologists for the Diagnosis of Melanoma from Dermoscopic Images. J. Am. Acad. Dermatol. 2018, 78, 270–277.e1. [Google Scholar] [CrossRef] [PubMed]
Young, A.T.; Xiong, M.; Pfau, J.; Keiser, M.J.; Wei, M.L. Artificial Intelligence in Dermatology: A Primer. J. Investig. Dermatol. 2020, 140, 1504–1512. [Google Scholar] [CrossRef] [PubMed]
Pisner, D.A.; Schnyer, D.M. Support Vector Machine. In Machine Learning Methods and Applications to Brain Disorders; Academic Press: Cambridge, MA, USA, 2020; pp. 101–121. [Google Scholar] [CrossRef]
Tschandl, P.; Rosendahl, C.; Kittler, H. The HAM10000 Dataset, a Large Collection of Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions. Sci. Data 2018, 5, 180161. [Google Scholar] [CrossRef]
Codella, N.C.F.; Gutman, D.; Celebi, M.E.; Helba, B.; Marchetti, M.A.; Dusza, S.W.; Kalloo, A.; Liopyris, K.; Mishra, N.; Kittler, H.; et al. Skin Lesion Analysis toward Melanoma Detection: A Challenge at the 2017 International Symposium on Biomedical Imaging (ISBI), Hosted by the International Skin Imaging Collaboration (ISIC). In Proceedings of the 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), Washington, DC, USA, 4–7 April 2018; pp. 168–172. [Google Scholar] [CrossRef]
Alabi, R.O.; Youssef, O.; Pirinen, M.; Elmusrati, M.; Mäkitie, A.A.; Leivo, I.; Almangush, A. Machine Learning in Oral Squamous Cell Carcinoma: Current Status, Clinical Concerns and Prospects for Future—A Systematic Review. Artif. Intell. Med. 2021, 115, 102060. [Google Scholar] [CrossRef]
Huang, S.; Yang, J.; Fong, S.; Zhao, Q. Artificial Intelligence in Cancer Diagnosis and Prognosis: Opportunities and Challenges. Cancer Lett. 2020, 471, 61–71. [Google Scholar] [CrossRef]
Qian, Z.; Li, Y.; Wang, Y.; Li, L.; Li, R.; Wang, K.; Li, S.; Tang, K.; Zhang, C.; Fan, X.; et al. Differentiation of Glioblastoma from Solitary Brain Metastases Using Radiomic Machine-Learning Classifiers. Cancer Lett. 2019, 451, 128–135. [Google Scholar] [CrossRef]
Denkert, C.; von Minckwitz, G.; Darb-Esfahani, S.; Lederer, B.; Heppner, B.I.; Weber, K.E.; Budczies, J.; Huober, J.; Klauschen, F.; Furlanetto, J.; et al. Tumour-Infiltrating Lymphocytes and Prognosis in Different Subtypes of Breast Cancer: A Pooled Analysis of 3771 Patients Treated with Neoadjuvant Therapy. Lancet Oncol. 2018, 19, 40–50. [Google Scholar] [CrossRef]
Tan, A.; Huang, H.; Zhang, P.; Li, S. Network-Based Cancer Precision Medicine: A New Emerging Paradigm. Cancer Lett. 2019, 458, 39–45. [Google Scholar] [CrossRef] [PubMed]
Al-Ma’aitah, M.; AlZubi, A.A. Enhanced Computational Model for Gravitational Search Optimized Echo State Neural Networks Based Oral Cancer Detection. J. Med. Syst. 2018, 42, 205. [Google Scholar] [CrossRef]
Mermod, M.; Jourdan, E.F.; Gupta, R.; Bongiovanni, M.; Tolstonog, G.; Simon, C.; Clark, J.; Monnier, Y. Development and Validation of a Multivariable Prediction Model for the Identification of Occult Lymph Node Metastasis in Oral Squamous Cell Carcinoma. Head Neck 2020, 42, 1811–1820. [Google Scholar] [CrossRef]
Bur, A.M.; Holcomb, A.; Goodwin, S.; Woodroof, J.; Karadaghy, O.; Shnayder, Y.; Kakarala, K.; Brant, J.; Shew, M. Machine Learning to Predict Occult Nodal Metastasis in Early Oral Squamous Cell Carcinoma. Oral Oncol. 2019, 92, 20–25. [Google Scholar] [CrossRef]
Karadaghy, O.A.; Shew, M.; New, J.; Bur, A.M. Development and Assessment of a Machine Learning Model to Help Predict Survival Among Patients With Oral Squamous Cell Carcinoma. JAMA Otolaryngol. Head Neck Surg. 2019, 145, 1115. [Google Scholar] [CrossRef]
Xu, S.; Liu, Y.; Hu, W.; Zhang, C.; Liu, C.; Zong, Y.; Chen, S.; Lu, Y.; Yang, L.; Ng, E.Y.K.; et al. An Early Diagnosis of Oral Cancer Based on Three-Dimensional Convolutional Neural Networks. IEEE Access 2019, 7, 158603–158611. [Google Scholar] [CrossRef]
Alhazmi, A.; Alhazmi, Y.; Makrami, A.; Masmali, A.; Salawi, N.; Masmali, K.; Patil, S. Application of Artificial Intelligence and Machine Learning for Prediction of Oral Cancer Risk. J. Oral Pathol. Med. 2021, 50, 444–450. [Google Scholar] [CrossRef]
López-Cortés, X.A.; Matamala, F.; Venegas, B.; Rivera, C. Machine-Learning Applications in Oral Cancer: A Systematic Review. Appl. Sci. 2022, 12, 5715. [Google Scholar] [CrossRef]
Greenspan, H.P. Models for the Growth of a Solid Tumor by Diffusion. Stud. Appl. Math. 1972, 51, 317–340. [Google Scholar] [CrossRef]
Folkman, J.; Hochberg, M. Self-Regulation of Growth in Three Dimensions. J. Exp. Med. 1973, 138, 745. [Google Scholar] [CrossRef] [PubMed]
Matzavinos, A.; Chaplain, M.A.J.; Kuznetsov, V.A. Mathematical Modelling of the Spatio-temporal Response of Cytotoxic T-lymphocytes to a Solid Tumour. Math. Med. Biol. A J. IMA 2004, 21, 1–34. [Google Scholar] [CrossRef] [PubMed]
Knudson, A.G. Mutation and Cancer: Statistical Study of Retinoblastoma. Proc. Natl. Acad. Sci. USA 1971, 68, 820–823. [Google Scholar] [CrossRef]
Spencer, S.L.; Gerety, R.A.; Pienta, K.J.; Forrest, S. Modeling Somatic Evolution in Tumorigenesis. PLoS Comput. Biol. 2006, 2, e108. [Google Scholar] [CrossRef]
Smallbone, K.; Gavaghan, D.J.; Gatenby, R.A.; Maini, P.K. The Role of Acidity in Solid Tumour Growth and Invasion. J. Theor. Biol. 2005, 235, 476–484. [Google Scholar] [CrossRef]
Quaranta, V.; Rejniak, K.A.; Gerlee, P.; Anderson, A.R.A. Invasion Emerges from Cancer Cell Adaptation to Competitive Microenvironments: Quantitative Predictions from Multiscale Mathematical Models. Semin. Cancer Biol. 2008, 18, 338–348. [Google Scholar] [CrossRef]
Byrne, H.; Chaplain, M. Mathematical Models for Tumour Angiogenesis: Numerical Simulations and Nonlinear Wave Solutions. Bull. Math. Biol. 1995, 57, 461–486. [Google Scholar] [CrossRef]
Flegg, J.A.; McElwain, D.L.S.; Byrne, H.M.; Turner, I.W. A Three Species Model to Simulate Application of Hyperbaric Oxygen Therapy to Chronic Wounds. PLoS Comput. Biol. 2009, 5, e1000451. [Google Scholar] [CrossRef] [PubMed]
Pettet, G.J.; Byrne, H.M.; Mcelwain, D.L.S.; Norbury, J. A Model of Wound-Healing Angiogenesis in Soft Tissue. Math. Biosci. 1996, 136, 35–63. [Google Scholar] [CrossRef] [PubMed]
Balding, D.; McElwain, D.L.S. A Mathematical Model of Tumour-Induced Capillary Growth. J. Theor. Biol. 1985, 114, 53–73. [Google Scholar] [CrossRef] [PubMed]
Byrne, H.M.; Chaplain, M.A.J. Explicit Solutions of a Simplified Model of Capillary Sprout Growth during Tumor Angiogenesis. Appl. Math. Lett. 1996, 9, 69–74. [Google Scholar] [CrossRef]
Muthukkaruppan, V.R.; Kubai, L.; Auerbach, R. Tumor-Induced Neovascularization in the Mouse. JNCI J. Natl. Cancer Inst. 1982, 69, 699–708. [Google Scholar] [CrossRef] [PubMed]
Alarcón, T.; Page, K.M. Mathematical Models of the VEGF Receptor and Its Role in Cancer Therapy. J. R. Soc. Interface 2007, 4, 283. [Google Scholar] [CrossRef]
Stefanini, M.O.; Wu, F.T.H.; Mac Gabhann, F.; Popel, A.S. A Compartment Model of VEGF Distribution in Blood, Healthy and Diseased Tissues. BMC Syst. Biol. 2008, 2, 77. [Google Scholar] [CrossRef]
Wu, F.T.H.; Stefanini, M.O.; Mac Gabhann, F.; Popel, A.S. A Compartment Model of VEGF Distribution in Humans in the Presence of Soluble VEGF Receptor-1 Acting as a Ligand Trap. PLoS ONE 2009, 4, e5108. [Google Scholar] [CrossRef] [PubMed]
Özuǧurlu, E. A Note on the Numerical Approach for the Reaction–Diffusion Problem to Model the Density of the Tumor Growth Dynamics. Comput. Math. Appl. 2015, 69, 1504–1517. [Google Scholar] [CrossRef]
Weis, J.A.; Miga, M.I.; Arlinghaus, L.R.; Li, X.; Chakravarthy, A.B.; Abramson, V.; Farley, J.; Yankeelov, T.E. A Mechanically Coupled Reaction–Diffusion Model for Predicting the Response of Breast Tumors to Neoadjuvant Chemotherapy. Phys. Med. Biol. 2013, 58, 5851. [Google Scholar] [CrossRef] [PubMed]
Borasi, G.; Nahum, A. Modelling the Radiotherapy Effect in the Reaction-Diffusion Equation. Phys. Medica 2016, 32, 1175–1179. [Google Scholar] [CrossRef] [PubMed]
Tracqui, P.; Cruywagen, G.C.; Woodward, D.E.; Bartoo, G.T.; Murray, J.D.; Alvord, E.C. A Mathematical Model of Glioma Growth: The Effect of Chemotherapy on Spatio-Temporal Growth. Cell Prolif. 1995, 28, 17–31. [Google Scholar] [CrossRef] [PubMed]
Harpold, H.L.P.; Alvord, E.C.; Swanson, K.R. The Evolution of Mathematical Modeling of Glioma Proliferation and Invasion. J. Neuropathol. Exp. Neurol. 2007, 66, 1–9. [Google Scholar] [CrossRef]
Swanson, K.R.; Alvord, J.; Murray, J.D. A Quantitative Model for Differential Motility of Gliomas in Grey and White Matter. Cell Prolif. 2000, 33, 317. [Google Scholar] [CrossRef]
Burgess, P.K.; Kulesa, P.M.; Murray, J.D.; Alvord, E.C. The Interaction of Growth Rates and Diffusion Coefficients in a Three-Dimensional Mathematical Model of Gliomas. J. Neuropathol. Exp. Neurol. 1997, 56, 704–713. [Google Scholar] [CrossRef]
Rockne, R.; Alvord, E.C.; Rockhill, J.K.; Swanson, K.R. A Mathematical Model for Brain Tumor Response to Radiation Therapy. J. Math. Biol. 2009, 58, 561. [Google Scholar] [CrossRef]
Deisboeck, T.S.; Zhang, L.; Yoon, J.; Costa, J. In Silico Cancer Modeling: Is It Ready for Primetime? Nat. Clin. Pract. Oncol. 2009, 6, 34. [Google Scholar] [CrossRef] [PubMed]
Barendsen, G.W. Dose Fractionation, Dose Rate and Iso-Effect Relationships for Normal Tissue Responses. Int. J. Radiat. Oncol. Biol. Phys. 1982, 8, 1981–1997. [Google Scholar] [CrossRef] [PubMed]
Fowler, J.F. The Linear-Quadratic Formula and Progress in Fractionated Radiotherapy. Br. J. Radiol. 1989, 62, 679–694. [Google Scholar] [CrossRef] [PubMed]
Dale, R. Use of the Linear-Quadratic Radiobiological Model for Quantifying Kidney Response in Targeted Radiotherapy. Cancer Biother. Radiopharm. 2004, 19, 363–370. [Google Scholar] [CrossRef] [PubMed]
Sun, J.; Zhang, T.; Wang, J.; Li, W.; Zhang, A.; He, W.; Zhang, D.; Li, D.; Ding, J.; Duan, X. Biologically Effective Dose (BED) of Stereotactic Body Radiation Therapy (SBRT) Was an Important Factor of Therapeutic Efficacy in Patients with Hepatocellular Carcinoma (≤5 cm). BMC Cancer 2019, 19, 846. [Google Scholar] [CrossRef] [PubMed]
Mireștean, C.C.; Iancu, R.I.; Iancu, D.P.T. Active Immune Phenotype in Head and Neck Cancer: Reevaluating the Iso-Effect Fractionation Based on the Linear Quadratic (LQ) Model—A Narrative Review. Curr. Oncol. 2023, 30, 4805–4816. [Google Scholar] [CrossRef] [PubMed]
O’Rourke, S.F.C.; McAneney, H.; Hillen, T. Linear Quadratic and Tumour Control Probability Modelling in External Beam Radiotherapy. J. Math. Biol. 2009, 58, 799–817. [Google Scholar] [CrossRef]
Thames, H.D.; Bentzen, S.M.; Turesson, I.; Overgaard, M.; Van Den Bogaert, W. Fractionation Parameters for Human Tissues and Tumors. Int. J. Radiat. Biol. 2009, 56, 701–710. [Google Scholar] [CrossRef]
Bentzen, S.M.; Joiner, M.C. The Linear-Quadratic Approach in Clinical Practice. In Basic Clinical Radiobiology; CRC Press: Boca Raton, FL, USA, 2018; pp. 112–124. [Google Scholar] [CrossRef]
Schneider, U. Mechanistic Model of Radiation-Induced Cancer after Fractionated Radiotherapy Using the Linear-Quadratic Formula. Med. Phys. 2009, 36, 1138–1143. [Google Scholar] [CrossRef]
van Leeuwen, C.M.; Oei, A.L.; Crezee, J.; Bel, A.; Franken, N.A.P.; Stalpers, L.J.A.; Kok, H.P. The Alfa and Beta of Tumours: A Review of Parameters of the Linear-Quadratic Model, Derived from Clinical Radiotherapy Studies. Radiat. Oncol. 2018, 13, 96. [Google Scholar] [CrossRef]
Williams, M.V.; Denekamp, J.; Fowler, J.F. A Review of Alpha/Beta Ratios for Experimental Tumors: Implications for Clinical Studies of Altered Fractionation. Int. J. Radiat. Oncol. Biol. Phys. 1985, 11, 87–96. [Google Scholar] [CrossRef] [PubMed]
Jain, R.; Baxter, L.T. Mechanisms of Heterogeneous Distribution of Monoclonal Antibodies and Other Macromolecules in Tumors: Significance of Elevated Interstitial Pressure. Cancer Res. 1988, 48, 7022–7032. [Google Scholar] [PubMed]
Owen, M.R.; Byrne, H.M.; Lewis, C.E. Mathematical Modelling of the Use of Macrophages as Vehicles for Drug Delivery to Hypoxic Tumour Sites. J. Theor. Biol. 2004, 226, 377–391. [Google Scholar] [CrossRef] [PubMed]
Siegmund, K.D.; Marjoram, P.; Woo, Y.J.; Tavaré, S.; Shibata, D. Inferring Clonal Expansion and Cancer Stem Cell Dynamics from DNA Methylation Patterns in Colorectal Cancers. Proc. Natl. Acad. Sci. USA 2009, 106, 4828–4833. [Google Scholar] [CrossRef]
Swanson, K.R.; Bridge, C.; Murray, J.D.; Alvord, E.C. Virtual and Real Brain Tumors: Using Mathematical Modeling to Quantify Glioma Growth and Invasion. J. Neurol. Sci. 2003, 216, 1–10. [Google Scholar] [CrossRef] [PubMed]
Araujo, R.P.; Liotta, L.A.; Petricoin, E.F. Proteins, Drug Targets and the Mechanisms They Control: The Simple Truth about Complex Networks. Nat. Rev. Drug Discov. 2007, 6, 871–880. [Google Scholar] [CrossRef] [PubMed]
Panetta, J.C. A Mathematical Model of Breast and Ovarian Cancer Treated with Paclitaxel. Math. Biosci. 1997, 146, 89–113. [Google Scholar] [CrossRef] [PubMed]
Basse, B.; Baguley, B.C.; Marshall, E.S.; Wake, G.C.; Wall, D.J.N. Modelling the Flow Cytometric Data Obtained from Unperturbed Human Tumour Cell Lines: Parameter Fitting and Comparison. Bull. Math. Biol. 2005, 67, 815–830, Erratum in Bull. Math. Biol. 2005, 67, 1153. [Google Scholar] [CrossRef]
Arino, O. A Survey of Structured Cell Population Dynamics. Acta Biotheor. 1995, 43, 3–25. [Google Scholar] [CrossRef]
Galuzio, P.P.; Cherif, A. Recent Advances and Future Perspectives in the Use of Machine Learning and Mathematical Models in Nephrology. Adv. Chronic Kidney Dis. 2022, 29, 472–479. [Google Scholar] [CrossRef]
Zappone, A.; Di Renzo, M.; Debbah, M. Wireless Networks Design in the Era of Deep Learning: Model-Based, AI-Based, or Both? IEEE Trans. Commun. 2019, 67, 7331–7376. [Google Scholar] [CrossRef]
Shlezinger, N.; Whang, J.; Eldar, Y.C.; Dimakis, A.G. Model-Based Deep Learning: Key Approaches and Design Guidelines. In Proceedings of the 2021 IEEE Data Science and Learning Workshop (DSLW), Toronto, ON, Canada, 5–6 June 2021. [Google Scholar] [CrossRef]
Quarteroni, A. A Bit of Maths (Behind Artificial Intelligence and Machine Learning). In Algorithms for a New World: When Big Data and Mathematical Models Meet; Springer: Cham, Switzerland, 2022; pp. 43–53. [Google Scholar] [CrossRef]
Huang, S.; Fang, N. Predicting Student Academic Performance in an Engineering Dynamics Course: A Comparison of Four Types of Predictive Mathematical Models. Comput. Educ. 2013, 61, 133–145. [Google Scholar] [CrossRef]
Liu, Z.; Huang, S.; Lu, W.; Su, Z.; Yin, X.; Liang, H.; Zhang, H. Modeling the Trend of Coronavirus Disease 2019 and Restoration of Operational Capability of Metropolitan Medical Service in China: A Machine Learning and Mathematical Model-Based Analysis. Glob. Health Res. Policy 2020, 5, 20. [Google Scholar] [CrossRef] [PubMed]
Bezak, P.; Bozek, P.; Nikitin, Y. Advanced Robotic Grasping System Using Deep Learning. Procedia Eng. 2014, 96, 10–20. [Google Scholar] [CrossRef]
Uddin, M.J.; Hassan, J.; Douroumis, D. Thermal Inkjet Printing: Prospects and Applications in the Development of Medicine. Technologies 2022, 10, 108. [Google Scholar] [CrossRef]
Baturynska, I.; Semeniuta, O.; Martinsen, K. Optimization of Process Parameters for Powder Bed Fusion Additive Manufacturing by Combination of Machine Learning and Finite Element Method: A Conceptual Framework. Procedia CIRP 2018, 67, 227–232. [Google Scholar] [CrossRef]
Malinzi, J.; Basita, K.B.; Padidar, S.; Adeola, H.A. Prospect for Application of Mathematical Models in Combination Cancer Treatments. Inform. Med. Unlocked 2021, 23, 100534. [Google Scholar] [CrossRef]
Haleem, A.; Javaid, M.; Singh, R.P.; Suman, R. Telemedicine for Healthcare: Capabilities, Features, Barriers, and Applications. Sens. Int. 2021, 2, 100117. [Google Scholar] [CrossRef]
Royce, T.J.; Sanoff, H.K.; Rewari, A. Telemedicine for Cancer Care in the Time of COVID-19. JAMA Oncol. 2020, 6, 1698–1699. [Google Scholar] [CrossRef]
Sirintrapun, S.J.; Lopez, A.M. Telemedicine in Cancer Care. In American Society of Clinical Oncology Educational Book; ASCO Publications: Alexandria, VA, USA, 2018; pp. 540–545. [Google Scholar] [CrossRef]
Knudsen, K.E.; Willman, C.; Winn, R. Optimizing the Use of Telemedicine in Oncology Care: Postpandemic Opportunities. Clin. Cancer Res. 2021, 27, 933–936. [Google Scholar] [CrossRef]
Doolittle, G.C.; Allen, A. Practising Oncology via Telemedicine. J. Telemed. Telecare 1997, 3, 63–70. [Google Scholar] [CrossRef] [PubMed]
Doolittle, G.C.; Harmon, A.; Williams, A.; Allen, A.; Boysen, C.D.; Wittman, C.; Mair, F.; Carlson, E. A Cost Analysis of a Tele-Oncology Practice. J. Telemed. Telecare 1997, 3, 20–22. [Google Scholar] [CrossRef] [PubMed]
Yunus, F.; Gray, S.; Fox, K.C.; Allen, J.W.; Sachdev, J.; Merkel, M.; Chambley, B.; Yunus, R.; Waters, T.M. The Impact of Telemedicine in Cancer Care. J. Clin. Oncol. 2009, 27, e20508. [Google Scholar] [CrossRef]
Sharma, J.J.; Gross, G.; Sharma, P. Extending Oncology Clinical Services to Rural Areas of Texas Via Teleoncology. J. Oncol. Pract. 2012, 8, 68. [Google Scholar] [CrossRef] [PubMed]
Thaker, D.A.; Monypenny, R.; Olver, I.; Sabesan, S. Cost Savings from a Telemedicine Model of Care in Northern Queensland, Australia. Med. J. Aust. 2013, 199, 414–417. [Google Scholar] [CrossRef]
Shalowitz, D.I.; Moore, C.J. Telemedicine and Gynecologic Cancer Care. Obstet. Gynecol. Clin. N. Am. 2020, 47, 271–285. [Google Scholar] [CrossRef]
Goncu-Berk, G.; Topcuoglu, N. A Healthcare Wearable for Chronic Pain Management. Design of a Smart Glove for Rheumatoid Arthritis. Des. J. 2017, 20, S1978–S1988. [Google Scholar] [CrossRef]
Tavakoli, M.; Carriere, J.; Torabi, A. Robotics, Smart Wearable Technologies, and Autonomous Intelligent Systems for Healthcare During the COVID-19 Pandemic: An Analysis of the State of the Art and Future Vision. Adv. Intell. Syst. 2020, 2, 2000071. [Google Scholar] [CrossRef]
Khan, Y.; Su’ud, M.B.M.; Alam, M.M.; Ahmad, S.F.; Salim, N.A.; Khan, N. Architectural Threats to Security and Privacy: A Challenge for Internet of Things (IoT) Applications. Electronics 2022, 12, 88. [Google Scholar] [CrossRef]
Sunny, J.S.; Patro, C.P.K.; Karnani, K.; Pingle, S.C.; Lin, F.; Anekoji, M.; Jones, L.D.; Kesari, S.; Ashili, S. Anomaly Detection Framework for Wearables Data: A Perspective Review on Data Concepts, Data Analysis Algorithms and Prospects. Sensors 2022, 22, 756. [Google Scholar] [CrossRef]
Suresh Kumar, V.; Krishnamoorthi, C. Development of Electrical Transduction Based Wearable Tactile Sensors for Human Vital Signs Monitor: Fundamentals, Methodologies and Applications. Sens. Actuators A Phys. 2021, 321, 112582. [Google Scholar] [CrossRef]
Castaneda, D.; Esparza, A.; Ghamari, M.; Soltanpur, C.; Nazeran, H. A Review on Wearable Photoplethysmography Sensors and Their Potential Future Applications in Health Care. Int. J. Biosens. Bioelectron. 2018, 4, 195. [Google Scholar] [CrossRef]
Khan, Y.; Ostfeld, A.E.; Lochner, C.M.; Pierre, A.; Arias, A.C. Monitoring of Vital Signs with Flexible and Wearable Medical Devices. Adv. Mater. 2016, 28, 4373–4395. [Google Scholar] [CrossRef]
Dias, D.; Cunha, J.P.S. Wearable Health Devices—Vital Sign Monitoring, Systems and Technologies. Sensors 2018, 18, 2414. [Google Scholar] [CrossRef]
Qiao, Y.; Qiao, L.; Chen, Z.; Liu, B.; Gao, L.; Zhang, L. Wearable Sensor for Continuous Sweat Biomarker Monitoring. Chemosensors 2022, 10, 273. [Google Scholar] [CrossRef]
ZhuParris, A.; de Goede, A.A.; Yocarini, I.E.; Kraaij, W.; Groeneveld, G.J.; Doll, R.J. Machine Learning Techniques for Developing Remotely Monitored Central Nervous System Biomarkers Using Wearable Sensors: A Narrative Literature Review. Sensors 2023, 23, 5243. [Google Scholar] [CrossRef] [PubMed]
Swan, M. Sensor Mania! The Internet of Things, Wearable Computing, Objective Metrics, and the Quantified Self 2.0. J. Sens. Actuator Netw. 2012, 1, 217–253. [Google Scholar] [CrossRef]
Ates, H.C.; Nguyen, P.Q.; Gonzalez-Macia, L.; Morales-Narváez, E.; Güder, F.; Collins, J.J.; Dincer, C. End-to-End Design of Wearable Sensors. Nat. Rev. Mater. 2022, 7, 887–907. [Google Scholar] [CrossRef] [PubMed]
Vos, G.; Trinh, K.; Sarnyai, Z.; Rahimi Azghadi, M. Generalizable Machine Learning for Stress Monitoring from Wearable Devices: A Systematic Literature Review. Int. J. Med. Inform. 2023, 173, 105026. [Google Scholar] [CrossRef] [PubMed]
Nahavandi, D.; Alizadehsani, R.; Khosravi, A.; Acharya, U.R. Application of Artificial Intelligence in Wearable Devices: Opportunities and Challenges. Comput. Methods Programs Biomed. 2022, 213, 106541. [Google Scholar] [CrossRef]
Guk, K.; Han, G.; Lim, J.; Jeong, K.; Kang, T.; Lim, E.K.; Jung, J. Evolution of Wearable Devices with Real-Time Disease Monitoring for Personalized Healthcare. Nanomaterials 2019, 9, 813. [Google Scholar] [CrossRef]
Briand, L.C.; Basili, V.R.; Thomas, W.M. A Pattern Recognition Approach for Software Engineering Data Analysis. IEEE Trans. Softw. Eng. 1992, 18, 931–942. [Google Scholar] [CrossRef]
Briand, L.C. Quantitative Empirical Modeling for Managing Software Development: Constraints, Needs and Solutions. In Experimental Software Engineering Issues: Critical Assessment and Future Directions: International Workshop Proceedings, Dagstuhl Castle, Wadern, Germany, 14–18 September 1992; Lecture Notes in Computer Science (LNCS, including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Berlin/Heidelberg, Germany, 1993; Volume 706, pp. 158–163. [Google Scholar] [CrossRef]
Gray, A.R.; MacDonell, S.G. A Comparison of Techniques for Developing Predictive Models of Software Metrics. Inf. Softw. Technol. 1997, 39, 425–437. [Google Scholar] [CrossRef]
Heitjan, D.F. Annotation: What Can Be Done about Missing Data? Approaches to Imputation. Am. J. Public Health 1997, 87, 548. [Google Scholar] [CrossRef] [PubMed]
Kim, J.O.; Curry, J. The Treatment of Missing Data in Multivariate Analysis. Sociol. Methods Res. 1977, 6, 215–240. [Google Scholar] [CrossRef]
Koh, D.-M.; Papanikolaou, N.; Bick, U.; Illing, R.; Kahn, C.E., Jr.; Kalpathi-Cramer, J.; Matos, C.; Martí-Bonmatí, L.; Miles, A.; Ki Mun, S.; et al. Artificial Intelligence and Machine Learning in Cancer Imaging. Commun. Med. 2022, 2, 133. [Google Scholar] [CrossRef] [PubMed]
Abiiro, G.A.; Alhassan, F.; Alhassan, B.P.; Alhassan, B.P.; Akanbang, B.A.A. Socio-Demographic Correlates of Public Awareness of Patient Rights and Responsibilities in the Sagnarigu Municipality, Ghana. Int. J. Health Promot. Educ. 2020, 60, 38–48. [Google Scholar] [CrossRef]
Alboaneen, D.; Alqarni, R.; Alqahtani, S.; Alrashidi, M.; Alhuda, R.; Alyahyan, E.; Alshammari, T. Predicting Colorectal Cancer Using Machine and Deep Learning Algorithms: Challenges and Opportunities. Big Data Cogn. Comput. 2023, 7, 74. [Google Scholar] [CrossRef]
Qayyum, A.; Qadir, J.; Bilal, M.; Al-Fuqaha, A. Secure and Robust Machine Learning for Healthcare: A Survey. IEEE Rev. Biomed. Eng. 2021, 14, 156–180. [Google Scholar] [CrossRef]
Chen, J.; Weihs, D.; Vermolen, F.J. Computational Modeling of Therapy on Pancreatic Cancer in Its Early Stages. Biomech. Model. Mechanobiol. 2020, 19, 427–444. [Google Scholar] [CrossRef]
Kuznetsov, M.; Clairambault, J.; Volpert, V. Improving Cancer Treatments via Dynamical Biophysical Models. Phys. Life Rev. 2021, 39, 1–48. [Google Scholar] [CrossRef] [PubMed]
Goodall, C.R. Data Mining of Massive Datasets in Healthcare? J. Comput. Graph. Stat. 1999, 8, 620–634. [Google Scholar] [CrossRef]
Vabalas, A.; Gowen, E.; Poliakoff, E.; Casson, A.J. Machine Learning Algorithm Validation with a Limited Sample Size. PLoS ONE 2019, 14, e0224365. [Google Scholar] [CrossRef] [PubMed]
Figueroa, R.L.; Zeng-Treitler, Q.; Kandula, S.; Ngo, L.H. Predicting Sample Size Required for Classification Performance. BMC Med. Inform. Decis. Mak. 2012, 12, 8. [Google Scholar] [CrossRef] [PubMed]
Beleites, C.; Neugebauer, U.; Bocklitz, T.; Krafft, C.; Popp, J. Sample Size Planning for Classification Models. Anal. Chim. Acta 2013, 760, 25–33. [Google Scholar] [CrossRef] [PubMed]
Alwosheel, A.; van Cranenburgh, S.; Chorus, C.G. Is Your Dataset Big Enough? Sample Size Requirements When Using Artificial Neural Networks for Discrete Choice Analysis. J. Choice Model. 2018, 28, 167–182. [Google Scholar] [CrossRef]
Bzdok, D.; Nichols, T.E.; Smith, S.M. Towards Algorithmic Analytics for Large-Scale Datasets. Nat. Mach. Intell. 2019, 1, 296. [Google Scholar] [CrossRef]
Priyanath, H.M.S.; Rvspk, R.; Rgn, M. Methods and Rule-of-Thumbs in The Determination of Minimum Sample Size When Appling Structural Equation Modelling: A Review. J. Soc. Sci. Res. 2020, 15, 102–107. [Google Scholar] [CrossRef]
Richesson, R.L.; Nadkarni, P. Data Standards for Clinical Research Data Collection Forms: Current Status and Challenges. J. Am. Med. Inform. Assoc. 2011, 18, 341–346. [Google Scholar] [CrossRef]
McGuckin, T.; Crick, K.; Myroniuk, T.W.; Setchell, B.; Yeung, R.O.; Campbell-Scherer, D. Understanding Challenges of Using Routinely Collected Health Data to Address Clinical Care Gaps: A Case Study in Alberta, Canada. BMJ Open Qual. 2022, 11, e001491. [Google Scholar] [CrossRef]
Zhang, A.; Xing, L.; Zou, J.; Wu, J.C. Shifting Machine Learning for Healthcare from Development to Deployment and from Models to Data. Nat. Biomed. Eng. 2022, 6, 1330–1345. [Google Scholar] [CrossRef]
Javaid, M.; Haleem, A.; Pratap Singh, R.; Suman, R.; Rab, S. Significance of Machine Learning in Healthcare: Features, Pillars and Applications. Int. J. Intell. Netw. 2022, 3, 58–73. [Google Scholar] [CrossRef]
Char, D.S.; Abràmoff, M.D.; Feudtner, C. Identifying Ethical Considerations for Machine Learning Healthcare Applications. Am. J. Bioeth. 2020, 20, 7–17. [Google Scholar] [CrossRef]
Burnett, B.; Zhou, S.M.; Brophy, S.; Davies, P.; Ellis, P.; Kennedy, J.; Bandyopadhyay, A.; Parker, M.; Lyons, R.A. Machine Learning in Colorectal Cancer Risk Prediction from Routinely Collected Data: A Review. Diagnostics 2023, 13, 301. [Google Scholar] [CrossRef]
Basavegowda, H.S.; Dagnew, G. Deep Learning Approach for Microarray Cancer Data Classification. CAAI Trans. Intell. Technol. 2020, 5, 22–33. [Google Scholar] [CrossRef]
Cherian Kurian, N.; Sethi, A.; Reddy Konduru, A.; Mahajan, A.; Rane, S.U. A 2021 Update on Cancer Image Analytics with Deep Learning. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2021, 11, e1410. [Google Scholar] [CrossRef]
Geis, J.R.; Brady, A.; Wu, C.C.; Spencer, J.; Ranschaert, E.; Jaremko, J.L.; Langer, S.G.; Kitts, A.B.; Birch, J.; Shields, W.F.; et al. Ethics of Artificial Intelligence in Radiology: Summary of the Joint European and North American Multisociety Statement. Insights Imaging 2019, 10, 1516–1521. [Google Scholar] [CrossRef] [PubMed]
Alabi, R.O.; Vartiainen, T.; Elmusrati, M. Machine Learning for Prognosis of Oral Cancer: What Are the Ethical Challenges? In Proceedings of the Conference on Technology Ethics 2020 (Tethics 2020), Turku, Finland, 21 October 2020. [Google Scholar]
Michelson, K.N.; Klugman, C.M.; Kho, A.N.; Gerke, S. Ethical Considerations Related to Using Machine Learning-Based Prediction of Mortality in the Pediatric Intensive Care Unit. J. Pediatr. 2022, 247, 125–128. [Google Scholar] [CrossRef] [PubMed]
O’Reilly-Shah, V.N.; Gentry, K.R.; Walters, A.M.; Zivot, J.; Anderson, C.T.; Tighe, P.J. Bias and Ethical Considerations in Machine Learning and the Automation of Perioperative Risk Assessment. Br. J. Anaesth. 2020, 125, 843–846. [Google Scholar] [CrossRef] [PubMed]
Braun, M.; Hummel, P.; Beck, S.; Dabrock, P. Primer on an Ethics of AI-Based Decision Support Systems in the Clinic. J. Med. Ethics 2021, 47, e3. [Google Scholar] [CrossRef]
Mulvenna, M.D.; Bond, R.; Delaney, J.; Dawoodbhoy, F.M.; Boger, J.; Potts, C.; Turkington, R. Ethical Issues in Democratizing Digital Phenotypes and Machine Learning in the Next Generation of Digital Health Technologies. Philos. Technol. 2021, 34, 1945–1960. [Google Scholar] [CrossRef]
Morley, J.; Machado, C.C.V.; Burr, C.; Cowls, J.; Joshi, I.; Taddeo, M.; Floridi, L. The Ethics of AI in Health Care: A Mapping Review. Soc. Sci. Med. 2020, 260, 113172. [Google Scholar] [CrossRef] [PubMed]
Rodrigues, R. Legal and Human Rights Issues of AI: Gaps, Challenges and Vulnerabilities. J. Responsible Technol. 2020, 4, 100005. [Google Scholar] [CrossRef]
Tigard, D.W. There Is No Techno-Responsibility Gap. Philos. Technol. 2021, 34, 589–607. [Google Scholar] [CrossRef]
Kaplan, A.; Cao, H.; FitzGerald, J.M.; Iannotti, N.; Yang, E.; Kocks, J.W.H.; Kostikas, K.; Price, D.; Reddel, H.K.; Tsiligianni, I.; et al. Artificial Intelligence/Machine Learning in Respiratory Medicine and Potential Role in Asthma and COPD Diagnosis. J. Allergy Clin. Immunol. Pract. 2021, 9, 2255–2261. [Google Scholar] [CrossRef]
Fan, X.; Yao, Q.; Cai, Y.; Miao, F.; Sun, F.; Li, Y. Multiscaled Fusion of Deep Convolutional Neural Networks for Screening Atrial Fibrillation From Single Lead Short ECG Recordings. IEEE J. Biomed. Health Inform. 2018, 22, 1744–1753. [Google Scholar] [CrossRef] [PubMed]
Yu, G.; Li, Z.; Li, S.; Liu, J.; Sun, M.; Liu, X.; Sun, F.; Zheng, J.; Li, Y.; Yu, Y.; et al. The Role of Artificial Intelligence in Identifying Asthma in Pediatric Inpatient Setting. Ann. Transl. Med. 2020, 8, 1367. [Google Scholar] [CrossRef]
Pareja-Galeano, H.; Garatachea, N.; Lucia, A. Exercise as a Polypill for Chronic Diseases. Prog. Mol. Biol. Transl. Sci. 2015, 135, 497–526. [Google Scholar] [CrossRef]
Tan, T.E.; Anees, A.; Chen, C.; Li, S.; Xu, X.; Li, Z.; Xiao, Z.; Yang, Y.; Lei, X.; Ang, M.; et al. Retinal Photograph-Based Deep Learning Algorithms for Myopia and a Blockchain Platform to Facilitate Artificial Intelligence Medical Research: A Retrospective Multicohort Study. Lancet Digit. Health 2021, 3, e317–e329. [Google Scholar] [CrossRef]
Zhang, Y.; Yu, H.; Dong, R.; Ji, X.; Li, F. Application Prospect of Artificial Intelligence in Rehabilitation and Management of Myasthenia Gravis. Biomed Res. Int. 2021, 2021, 5592472. [Google Scholar] [CrossRef]
Grady, C.; Eckstein, L.; Berkman, B.; Brock, D.; Cook-Deegan, R.; Fullerton, S.M.; Greely, H.; Hansson, M.G.; Hull, S.; Kim, S.; et al. Broad Consent for Research with Biological Samples: Workshop Conclusions. Am. J. Bioeth. 2015, 15, 34–42. [Google Scholar] [CrossRef]
Mayer-Schönberger, V.; Ingelsson, E. Big Data and Medicine: A Big Deal? J. Intern. Med. 2018, 283, 418–429. [Google Scholar] [CrossRef]
Xie, Y.; Lu, L.; Gao, F.; He, S.J.; Zhao, H.J.; Fang, Y.; Yang, J.M.; An, Y.; Ye, Z.W.; Dong, Z. Integration of Artificial Intelligence, Blockchain, and Wearable Technology for Chronic Disease Management: A New Paradigm in Smart Healthcare. Curr. Med. Sci. 2021, 41, 1123. [Google Scholar] [CrossRef] [PubMed]
Kiran, M.P.R.S.; Rajalakshmi, P.; Bharadwaj, K.; Acharyya, A. Adaptive Rule Engine Based IoT Enabled Remote Health Care Data Acquisition and Smart Transmission System. In Proceedings of the 2014 IEEE World Forum on Internet of Things (WF-IoT 2014), Seoul, Republic of Korea, 6–8 March 2014; pp. 253–258. [Google Scholar] [CrossRef]
Dixit, S.; Kumar, A.; Srinivasan, K. A Current Review of Machine Learning and Deep Learning Models in Oral Cancer Diagnosis: Recent Technologies, Open Challenges, and Future Research Directions. Diagnostics 2023, 13, 1353. [Google Scholar] [CrossRef] [PubMed]
Wylde, V.; Rawindaran, N.; Lawrence, J.; Balasubramanian, R.; Prakash, E.; Jayal, A.; Khan, I.; Hewage, C.; Platts, J. Cybersecurity, Data Privacy and Blockchain: A Review. SN Comput. Sci. 2022, 3, 127. [Google Scholar] [CrossRef] [PubMed]
Shahid, J.; Ahmad, R.; Kiani, A.K.; Ahmad, T.; Saeed, S.; Almuhaideb, A.M. Data Protection and Privacy of the Internet of Healthcare Things (IoHTs). Appl. Sci. 2022, 12, 1927. [Google Scholar] [CrossRef]
Saura, J.R.; Ribeiro-Soriano, D.; Palacios-Marqués, D. Assessing Behavioral Data Science Privacy Issues in Government Artificial Intelligence Deployment. Gov. Inf. Q. 2022, 39, 101679. [Google Scholar] [CrossRef]
DoCarmo, T.; Rea, S.; Conaway, E.; Emery, J.; Raval, N. The Law in Computation: What Machine Learning, Artificial Intelligence, and Big Data Mean for Law and Society Scholarship. Law Policy 2021, 43, 170–199. [Google Scholar] [CrossRef]
Shuaib, M.; Alam, S.; Shabbir Alam, M.; Shahnawaz Nasir, M. Compliance with HIPAA and GDPR in Blockchain-Based Electronic Health Record. Mater. Today Proc. 2021; in press. [Google Scholar] [CrossRef]
Zaeem, R.N.; Barber, K.S. The Effect of the GDPR on Privacy Policies. ACM Trans. Manag. Inf. Syst. 2021, 12, 2. [Google Scholar] [CrossRef]
Baik, J.S. Data Privacy against Innovation or against Discrimination?: The Case of the California Consumer Privacy Act (CCPA). Telemat. Inform. 2020, 52, 101431. [Google Scholar] [CrossRef]
Cohen, I.G.; Mello, M.M. HIPAA and Protecting Health Information in the 21st Century. JAMA 2018, 320, 231–232. [Google Scholar] [CrossRef]
Bari, L.; O’Neill, D.P. Rethinking Patient Data Privacy In The Era Of Digital Health. Health Affairs Forefront, 12 December 2019. [Google Scholar] [CrossRef]
Rehman, M.U.; Shafique, A.; Ghadi, Y.Y.; Boulila, W.; Jan, S.U.; Gadekallu, T.R.; Driss, M.; Ahmad, J. A Novel Chaos-Based Privacy-Preserving Deep Learning Model for Cancer Diagnosis. IEEE Trans. Netw. Sci. Eng. 2022, 9, 4322–4337. [Google Scholar] [CrossRef]
Choi, Y.B.; Capitan, K.E.; Krause, J.S.; Streeper, M.M. Challenges Associated with Privacy in Health Care Industry: Implementation of HIPAA and the Security Rules. J. Med. Syst. 2006, 30, 57–64. [Google Scholar] [CrossRef] [PubMed]
Mercuri, R.T. The HIPAA-Potamus in Health Care Data Security. Commun. ACM 2004, 47, 25–28. [Google Scholar] [CrossRef]
Ngiam, K.Y.; Khor, I.W. Big Data and Machine Learning Algorithms for Health-Care Delivery. Lancet Oncol. 2019, 20, e262–e273. [Google Scholar] [CrossRef] [PubMed]
Choy, G.; Khalilzadeh, O.; Michalski, M.; Do, S.; Samir, A.E.; Pianykh, O.S.; Geis, J.R.; Pandharipande, P.V.; Brink, J.A.; Dreyer, K.J. Current Applications and Future Impact of Machine Learning in Radiology. Radiology 2018, 288, 318–328. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Percentage of newly diagnosed cases and deaths by areas worldwide in 2020. (A) Both male and female, (B) male, and (C) female [5].

Figure 2. History behind the development of MMs in the field of biomedical studies [22].

Figure 3. Simplified illustration for working objective of NN method of ML [16].

Figure 4. An example of the curse of dimensionality. (a). Outcome of multiple discontinuities (b). Outcome of continuous sensor computation [77].

Figure 5. General working process of a classifiers [88].

Figure 6. History of MM in the field of cancer research.

Figure 7. Classifications of ML [108].

Figure 8. Applications of ML and MM in new drug discovery [39].

Figure 9. The applications of ML algorithms in lung cancer management [258].

Figure 10. Commonly used ML classifiers in colon cancer [266].

Figure 11. Schematic representation of achieving worthwhile therapeutic outcomes using a combination of ML and MM techniques in cancer treatment [351].

Figure 12. Challenges of ML application and cyber threats in healthcare.

Table 1. Features of ML algorithms used in drug discovery and design.

SN	ML Tool	Description of Algorithm	Features of Algorithm	Reference
1.	Drug finder	In silico virtual screening (VS)	Used to validate the screening platform along with its methods and enhance the credence in its software components to generate appropriate results	[144]
2.	LigGrep	Tool for filtration of docked stances to enhance VS hit rates	Provides better hit rates in terms of test VS in targeting H. sapiens poly adenosine diphosphate ribose polymerase 1 (HsPARP1), S. cerevisiae hexokinase-2 protein (ScHxk2), and H. sapiens peptidyl prolyl cis trans isomerase NIMA-interacting 1 protein (HsPin1)	[145]
3.	LS-align	On an atomic level, flexible ligand structural alignment algorithm for high-throughput VS	Produces rapid and accurate atomic-level structural alignments of ligand for particular molecules	[146]
4.	ProPose	Navigated VS via Simultaneous Protein–Ligand Docking and Ligand–Ligand Alignment	A combined method based on ligand and receptor to ensure steric fit through VS via ranking the molecules as per their similar interaction pattern with known ligands	[147]
5.	StackCBPred	Stacking-based assumption of binding sites between carbohydrates and proteins from their sequence	Predicts biostructural features of amino acids to train a stacking-based ML effectively for the exact identification of binding sites between carbohydrates and protein s	[148]
6.	TrixX	Structural molecular indexing for large-scale VS in almost linear time	One of the fastest VS tools currently available, which is almost two times faster than standard FlexX	[149]

Table 2. List of MMs applied for COVID-19 management with their purposes.

No.	Model	Purpose	Reference(s)
1.	Bats–Hosts–Reservoir–People (BHRP)	Simulation of virus transmissibility from bat to human	[163,164,165,166]
2.	SPSS modeler	Predicting the number future infections, deaths, and tourism crises and disaster management (TCDM)	[167,168,169]
3.	Markov Chain Monte Carlo (MCMC)	Transmission dynamic model combined with personal protective measures	[170,171,172,173]
4.	Ordinary differential equations (ODE) metapopulation model	Disease transmissibility prediction and effect of dynamic interventions	[174,175]
5.	Susceptible–Exposed–Symptomatic–Asymptomatic–Recovered/removed (SEIAR)	Quantification of the age-specific ability for transmission and effect of personal protective measures	[176,177]
6.	Susceptible–Exposed–Infectious–Quarantined–Recovered (SEIQR)	Disease transmissibility prediction and management techniques	[178,179]
7.	Susceptible–Exposed–Infected–Removed (SEIR)	Prediction of disease transmissibility, epidemic scenario, and impact of humidity and temperature	[180,181,182,183,184,185]
8.	Susceptible–Infected–Recovered (SIR)	Monitor transmission and recovery rates in real time, as well as data fitting and management techniques.	[186,187]
9.	Susceptible–Infectious–Quarantined–Recovered (SIQR)	Strategies for management and measurement for quarantine	[188,189]

Table 3. List of MLs applied for COVID-19 management with their respective accuracies.

SN	Name of Tool/Developer/User	ML Algorithm	Accuracy (%)	Reference
1.	Deep Learning with X-ray	CNN	96.78	[190]
2.	iSARF	DT/RF	87.9	[191]
3.	Kunhua	LR	87	[192]
4.	NLR&RDW-SD	LDA	85.7%	[193]
5.	ResNet50	CNN	96.1–99.7	[194]
6.	COVNet	CNN	95	[195]
7.	COVID-Net	CNN	92.6	[196]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hassan, J.; Saeed, S.M.; Deka, L.; Uddin, M.J.; Das, D.B. Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges. Pharmaceutics 2024, 16, 260. https://doi.org/10.3390/pharmaceutics16020260

AMA Style

Hassan J, Saeed SM, Deka L, Uddin MJ, Das DB. Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges. Pharmaceutics. 2024; 16(2):260. https://doi.org/10.3390/pharmaceutics16020260

Chicago/Turabian Style

Hassan, Jasmin, Safiya Mohammed Saeed, Lipika Deka, Md Jasim Uddin, and Diganta B. Das. 2024. "Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges" Pharmaceutics 16, no. 2: 260. https://doi.org/10.3390/pharmaceutics16020260

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges

Abstract

1. Introduction

1.1. General Concepts of Machine Learning (ML) and Mathematical Modeling (MM)

1.1.1. ML

1.1.2. MM

2. Paradigms of ML

2.1. Supervised ML

2.2. Unsupervised ML

2.3. Reinforced Learning

3. ML and MM Approaches in Healthcare

3.1. Discovery of New Drug Molecule

3.2. Prediction and Management of Global Pandemic

3.3. Epigenomics

3.4. Protein Engineering

4. ML Algorithms in Specific Types of Cancer

4.1. Lung Cancer

4.2. Colon Cancer

4.3. Pancreatic Cancer

4.4. Glioma

4.5. Skin Cancer

4.6. Oral Cancer

5. MM Techniques in Specific Types of Cancer

5.1. Tumor Growth

5.2. Treatment

5.3. Interconnection between ML and MM

6. Challenges of ML and MM Approaches in Cancer Prognosis and Therapy

6.1. Data Quantity

6.2. Ethical Consideration

6.3. Data Privacy

7. Further Discussion and Future Directions

8. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI