Machine Learning Prediction Models for Postoperative Stroke in Elderly Patients: Analyses of the MIMIC Database

Zhang, Xiao; Fei, Ningbo; Zhang, Xinxin; Wang, Qun; Fang, Zongping

doi:10.3389/fnagi.2022.897611

ORIGINAL RESEARCH article

Front. Aging Neurosci., 18 July 2022
Sec. Neuroinflammation and Neuropathy
Volume 14 - 2022 | https://doi.org/10.3389/fnagi.2022.897611

Machine Learning Prediction Models for Postoperative Stroke in Elderly Patients: Analyses of the MIMIC Database

Xiao Zhang^1†

Ningbo Fei^2†

Xinxin Zhang¹

Qun Wang¹

Zongping Fang^1*

¹Department of Anesthesiology and Perioperative Medicine, Xijing Hospital, Fourth Military Medical University, Xi’an, China
²Department of Orthopedics and Traumatology, The Duchess of Kent Children’s Hospital at Sandy Bay, The University of Hong Kong, Hong Kong, Hong Kong SAR, China

Objective: With the aging of populations and the high prevalence of stroke, postoperative stroke has become a growing concern. This study aimed to establish a prediction model and assess the risk factors for stroke in elderly patients during the postoperative period.

Methods: ML (Machine learning) prediction models were applied to elderly patients from the MIMIC (Medical Information Mart for Intensive Care)-III and MIMIC-VI databases. The SMOTENC (synthetic minority oversampling technique for nominal and continuous data) balancing technique and iterative SVD (Singular Value Decomposition) data imputation method were used to address the problem of category imbalance and missing values, respectively. We analyzed the possible predictive factors of stroke in elderly patients using seven modeling approaches to train the model. The diagnostic value of the model derived from machine learning was evaluated by the ROC curve (receiver operating characteristic curve).

Results: We analyzed 7,128 and 661 patients from MIMIC-VI and MIMIC-III, respectively. The XGB (extreme gradient boosting) model got the highest AUC (area under the curve) of 0.78 (0.75–0.81), making it better than the other six models, Besides, we found that XGB model with databalancing was better than that without data balancing. Based on this prediction model, we found hypertension, cancer, congestive heart failure, chronic pulmonary disease and peripheral vascular disease were the top five predictors. Furthermore, we demonstrated that hypertension predicted postoperative stroke is much more valuable.

Conclusion: Stroke in elderly patients during the postoperative period can be reliably predicted. We proved XGB model is a reliable predictive model, and the history of hypertension should be weighted more heavily than the results of laboratory tests to prevent postoperative stroke in elderly patients regardless of gender.

Introduction

Stroke, also called cerebrovascular accident, includes the neurological pathology of the brain arteries that can result from ischemia or hemorrhage (Boursin et al., 2018). Stoke ranks as the second-leading cause of mortality and disability worldwide behind ischemic heart disease and thereby become a major health-related challenge (Merino, 2014; Sirsat et al., 2020). Stroke also gives approximately 16,000,000 individuals worldwide various motor and cognitive impairments, which are often unavoidable sequelae in stroke patients. These sequelae greatly aggravate the social and family burden (Di Carlo, 2009). People with advanced age, surgery patients and ICU patients are at high risk of stroke (Mantz et al., 2010; Biteker et al., 2014; Dong et al., 2017). Consequently, it’s urgent to establish an advanced model that can help to predict and diagnose stroke. The early correct detection of stroke will lay a solid foundation to efficiently prevent and treat stroke and will greatly improve the prognosis of surgery. A prediction model is a practicable way to achieve the above goals and several attempts have been made (Maravic-Stojkovic et al., 2014; Khattar et al., 2016; Dunham et al., 2017; Sporns et al., 2017; Zhou et al., 2020). However, there is still a demand for models that can predict stroke in elderly patients after surgery.

Machine Learning (ML), as a mature and scientific modeling method, is attracting more attention than traditional modeling approaches such as the Cox proportional hazard model. ML is a pivotal part of artificial intelligence (AI), it can achieve self-optimization by learning complex structure with numerous variables and data (Bi et al., 2019). So far, ML has wide application in several fields, including search engines, sales and marketing, and autonomous driving (Deo, 2015; Jiang et al., 2017; Handelman et al., 2018; Connor, 2019), as well as medical diagnostics and clinical research (Heo et al., 2019; Saber et al., 2019; Sirsat et al., 2020). During the past few decades, several studies were conducted on the improvement of stroke diagnosis using ML, most of them obtained satisfying results, which would be of great value in early prognosis of stroke (Asadi et al., 2016; Cox et al., 2016; Bacchi et al., 2020; Wu and Fang, 2020). For example, the electromyography (EMG) based muscular activity monitoring system, electroencephalography (EEG) based neuronal firing activity monitoring system and electrocardiogram (ECG) based monitoring system have been applied into the early identification and prognosis of stroke, which are also beneficial to post-stroke rehabilitation (Robinson et al., 2003; Hussain and Park, 2020, 2021a,b).

We obtained our data from two public clinical databases, which contains rich and complete clinical data. In the practice of machine learning modeling, we utilized not only subjects from MIMIC-VI for internal validation but also samples from the MIMIC-III database for further external testing. The goal of the present study is to introduce a prediction model for postoperative stroke in elderly patients. We applied seven machine learning method in this research combined with iterative SVD data imputation and SMOTENC method, which would deliver an accurate and quick prediction outcome. Based on our results, the perioperative patients with high risk of stroke could be found and treated as early as possible, which would shed new light on the prevention of stroke.

Materials and Methods

Database and Study Design

We obtained our data from two publicly available retrospective multigranular clinical databases, MIMIC-III and MIMIC-VI, which are high-quality database of admitted patients from 2000 to 2014 and from 2014 to 2018, respectively. They have large samples with comprehensive clinical information. The 80% percent of the samples from MIMIC-VI, chosen randomly, were regarded as the development set, and the remaining 20% were regarded as the validation set. Besides, the samples from MIMIC-III were applied as an independent testing set to further evaluate the applicability of the established models and predictors.

Subjects and Outcomes

In this study, subjects who were admitted to the SICU (surgery intensive care unit) with age > 55 years were selected. All these patients should include vital signs, complications and laboratory results. As shown in Figure 1, subjects younger than 55 years were excluded. Missing values of enrolled individuals in MIMIC-VI were filled with the iterative SVD data imputation method. Only patients with complete data in MIMIC-III were kept. We finally screened 661 patients from MIMIC-III and 7,128 patients from MIMIC-VI into the study. Incidence of stroke was used as the outcome measure. Then we separated patients into the stroke group and non-stroke group based on their diagnosis in the hospital.

FIGURE 1

Figure 1. Flow diagram of the selection process of patients in MIMIC III (A) and MIMIC VI (B).

We select predictors according to what features chosen in the previous research (Heo et al., 2019; Sirsat et al., 2020; Wu and Fang, 2020), as well as our clinical experience. Predictors with missing data more than 30% in MIMIC-III and MIMIC-VI, such as bicarbonate, were excluded. The predictors included (a) demographic information: age, sex, ethnicity and BMI index; (b) comorbidities: peripheral vascular disease, hypertension, chronic pulmonary disease, diabetes, renal disease, liver disease, peptic ulcer disease, sepsis, congestive heart failure, cancer, and rheumatic disease; (c) the first-day laboratory results in the ICU: the mean level of glucose; the lowest and mean levels of Spo₂, the lowest and highest levels of anion gap, albumin, bilirubin total, creatinine, hematocrit, hemoglobin, WBC (white blood cells), lactate, platelets, potassium, PTT (partial thromboplastin time), PT (prothrombin time), INR (international normalized ratio), and BUN (blood urea nitrogen); and (e) the first-day vital signs in the ICU: the highest and mean levels of heart rate, SBP (systolic blood pressure), DBP (diastolic blood pressure), and MBP (mean blood pressure) (Dunham et al., 2017; Wu and Fang, 2020; Bolourani et al., 2021).

We extracted the target subjects with all of the above information and outcome measures via navicat premium12 software. Data cleaning was completed by Stata software after the data extraction. Firstly, individuals who met the exclusion criteria were excluded. Secondly, the extreme values and outliers were deleted. For data in MIMIC- VI, we excluded subjects with missing values accounting for more than 5% of the predictive features. Imputation method was used to handle with the missing values. For data in MIMIC- III, we merely keep variables with complete values, which was treated as an independent validation set. Therefore, the subsets were established for the final analyses. The process of establishing models was well illustrated in Figure 2.

FIGURE 2

Figure 2. Schematic illustration to the performance of the stroke prediction model. The SMOTENC balancing technique was applied to training dataset before establishing the model due to the imbalanced ratio of non-stroke to stroke patients in this work. After the normalization of data was completed, we applied seven machine learning methods to train and test models with the training dataset, test dataset and the independent test dataset. Finally, we can get model-based importance of features.

Statistical Analysis

We compared the characteristics above between the stroke group and the non-stroke group, also between the development cohort and validation cohort. Differences in normally distributed data are described as mean ± SD (standard deviation) and were compared by the Students’ t-test, while differences in non-normal data are described as median and IQR (interquartile range) and were compared by a non-parametric test. Differences in rate and constituent ratio data are presented as numbers and percentages, and they were compared with the chi-squared test or a non-parametric test.

The missing values of the training set and the verification set in the MIMIC-VI database were reasonably filled in with the iterative SVD data imputation method (Troyanskaya et al., 2001; Di Lena et al., 2019). The results of previous literature suggested that machine learning methods with data balancing methods had better performance in stroke prediction compared with imbalanced data (Wu and Fang, 2020). And in this research, a variant of SMOTE called SMOTE-NC is applied in this research to solve the problem of category imbalance because of the categorical features. All classifiers are trained with an equal number of training samples per class through oversampling (Pears et al., 2014; Bolourani et al., 2021).

In this article, several kinds of classifiers are used in machine learning methods to classify strokes. We used a Python library called Scikit-learn to build our classifier. This Python package provides several classification algorithms and is a powerful and useful open-source machine learning toolkit. The details of performing the stroke prediction model are shown in Figure 2. We employed 7 machine learning algorithms including KNN (k-nearest neighbor), SVM (support vector machine), MLP (multilayer perceptron), LR (logistic regression), DT (decision tree), RF (random forest) and XGBoost (extreme gradient boosting) to establish a stroke prediction model with the training set. The hyperparameters which we used in 7 machine learning algorithms came from default setting in scikit-learn package. Eg: The hyperparameter of KNN is k and the default setting in scikit-learn is “k = 3,” which was used in this study. The performance of the models was weighed by the AUROC (area under the receiver operating characteristic curves) (Zhou et al., 2020). For the best-performing model, the importance of the predictors was evaluated and computed with the information gain. SAS 9.4, R software 3.6.1, and MATLAB 9.9 were used for statistical analyses.

Results

Patient Characteristics

With the data obtained from the common database, we finally identified 7,128 and 661 patients from MIMIC-VI and MIMIC-III, respectively. The screening process is shown in Figure 1. Predictors with too much missing data, such as bicarbonate, were excluded. In the current research, we ultimately included 51 predictors. The age, vital signs and partial laboratory results are shown as mean and SD; other laboratory results are shown as median and IQR. Sex, race, and comorbidities are shown as number and percentage. Patients in the MIMIC-VI database were divided into stroke group (n = 731) and non-stroke group (n = 6,397). Their baseline characteristics are shown in Table 1. The stroke group subjects were older (74.0 ± 10.5 vs. 72.1 ± 10.4) and had a higher incidence of hypertension. Additionally, both the stroke group and the non-stroke group were similar in BMI and sex distribution. Patients in the MIMIC-VI database were randomly separated into training and validation sets at a ratio of 8:2, while the MIMIC-III database made up the independent testing set. Patients with stroke in the training set, validation set and independent testing set accounted for 10.3, 10.2, and 6.51%, respectively. The training cohort included 5,117 non-stroke subjects and 585 stroke subjects. The validation cohort had 1,280 non-stroke subjects and 146 stroke subjects. The independent testing cohort had 618 non-stroke subjects and 43 stroke subjects. Patients in the training and validation cohorts were similar in demographic characteristics, the incidence of various comorbidities, laboratory results and vital signs, as shown in Supplementary Table 1. For the independent testing set, constituted by data from MIMIC-VI database, the population included was quite different, and only patients with complete values were included, which is a requirement of an independent testing cohort. The characteristics of patients in the independent testing set were shown in Supplementary Table 2.

TABLE 1

Table 1. Characteristics of stroke and non-stroke patients in the training and validation sets of the MIMIC-IV database.

Prediction Models

The process of performing the stroke prediction model was illustrated in Figure 2. Due to the imbalanced ratio of non-stroke to stroke patients in this work, the SMOTENC balancing technique was applied to training dataset before establishing the model. After normalization of data was completed, we applied KNN, SVM, MLP, LR, DT, RF, and XGBoost machine learning algorithms to train with the training dataset, and to test models with testing dataset and the independent testing dataset. The ROC curves of all seven models applied to the testing dataset and the independent testing dataset are given in Figure 3. The mean AUC values of 7 models in the validation cohort were 0.69, 0.76, 0.74, 0.75, 0.59, 0.78, and 0.78, respectively. Take the ROC curves into consideration, the XGB model performed best, with higher accuracy, sensitivity, specificity, and AUC values, they are 0.68 (0.57–0.78), 0.77 (0.63–0.9), 0.67 (0.53–0.8), 0.78 (0.75–0.81), respectively (Table 2). Due to the differences between the populations included in the database, the proportion of stroke patients and data characteristics of the independent testing cohort were distinct from those of the training set and validation set. Not surprisingly, we found the XGB model performed best in the independent testing set (Table 2). The accuracy, sensitivity, specificity, and AUC values of XGB model are 0.87 (0.78–0.93), 0.97 (0.96–0.98), 0.3 (0.19–0.45), 0.83 (0.79–0.87) respectively.

FIGURE 3

Figure 3. Performance evaluation for seven machine learning algorithms with ROC curves. (A) ROC curves were drawn for the validation set based on MIMIC VI performed by leaving 20% as a testing set and using the rest for the training set. (B) ROC curves were drawn for the independent testing set based on MIMIC III. The mean ROC curve of XGB is shown in pink and its corresponding 95% confidence interval is shown in deep blue.

TABLE 2

Table 2. Performance of machine learning methods in different data sets.

Finally, we can get model-based importance of features, we present the importance of the predictors in the XGB model in Figure 4. The top five predictors were hypertension, cancer, congestive heart failure, chronic pulmonary disease and peripheral vascular disease (with importance values of 0.275, 0.104, 0.080, 0.063, and 0.054, respectively). The confusion matrix of the XGB model is presented in Table 3, which represents the predicted values vs. actual values for the validating and independent testing cohorts.

FIGURE 4

Figure 4. Significance of the predictors in the XGB model. The 20 variables with the highest relative importance are measured by the amount the variable reduced the information gain. CHF, congestive heart failure; CPD, chronic pulmonary disease; PVD, peripheral vascular disease; inr, international normalized ratio; spo2, Finger pulse oxygen saturation; sbp, systolic blood pressure; PUD, chronic pulmonary disease; bun, blood urea nitrogen.

TABLE 3

Table 3. Confusion matrix of the XGBoost model.

Discussion

In this study, we aimed to establish suitable model to recognize the possible stroke in elderly patients undergoing surgery and characterize the critical predictors of post-operative stroke. Nowadays, machine learning has been widely used in establishing disease prediction model.

With the help of ML methods, we found that the incidence of stroke in elderly patients undergoing surgery was associated with various clinical features. The XGB model performed best among the KNN, SVM, MLP, LR, DT, RF, and XGB models in our study. We identified hypertension, cancer, congestive heart failure, chronic pulmonary disease and peripheral vascular disease as predictors that were most associated with stroke.

Similar to our study, a study conducted using data from the Chinese Longitudinal Healthy Longevity Study built a stroke prediction model in elderly patients aged more than 60 years (Wu and Fang, 2020). It used SMOTH to deal with imbalanced data and selected important predictors as inputs in three machine learning methods. However, due to the different sources of patients and models, they found that sex, LDLC (low-density lipoprotein cholesterol), GLU (blood glucose), hypertension, and UA (uric acid) were the top five predictors in their RF model.

Compared with other studies, ours have certain strengths. This is the first study to establish stroke prediction models focused on elderly patients undergoing surgery by using an advanced machine learning method. We used several different methods to impute data (KNN, SoftImpute, IterativeImputer, IterativeSVD) and deal with imbalanced data (SMOTENC, ADASYN, BorderlineSMOTE, KMeansSMOTE, SVMSMOTE). Finally, we chose IterativeSVD and SMOTENC according to the AUC values. We utilized not only subjects from MIMIC-III for internal validation but also samples from the MIMIC-VI database for further external testing of the seven machine learning models.

Our study also has some limitations. First, relying on the results, we can only prevent stoke as much as possible, but cannot identify the stroke. The physiological signals like EMG, EEG, and ECG based monitoring system may have a chance to early identify stroke, which is also helpful to post-stroke treatment (Robinson et al., 2003; Hussain and Park, 2020, 2021a,b). Second, there were a certain number of missing values. We abandoned some potential confounding variables for having too many missing data, which is unavoidable in retrospective studies. Third, there were many variables involved, and the excessive variables increased the difficulty of model construction and the accuracy of the established models. Therefore further study about the effect of the strongest stroke predictors that we screened out should be carried out in the future.

Conclusion

Our results showed that hypertension, cancer, congestive heart failure, chronic pulmonary disease and peripheral vascular disease might be closely associated with stroke in SICU elderly patients. The XGBoost model performs better than the KNN, SVM, MLP, LR, DT, and RF models in our study. In order to prevent stroke of elderly patients in SICU, we need to pay attention to their comorbidities more than other laboratory features, especially maintaining stable blood pressure. However, further additional verifications are necessary to examine the generalization of our models and predictors.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://mimic.mit.edu/.

Ethics Statement

Medical Information Mart for Intensive Care (MIMIC) is a large, freely-available medical database consisting of deidentified data from patients who were admitted to the critical care units of the Beth Israel Deaconess Medical Center. The consent was obtained at the beginning of data collection. Therefore, the ethical approval statement and the need for informed consent were jumped in this manuscript, which were not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

ZF, QW, and XZ made contributions to the conception and design of the work. XZ extracted the data from the MIMIC-III and MIMIC-VI databases. NF and XXZ participated in processing the data and the statistical analysis. XZ and NF finished the first draft, they contributed equally to this work and shared first authorship. All authors had revised the manuscript and approved the final edition.

Funding

This work was supported the National Natural Science Foundation of China (no. 82171322) to ZF.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We are extremely grateful to my husband Xiuquan Wu for his consolation and constructive comments on the draft.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnagi.2022.897611/full#supplementary-material

References

Asadi, H., Kok, H. K., Looby, S., Brennan, P., O’Hare, A., and Thornton, J. (2016). Outcomes and Complications After Endovascular Treatment of Brain Arteriovenous Malformations: A Prognostication Attempt Using Artificial Intelligence. World Neurosurg. 96, 562.e–569.e. doi: 10.1016/j.wneu.2016.09.086

PubMed Abstract | CrossRef Full Text | Google Scholar

Bacchi, S., Zerner, T., Oakden-Rayner, L., Kleinig, T., Patel, S., and Jannes, J. (2020). Deep Learning in the Prediction of Ischaemic Stroke Thrombolysis Functional Outcomes: A Pilot Study. Acad. Radiol. 27:e19–e23. doi: 10.1016/j.acra.2019.03.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Bi, Q., Goodman, K. E., Kaminsky, J., and Lessler, J. (2019). What is Machine Learning? A Primer for the Epidemiologist. Am. J. Epidemiol. 188, 2222–2239. doi: 10.1093/aje/kwz189

PubMed Abstract | CrossRef Full Text | Google Scholar

Biteker, M., Kayatas, K., Türkmen, F. M., and Mısırlı, C. H. (2014). Impact of perioperative acute ischemic stroke on the outcomes of noncardiac and nonvascular surgery: a single centre prospective study. Can. J. Surg. 57, E55–E61. doi: 10.1503/cjs.003913

PubMed Abstract | CrossRef Full Text | Google Scholar

Bolourani, S., Brenner, M., Wang, P., McGinn, T., Hirsch, J. S., Barnaby, D., et al. (2021). A Machine Learning Prediction Model of Respiratory Failure Within 48 Hours of Patient Admission for COVID-19: Model Development and Validation. J. Med. Internet Res. 23, e24246. doi: 10.2196/24246

PubMed Abstract | CrossRef Full Text | Google Scholar

Boursin, P., Paternotte, S., Dercy, B., Sabben, C., and Maïer, B. (2018). [Semantics, epidemiology and semiology of stroke]. Soins 63, 24–27. doi: 10.1016/j.soin.2018.06.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Connor, C. W. (2019). Artificial Intelligence and Machine Learning in Anesthesiology. Anesthesiology 131, 1346–1359. doi: 10.1097/aln.0000000000002694

PubMed Abstract | CrossRef Full Text | Google Scholar

Cox, A. P., Raluy-Callado, M., Wang, M., Bakheit, A. M., Moore, A. P., and Dinet, J. (2016). Predictive analysis for identifying potentially undiagnosed post-stroke spasticity patients in United Kingdom. J. Biomed. Inform. 60, 328–333. doi: 10.1016/j.jbi.2016.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Deo, R. C. (2015). Machine Learning in Medicine. Circulation 132, 1920–1930. doi: 10.1161/circulationaha.115.001593

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Carlo, A. (2009). Human and economic burden of stroke. Age Ageing 38, 4–5. doi: 10.1093/ageing/afn282

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Lena, P., Sala, C., Prodi, A., and Nardini, C. (2019). Missing value estimation methods for DNA methylation data. Bioinformatics 35, 3786–3793. doi: 10.1093/bioinformatics/btz134

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, Y., Cao, W., Cheng, X., Fang, K., Zhang, X., Gu, Y., et al. (2017). Risk Factors and Stroke Characteristic in Patients with Postoperative Strokes. J. Stroke Cerebrovasc. Dis. 26, 1635–1640. doi: 10.1016/j.jstrokecerebrovasdis.2016.12.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Dunham, A. M., Grega, M. A., Brown, C.H.t, McKhann, G. M., Baumgartner, W. A., and Gottesman, R. F. (2017). Perioperative Low Arterial Oxygenation Is Associated With Increased Stroke Risk in Cardiac Surgery. Anesth. Analg. 125, 38–43. doi: 10.1213/ane.0000000000002157

PubMed Abstract | CrossRef Full Text | Google Scholar

Handelman, G. S., Kok, H. K., Chandra, R. V., Razavi, A. H., Lee, M. J., and Asadi, H. (2018). eDoctor: machine learning and the future of medicine. J. Intern. Med. 284, 603–619. doi: 10.1111/joim.12822

PubMed Abstract | CrossRef Full Text | Google Scholar

Heo, J., Yoon, J. G., Park, H., Kim, Y. D., Nam, H. S., and Heo, J. H. (2019). Machine Learning-Based Model for Prediction of Outcomes in Acute Stroke. Stroke 50, 1263–1265. doi: 10.1161/strokeaha.118.024293

PubMed Abstract | CrossRef Full Text | Google Scholar

Hussain, I., and Park, S. J. (2020). HealthSOS: Real-Time Health Monitoring System for Stroke Prognostics. IEEE Access 8, 213574–213586. doi: 10.1109/access.2020.3040437

CrossRef Full Text | Google Scholar

Hussain, I., and Park, S. J. (2021a). Big-ECG: Cardiographic Predictive Cyber-Physical System for Stroke Management. IEEE Access 9, 123146–123164. doi: 10.1109/access.2021.3109806

CrossRef Full Text | Google Scholar

Hussain, I., and Park, S. J. (2021b). Prediction of Myoelectric Biomarkers in Post-Stroke Gait. Sensors 21:5334. doi: 10.3390/s21165334

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, F., Jiang, Y., Zhi, H., Dong, Y., Li, H., Ma, S., et al. (2017). Artificial intelligence in healthcare: past, present and future. Stroke Vasc. Neurol. 2, 230–243. doi: 10.1136/svn-2017-000101

PubMed Abstract | CrossRef Full Text | Google Scholar

Khattar, N. K., Friedlander, R. M., Chaer, R. A., Avgerinos, E. D., Kretz, E. S., Balzer, J. R., et al. (2016). Perioperative stroke after carotid endarterectomy: etiology and implications. Acta Neurochir. 158, 2377–2383. doi: 10.1007/s00701-016-2966-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Mantz, J., Dahmani, S., and Paugam-Burtz, C. (2010). Outcomes in perioperative care. Curr. Opin. Anaesthesiol. 23, 201–208. doi: 10.1097/ACO.0b013e328336aeef

PubMed Abstract | CrossRef Full Text | Google Scholar

Maravic-Stojkovic, V., Lausevic-Vuk, L. J., Obradovic, M., Jovanovic, P., Tanaskovic, S., Stojkovic, B., et al. (2014). Copeptin level after carotid endarterectomy and perioperative stroke. Angiology 65, 122–129. doi: 10.1177/0003319712473637

PubMed Abstract | CrossRef Full Text | Google Scholar

Merino, J. G. (2014). Clinical stroke challenges: A practical approach. Neurol. Clin. Pract. 4, 376–377. doi: 10.1212/cpj.0000000000000082

PubMed Abstract | CrossRef Full Text | Google Scholar

Pears, R., Finlay, J., and Connor, A. M. (2014). Synthetic Minority Over-sampling TEchnique(SMOTE) for Predicting Software Build Outcomes. Comput. Sci. 1508–2806. doi: 10.48550/arXiv.1407.2330

CrossRef Full Text | Google Scholar

Robinson, T. G., Dawson, S. L., Eames, P. J., Panerai, R. B., and Potter, J. F. (2003). Cardiac baroreceptor sensitivity predicts long-term outcome after acute ischemic stroke. Stroke 34, 705–712. doi: 10.1161/01.Str.0000058493.94875.9f

CrossRef Full Text | Google Scholar

Saber, H., Somai, M., Rajah, G. B., Scalzo, F., and Liebeskind, D. S. (2019). Predictive analytics and machine learning in stroke and neurovascular medicine. Neurol. Res. 41, 681–690. doi: 10.1080/01616412.2019.1609159

PubMed Abstract | CrossRef Full Text | Google Scholar

Sirsat, M. S., Ferme, E., and Camara, J. (2020). Machine Learning for Brain Stroke: A Review. J. Stroke Cerebrovasc. Dis. 29:105162. doi: 10.1016/j.jstrokecerebrovasdis.2020.105162

PubMed Abstract | CrossRef Full Text | Google Scholar

Sporns, P. B., Hanning, U., Schwindt, W., Velasco, A., Buerke, B., Cnyrim, C., et al. (2017). Ischemic Stroke: Histological Thrombus Composition and Pre-Interventional CT Attenuation Are Associated with Intervention Time and Rate of Secondary Embolism. Cerebrovasc. Dis. 44, 344–350. doi: 10.1159/000481578

PubMed Abstract | CrossRef Full Text | Google Scholar

Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., et al. (2001). Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525. doi: 10.1093/bioinformatics/17.6.520

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, Y., and Fang, Y. (2020). Stroke Prediction with Machine Learning Methods among Older Chinese. Int. J. Environ. Res. Public Health 17:1828. doi: 10.3390/ijerph17061828

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Q., Zhu, C., Shen, Z., Zhang, T., Li, M., Zhu, J., et al. (2020). Incidence and potential predictors of thromboembolic events in epithelial ovarian carcinoma patients during perioperative period. Eur. J. Surg. Oncol. 46, 855–861. doi: 10.1016/j.ejso.2020.01.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: stroke, machine learning, prediction model, post-operative, MIMIC database

Citation: Zhang X, Fei N, Zhang X, Wang Q and Fang Z (2022) Machine Learning Prediction Models for Postoperative Stroke in Elderly Patients: Analyses of the MIMIC Database. Front. Aging Neurosci. 14:897611. doi: 10.3389/fnagi.2022.897611

Received: 16 March 2022; Accepted: 13 June 2022;
Published: 18 July 2022.

Edited by:

Yujie Chen, Army Medical University, China

Reviewed by:

Iqram Hussain, Seoul National University, South Korea
Jinlong Liu, Zhejiang University, China

Copyright © 2022 Zhang, Fei, Zhang, Wang and Fang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zongping Fang, zongping03@163.com

^†These authors have contributed equally to this work and share first authorship

ORIGINAL RESEARCH article

Machine Learning Prediction Models for Postoperative Stroke in Elderly Patients: Analyses of the MIMIC Database

Introduction

Materials and Methods

Database and Study Design

Subjects and Outcomes

Statistical Analysis

Results

Patient Characteristics

Prediction Models

Discussion

Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

Acknowledgments

Supplementary Material

References

This article is part of the Research Topic

People also looked at