Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 17 November 2022
Sec. Infectious Diseases – Surveillance, Prevention and Treatment
This article is part of the Research Topic Modeling of COVID-19 and Other Infectious Diseases: Mathematical, Statistical and Biophysical Analysis of Spread Patterns View all 10 articles

Identification of hospitalized mortality of patients with COVID-19 by machine learning models based on blood inflammatory cytokines

  • 1Department of Nephrology, Xijing Hospital, Fourth Military Medical University, Xi'an, China
  • 2First Unit, Third Branch of Fangcang Shelter Hospital of National Exhibition and Convention Center, Shanghai, China

Coronavirus disease 2019 (COVID-19) spread worldwide and presented a significant threat to people's health. Inappropriate disease assessment and treatment strategies bring a heavy burden on healthcare systems. Our study aimed to construct predictive models to assess patients with COVID-19 who may have poor prognoses early and accurately. This research performed a retrospective analysis on two cohorts of patients with COVID-19. Data from the Barcelona cohort were used as the training set, and data from the Rotterdam cohort were used as the validation set. Cox regression, logistic regression, and different machine learning methods including random forest (RF), support vector machine (SVM), and decision tree (DT) were performed to construct COVID-19 death prognostic models. Based on multiple clinical characteristics and blood inflammatory cytokines during the first day of hospitalization for the 138 patients with COVID-19, we constructed various models to predict the in-hospital mortality of patients with COVID-19. All the models showed outstanding performance in identifying high-risk patients with COVID-19. The accuracy of the logistic regression, RF, and DT models is 86.96, 80.43, and 85.51%, respectively. Advanced age and the abnormal expression of some inflammatory cytokines including IFN-α, IL-8, and IL-6 have been proven to be closely associated with the prognosis of patients with COVID-19. The models we developed can assist doctors in developing appropriate COVID-19 treatment strategies, including allocating limited medical resources more rationally and early intervention in high-risk groups.

Introduction

Syndrome coronavirus-2 (SARS-CoV-2) is the causative agent of coronavirus disease 2019 (COVID-19), which infected more than 180 million people. Compared with a similar acute respiratory syndrome caused by the severe acute respiratory syndrome coronavirus, COVID-19 seems milder but more infectious (1). After being infected with COVID-19, patients' characteristics vary. Some patients became clinically asymptomatic or had mild cases of fever, cough, fatigue, and other symptoms. However, some people became patients with severe COVID-19 and even lost their lives (24). Due to inappropriate assessments and treatment, strategies were detrimental to the patient's health and promoted SARS-CoV-2 to become a global societal problem, which has caused over 4 million deaths to date (5). Therefore, identifying patients with poor prognoses as early as possible for necessary interventions is an essential direction for the treatment of COVID-19, which will significantly improve the prognosis of patients and release a tremendous burden on the medical care system.

The immune system and inflammatory syndrome have been proven to play a crucial role in COVID-19 infection (6). Inflammatory cytokines are critical mediators that oversee and regulate immune and inflammatory responses via complex networks and serve as biomarkers for many diseases (7). According to previous studies (810), inflammatory cytokines were closely related to the progression, complications, and mortality of COVID-19. Universally, these studies paid attention to the relationship between cytokines and disease severity. However, few researchers specifically employed cytokines to construct a model for predicting the prognosis of patients with COVID-19.

Machine learning (ML) algorithms have been widely applied in the medical field, including diagnosing and predicting prognosis. ML models are also used in every aspect of the diagnosis and treatment of COVID-19 due to their fantastic data processing capabilities. Previous ML studies have used multiple indicators, including clinical and blood text indicators, to determine the prognosis of patients with COVID-19. Due to cytokine tests' simplicity, high efficiency, and accuracy, they gradually became an alternative plan for the early prediction of COVID-19 prognosis. Abers et al. (11) fit a Cox proportional hazard to screen the mortality-related inflammatory cytokines. Patterson et al. (12) applied ML methods for the early identification of patients with severe COVID-19 based on cytokines. Mueller et al. (13) classified patients with COVID-19 into different subgroups according to inflammatory cytokines and applied the immunotypes to predict long-term post-COVID-19 complications. However, there are no available models for early prediction of the death of patients with COVID-19 based on blood inflammatory cytokines for clinical work. Constructing a predictive model that can be applied in the clinic seemed urgent. Therefore, in this research, we made use of data collected by Mueller et al. (13), containing 138 inpatients with COVID-19, to construct multiple models based on different algorithms, including logistic regression, random forest (RF), and decision tree (DT) to predict patient deaths.

Methods

Data acquisition

In this research, we obtained COVID-19 data from a dataset of Mueller et al. (13) after obtaining author approval. The data were from a finished cohort study, which has been approved by the Ethics Committee for Research with Medicines of Hospital Universitari Vall d'Hebron and Erasmus University Medical Center (13). Therefore, we do not require reapproval from the ethics committee for this study. The dataset contained 138 patients with COVID-19 from two independent cohorts (Rotterdam cohort n = 50 and Barcelona cohort n = 88). Clinical parameters and laboratory data for each patient during the first 24 h of hospitalization were included in the dataset. Final clinical outcomes were classified into discharge from the hospital and in-hospital death. All inpatients in the study cohort were older than 18 and SARS-CoV-2 positive diagnosed by reverse transcription-polymerase chain reaction (RT-PCR) test and were sampled at hospital entry. Dataset included cytokines measured through the ELLA Simple Plex system and ELISA kits. The COVID-19 antibody concentration in serum was measured through ELISA, and other clinical indicators were obtained by routine tests. Data acquisition is presented in the study by Mueller et al. (13).

Data preprocessing

First, we binarized the outcome variables into Booleans. We used mean value interpolation to substitute missing values, which was widely used and proved effective for missing values in datasets. Thankfully, no missing values in the COVID-19 patients' cytokines values ensured the model's reliability. We used the Barcelona cohort to construct the model and applied the Rotterdam cohort as an independent external validation set.

Patients' inflammatory cytokine values from the training and validation datasets were loaded into R software (version 4.1.0). R packages “vegan” and “stats” were applied to perform the principal component analysis (PCA) and draw the PCA figure. Besides, we used “ggplot2” for the figure drawing.

Our research selected the following five algorithms: Cox regression, logistic regression, RF, support vector machine (SVM), and decision tree (DT). COVID-19 cases located in different areas were conducted to verify the accuracy of each model to ensure the model's reliability. In the logistic regression model, we input all the inflammatory cytokine values, age, and sex into the logistic regression in the training set. Then, the variables with p < 0.1 in the single logistic regression were filtered for the following research. We applied multiple logistic regression (backward: likelihood ratio method) multivariate analysis for the hub variables, and the coefficients derived were used to generate a prognostic model. A prognostic model was constructed based on the logistic regression coefficients. For verification, we generated ROC and calibration curves to calculate the model results for the training set and validation set. The area under the curve (AUC) and 95% confidence interval (CI) were used to verify the model efficiency. Moreover, to obtain a more comprehensive evaluation of the application value of the signature, decision curve analysis (DCA) was performed, demonstrating the net benefit to the patients with COVID-19 after applying the model for prognosis. These steps above were finished using SPSS (version 26.0) and Stata software (version 16). The Cox regression was performed using STATA (version 16), too. The Cox prognostic model was based on the Cox regression coefficients. The ROC and DCA were both drawn to evaluate the model.

In supervised ML, we evaluated the residuals of both methods for the model's accuracy and compared the residuals in our research. After confirming that the RF model is a better method with fewer residuals, we created an RF model to obtain the variables' significance. Artificial neural network (ANN) model training was performed, and the ROCs were used to verify the model efficiency. What is more, we constructed a DT based on all inflammatory cytokines, sex, and age through the R package “rpart” and calculated the accuracy of the DT in the training set and validation set. The RF model was constructed using R packages “randomForest” and “neuralnet.” DT was finished through R package “rpart.” Additionally, the R package “pROC” was applied to perform ROC.

A flowchart is shown in the Figure 1 to help readers better understand the complete analysis steps.

FIGURE 1
www.frontiersin.org

Figure 1. The flowchart of our research. LR, logistic regression; RF, random forest; DT, decision tree.

Results

Characteristics of the patients

There are 138 patients with COVID-19 participating in our study cohort. There were 27 (19.6%) deceased cases, containing 18 (13.0%) dead cases in the Barcelona cohort and 9 (7.0%) dead cases in the Rotterdam cohort. In Table 1, we listed the characteristics of each variable in both cohorts. We found no significant differences between the two cohorts, which ensures the feasibility and rationality of applying both datasets to model and validate.

TABLE 1
www.frontiersin.org

Table 1. Characteristics of patients with COVID-19.

Building a logistic regression model to classify patients and evaluate the model

We compared the patients with COVID-19 with different outcomes. There existed a significant difference between survival and death cases in age, some antibodies, and cytokines, indicating that these characteristics might play a key role in promoting death (Table 2).

TABLE 2
www.frontiersin.org

Table 2. Characteristics of two cohorts.

All the inflammatory cytokines, sex, and age of the training set were used as input variables to perform single logistic regression. Five variables, including IL8, IL6, IFN-α, and IL17-α, and age with p < 0.1 might play a vital role in the COVID-19 patients' classification (Table 3). Multivariate analysis revealed that there were four variables with p < 0.05. The logistic regression model was constructed based on these four hub variables (Table 3).

TABLE 3
www.frontiersin.org

Table 3. Logistic regression analysis for mortality of patients in the training set.

The AUC of the model in the training set was 0.919 and in the validation set was 0.7236 (Figures 2A,B). Both were >0.7, which means that the model possessed good diagnosability. Besides, we found that the different groups were separated based on the hub variables (Figures 2C,D) in the PCA, indicating that these four variables might represent essential differences between different groups (Figure 2E). We drew a nomogram to apply our model in clinical work better. For evaluation, we performed calibration curve plots that fit well with the diagonal reference line (Figure 2F), indicating our model's great performance. DCA is a powerful method for assessing the degree of patient benefits. This research applied the DCA for the model in training and validation sets (Figures 2G,H). The DCA curves revealed that patients with COVID-19 could obtain net benefits through the logistic regression model. To establish a more comprehensive prognosis evaluation system, we applied Cox regression in Supplementary material 1.

FIGURE 2
www.frontiersin.org

Figure 2. The characteristics of logistic regression (LR) signature. (A) The receiver operating characteristic curve (ROC) of signature in the training set. The area under the curve (AUC) is 0.919, indicating that the signature works well in the training set. (B) The ROC of signature in the validation set. AUC is 0.7236, indicating that the signature is valuable in the validation set. (C) The principal component analysis (PCA) of hub cytokines and age shows that patients with COVID-19 with different outcomes are separate in the training set. (D) The PCA of hub cytokines and age shows that patients with COVID-19 with different outcomes are separate in the validation set. (E) The nomogram of the signature. (F) Calibration curve plots of the signature. The lowess fits the reference line well, showing that the signature is effective. (G) The decision curve analysis (DCA) of the signature in the training set. (H) The DCA of the signature in the validation set. Both decision curve analyses revealed that patients with COVID-19 can benefit from applying the LR signature.

Establishing an ML model to classify patients and evaluate the model

To improve the diagnostic performance of the model, we applied ML algorithms, including RF and SVM, to construct a new model. To reduce the subsequent unnecessary workload, first, we evaluated the residuals of the SVM and RF (Figures 3A,B). The results indicated that RF performed better with fewer residuals. Therefore, we decided to choose RF as the main ML algorithm to construct a model.

FIGURE 3
www.frontiersin.org

Figure 3. The comparison of random forest (RF) and support vector machine (SVM) methods in constructing models. (A) Boxplots of |residual|. The red dot stands for the root mean square of residuals. (B) Reverse cumulative distribution of |residual|. These results indicate that RF might be more suitable than SVM to construct a model in our research.

In the next step, all the variables (including cytokines, sex, and age) were entered into the RF classifier. We set the optimal parameter mtry to 2 as a default setting. The optimal number of trees in the classifier was set as 500, maintaining a low error for the classifier (Figure 4A). We used the perspective of reducing mean square error to measure the variable importance of the results (Gini coefficient). To keep our model succinct, we identified hub variables using a cutoff of importance >2 (Figure 4B). After obtaining the hub variables, we constructed an ANN model through the R package “neuralnet”. Two parallel training processes were used to construct a scoring model based on the training set. The ANN topology of the training set contained eight input layers, five hidden layers, and two output layers (Figure 4C).

FIGURE 4
www.frontiersin.org

Figure 4. The RF modeling processes. (A) The influence of the number of decision trees on the error rate. After multiple repetitive operations, the error becomes stable gradually. (B) Results of the Gini coefficient method in the random forest classifier. We set importance=2 as a cutoff. (C) Neural network topology of the microarray with 8 input layers, 5 hidden layers, and 2 output layers. (D) The PCA of hub cytokines and age shows that patients with COVID-19 with different outcomes are separate in the training set. (E) The PCA of hub cytokines and age shows that patients with COVID-19 with different outcomes are separate in the validation set. (F) The ROC of signature in the training set. AUC is 0.99, indicating that the signature is perfect in the training set. (G) The ROC of signature in the validation set. AUC is 0.783, indicating that the signature is good in the validation set.

The PCA of hub cytokines from RF and age shows that patients with COVID-19 with different outcomes are separate in both training and validation sets, revealing that the model possesses good discrimination (Figures 4D,E). In the training set, the AUC of the model was 0.99 (Figure 4F). Additionally, in the validation set, the AUC came to 0.783 (Figure 4G). The results indicated that the model we built possessed advantages in some situations.

Constructing a DT to classify patients and evaluate the model

To simplify the model and improve the feasibility of models for clinical application, we performed DT for all the variables previously mentioned. DT showed the interrelationship among the selected variables screened by the DT algorithm. The DT became the most straightforward tree when the complexity parameter (CP) was 0.1944444 (Figure 5). The first filtration age was (≥68 years), and the mortality was 48.27%. The second combination was IFN-α≥47 pg/ml. The mortality for patients with COVID-19 within the double combination was 100%. In the training set and validation set, the accuracy of the DT model was 0.875 and 0.86, respectively. The results revealed that the DT model considered both effectiveness and concise usability.

FIGURE 5
www.frontiersin.org

Figure 5. The characteristics of decision tree (DT). (A) The relative errors of the DT model and tree size when the complexity parameter comes to an ideal value. (B) The inter-relationship among selected clinical indicators.

Models' performance evaluation

We listed the results of the three models' performance in overall patients with COVID-19 in our cohort (the data from the training set and the validation set are merged, 138 patients). According to Figure 6, the models developed by logistic regression happened to be the highest in accuracy with 86.96% when compared with other models developed by the RF model and the DT model have 80.43 and 85.51%. While for sensitivity that shows the mortality rate of patients with COVID-19 correctly by the models, the logistic regression model seems to be the beat one with 96.3%, followed by the RF model with 70.37% and the DT model with 25.93%. Additionally, for specificity showing the survival rate of patients with COVID-19 correctly by the models, the DT model emerged to be the best one with 100%, followed by the RF model with 81.98% and the logistic regression model with 63.06%.

FIGURE 6
www.frontiersin.org

Figure 6. LR, RF, and DT models' performance evaluation results in the whole dataset.

Discussion

This research found that old age and several inflammatory cytokines played a crucial role in promoting severe COVID-19. We constructed multiple predictive prognostic models based on these factors to identify patients with COVID-19 at high risk of death at hospital entry. The models were validated using external datasets, and all models' performance is satisfactory. What we did may provide a novel insight into evaluating COVID-19 patients' conditions.

SARS-CoV-2, caused by COVID-19, spread worldwide at an unpredicted speed and brought profound and unfolding impacts on every aspect of human life. Previous studies revealed that patients with COVID-19 expressed huge heterogeneous characteristics, ranging from asymptomatic to losing lives (14, 15). COVID-19 infection affects various systems in the human body, including the immune system, leading to changes in the patient's internal environment and inflammation. Gao et al. (16) reported that pro-inflammatory cytokines were highly associated with severe disease. Previous studies had found that several cytokines were closely related to the development of COVID-19 but simply considered individual cytokines as predictive indicators (17, 18). COVID-19 causes immune dysregulation accompanied by multiple cytokine disturbances, which is hard to evaluate scientifically with one variable. Therefore, an effective inflammatory cytokines signature to comprehensively evaluate the COVID-19 patient's immune status must contain multiple variables. Inflammatory cytokines are texted in serum, which is easy to obtain from inpatients. The 12 inflammatory cytokines, the most essential and common series of cytokines, are sufficient to evaluate the patient's immune status.

In the research, we filtered out some prominent inflammatory cytokines in predicting the prognosis of COVID-19. IFN-α also has immunoregulatory effects, which might activate inflammatory responses and cause uncontrolled pathogenic damage (19). Krämer et al. (20) indicated preferential IFN-α responses in severe COVID-19 and declared that IFN-α was associated with a poorer COVID-19 infection outcome. Our research found that early high IFN-α signatures were hazardous features of poor prognoses for patients with COVID-19. Patients with COVID-19 with old age and elevated IFN-α levels suffered a very extreme risk of death. Systemic and autocrine IL-8 loops were essential neutrophil activation factors for immunopathology, triggering multiple cell dysfunctions (21). A previous study indicated that in patients with severe COVID-19, IL-8 might be a prognostic indicator for in-hospital death and a target for an effective treatment strategy (22). IL-8 was included in both the logistic regression and RF models in our research, which declared a crucial clinical value for this inflammatory cytokine. IL-6 is one of the most prominent inflammatory cytokines. According to Mojtabavi et al. (23), the elevated IL-6 level was an independent risk factor for adverse COVID-19 outcomes. There had already been several treatment strategies based on IL-6. Some results of them were reported as encouraging (24, 25). In our study, IL-6 significantly differed between death and survival cases in training and verification sets. This result revealed that IL-6 was a stable prognostic factor that could be applied on a large scale. Maione et al. (26) indicated IL-17A as a silent amplifier of cytokine storm in patients with COVID-19, activating several inflammatory pathways. In the clinical work, researchers identified IL-17A as a target to develop therapeutic strategies and made some progress (27). In this study, we screened out a possible link between elevated IL-17A levels and COVID-19 mortality.

Due to our work, we found that age was a key factor for all models and scored the highest in the RF model. There exist significant differences in age between survival and death cases. O'Driscoll et al. (28) declared that patients with COVID-19 aged older than 65 years suffered from higher mortality. Chen et al. (29) stated that age was related to declining and dysregulation of immune function, which heightened vulnerability to COVID-19 in elders. Previous researchers set the age>65 years as a high risk for severe COVID-19 outcomes (28, 30). Our DT sets age>68 years as a cutoff of the first combination, similar to those in previous reports. These results indicated that age played a major role in contributing to COVID-19 mortality.

Considering all models involved in this study comprehensively, the models developed with RF and logistic regression happened to be two well-rounded models, with considerable AUC in the training set and AUC = 0.783 in the validation set and high accuracy. However, the complexity of these models might bring some inconvenience in clinical applications. The model constructed by DT had good accuracy, specificity, and convenience. As for areas with underdeveloped medical conditions, it is an excellent choice to apply the DT model we constructed which was a simple model with good accuracy.

Building COVID-19 prognostic models through blood inflammatory cytokines levels is a novel thought. Thus, our research suggests that more extensive cohort studies should be conducted to reveal the role of inflammatory cytokines in predicting long-term post-COVID-19 complications.

Conclusion

In this research, we identified that advanced age, IFN-α, IL-8, and IL-6 have been identified as potential prognostic predictors of COVID-19 outcomes by multiple models in our research, which indicated that these cytokines might play a vital role in the progression of SARS-CoV-2. Therefore, we advised that accurate and quantitative detection of the inflammatory cytokines could be performed when necessary. The RF, logistic regression, and DT models based on blood inflammatory cytokines performed well in identifying patients with COVID-19 at risk of death. We strongly suggest that the models developed with RF and logistic regression should be applied in the regions with more abundant medical resources, and the model developed by DT could be used in the regions with less abundant medical. The models we developed can assist doctors in applying individual strategies to different risk cohorts to perform early intervention and treatment to benefit patients with COVID-19.

Data availability statement

The data and codes we used in this passage could be obtained from the corresponding author with reasonable requests. Requests to access these datasets should be directed to SS, sunshiren@medmail.com.cn.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

ZY and XL conducted the analysis and prepared an initial draft. JZ collected data. SS offered ideas for the project. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by the Special Scientific Research Project of Shanghai Medical Assistance Team from Air Force Military Medical University in 2022, 2022YH-1009.

Acknowledgments

We express our most sincere thanks to Professor Peter D. Katsikis and his team for their generous willingness to share their data. Meanwhile, we show great respect to all who fought against COVID-19.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2022.1001340/full#supplementary-material

References

1. Alsharif W, Qurashi A. Effectiveness of COVID-19 diagnosis and management tools: a review. Radiography (London, England: 1995). (2021) 27:682–7. doi: 10.1016/j.radi.2020.09.010

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Pascarella G, Strumia A, Piliego C, Bruno F, Del Buono R, Costa F, et al. COVID-19 diagnosis and management: a comprehensive review. J Intern Med. (2020) 288:192–206. doi: 10.1111/joim.13091

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Wu D, Wu T, Liu Q, Yang Z. The SARS-CoV-2 outbreak: what we know. Int J Infect Dis. (2020) 94:44–8. doi: 10.1016/j.ijid.2020.03.004

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Guan WJ Ni ZY, Hu Y, Liang WH, Ou CQ, He JX, et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. (2020) 382:1708–20. doi: 10.1056/NEJMoa2002032

PubMed Abstract | CrossRef Full Text | Google Scholar

5. CDC COVID-19 Response Team Food and Drug Administration. Allergic Reactions Including Anaphylaxis After Receipt of the First Dose of Moderna COVID-19 Vaccine - United States, December 21, 2020-January 10, 2021. MMWR Morb Mortal Wkly Rep. (2021) 70:125–9. doi: 10.15585/mmwr.mm7004e1

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Wack S, Patton T, Ferris LK. COVID-19 vaccine safety and efficacy in patients with immune-mediated inflammatory disease: review of available evidence. J Am Acad Dermatol. (2021) 85:1274–84. doi: 10.1016/j.jaad.2021.07.054

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Liu C, Chu D, Kalantar-Zadeh K, George J, Young HA, Liu G. Cytokines: from clinical significance to quantification. Adv Sci (Weinheim, Baden-Wurttemberg, Germany). (2021) 8:e2004433. doi: 10.1002/advs.202004433

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kunnumakkara AB, Rana V, Parama D, Banik K, Girisa S, Henamayee S, et al. COVID-19, cytokines, inflammation, and spices: How are they related? Life Sci. (2021) 284:119201. doi: 10.1016/j.lfs.2021.119201

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Kaur S, Bansal R, Kollimuttathuillam S, Gowda AM, Singh B, Mehta D, et al. The looming storm: blood and cytokines in COVID-19. Blood Rev. (2021) 46:100743. doi: 10.1016/j.blre.2020.100743

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Darif D, Hammi I, Kihel A, El Idrissi Saik I, Guessous F, Akarid K. The pro-inflammatory cytokines in COVID-19 pathogenesis: what goes wrong? Microb Pathog. (2021) 153:104799. doi: 10.1016/j.micpath.2021.104799

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Abers MS, Delmonte OM, Ricotta EE, Fintzi J, Fink DL, de Jesus AAA, et al. An immune-based biomarker signature is associated with mortality in COVID-19 patients. JCI Insight. (2021) 6. doi: 10.1172/jci.insight.144455

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Patterson BK, Guevara-Coto J, Yogendra R, Francisco EB, Long E, Pise A, et al. Immune-based prediction of COVID-19 severity and chronicity decoded using machine learning. Front Immunol. (2021) 12:700782. doi: 10.3389/fimmu.2021.700782

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Mueller YM, Schrama TJ, Ruijten R, Schreurs MWJ, Grashof DGB, van de Werken HJG, et al. Stratification of hospitalized COVID-19 patients into clinical severity progression groups by immuno-phenotyping and machine learning. Nat Commun. (2022) 13:915. doi: 10.1038/s41467-022-28621-0

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Badal S, Thapa Bajgain K, Badal S, Thapa R, Bajgain BB, Santana MJ. Prevalence, clinical characteristics, and outcomes of pediatric COVID-19: A systematic review and meta-analysis. J Clin Virol. (2021) 135:104715. doi: 10.1016/j.jcv.2020.104715

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Fu L, Wang B, Yuan T, Chen X, Ao Y, Fitzpatrick T, et al. Clinical characteristics of coronavirus disease (2019). (COVID-19) in China: a systematic review and meta-analysis. J Infect. (2020) 80:656–65. doi: 10.1016/j.jinf.2020.03.041

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Gao YD, Ding M, Dong X, Zhang JJ, Kursat Azkur A, Azkur D, et al. Risk factors for severe and critically ill COVID-19 patients: A review. Allergy. (2021) 76:428–55. doi: 10.1111/all.14657

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Liu F, Li L, Xu M, Wu J, Luo D, Zhu Y, et al. Prognostic value of interleukin-6, C-reactive protein, and procalcitonin in patients with COVID-19. J Clin Virol. (2020) 127:104370. doi: 10.1016/j.jcv.2020.104370

PubMed Abstract | CrossRef Full Text | Google Scholar

18. McElvaney OJ, McEvoy NL, McElvaney OF, Carroll TP, Murphy MP, Dunlea DM, et al. Characterization of the inflammatory response to severe COVID-19 illness. Am J Respir Crit Care Med. (2020) 202:812–21. doi: 10.1164/rccm.202005-1583OC

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Yang L, Wang J, Hui P, Yarovinsky TO, Badeti S, Pham K, et al. Potential role of IFN-α in COVID-19 patients and its underlying treatment options. Appl Microbiol Biotechnol. (2021) 105:4005–15. doi: 10.1007/s00253-021-11319-6

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Krämer B, Knoll R, Bonaguro L, ToVinh M, Raabe J, Astaburuaga-García R, et al. Early IFN-α signatures and persistent dysfunction are distinguishing features of NK cells in severe COVID-19. Immunity. (2021) 54:2650–69.e14. doi: 10.1016/j.immuni.2021.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Kaiser R, Leunig A, Pekayvaz K, Popp O, Joppich M, Polewka V, et al. Self-sustaining IL-8 loops drive a prothrombotic neutrophil phenotype in severe COVID-19. JCI Insight. (2021) 6:e150862. doi: 10.1172/jci.insight.150862

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Li H, Zhang J, Fang C, Zhao X, Qian B, Sun Y, et al. The prognostic value of IL-8 for the death of severe or critical patients with COVID-19. Medicine. (2021) 100:e23656. doi: 10.1097/MD.0000000000023656

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Mojtabavi H, Saghazadeh A, Rezaei N. Interleukin-6 and severe COVID-19: a systematic review and meta-analysis. Eur Cytokine Netw. (2020) 31:44–9. doi: 10.1684/ecn.2020.0448

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gordon AC, Mouncey PR, Al-Beidh F, Rowan KM, Nichol AD, Arabi YM, et al. Interleukin-6 receptor antagonists in critically ill patients with Covid-19. N Engl J Med. (2021) 384:1491–502. doi: 10.1056/NEJMoa2100433

PubMed Abstract | CrossRef Full Text | Google Scholar

25. RECOVERY Collaborative Group. Tocilizumab in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial. Lancet (London, England). (2021) 397:1637–45. doi: 10.1016/S0140-6736(21)00676-0

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Maione F, Casillo GM, Raucci F, Salvatore C, Ambrosini G, Costa L, et al. Interleukin-17A (IL-17A): a silent amplifier of COVID-19. Biomed Pharmacother. (2021) 142:111980. doi: 10.1016/j.biopha.2021.111980

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ayhan E, Öztürk M, An I, Abdelmaksoud A, Araç E. Potential role of anti-interleukin-17 in COVID-19 treatment. Dermatol Ther. (2020) 33:e13715. doi: 10.1111/dth.13715

PubMed Abstract | CrossRef Full Text | Google Scholar

28. O'Driscoll M, Ribeiro Dos Santos G, Wang L, Cummings DAT, Azman AS, Paireau J, et al. Age-specific mortality and immunity patterns of SARS-CoV-2. Nature. (2021) 590:140–5. doi: 10.1038/s41586-020-2918-0

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Chen Y, Klein SL, Garibaldi BT Li H, Wu C, Osevala NM, et al. Aging in COVID-19: Vulnerability, immunity and intervention. Ageing Res Rev. (2021) 65:101205. doi: 10.1016/j.arr.2020.101205

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Brodin P. Immune determinants of COVID-19 disease presentation and severity. Nat Med. (2021) 27:28–33. doi: 10.1038/s41591-020-01202-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: COVID-19, outcome, inflammatory cytokines, machine learning, prognostic models

Citation: Yu Z, Li X, Zhao J and Sun S (2022) Identification of hospitalized mortality of patients with COVID-19 by machine learning models based on blood inflammatory cytokines. Front. Public Health 10:1001340. doi: 10.3389/fpubh.2022.1001340

Received: 23 July 2022; Accepted: 24 October 2022;
Published: 17 November 2022.

Edited by:

Gopi Battineni, University of Camerino, Italy

Reviewed by:

Junxiang Chen, University of Pittsburgh, United States
Muhammad Imran Khan, The University of Haripur, Pakistan

Copyright © 2022 Yu, Li, Zhao and Sun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shiren Sun, sunshiren@medmail.com.cn

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.