Application of machine learning algorithm in prediction of lymph node metastasis in patients with intermediate and high-risk prostate cancer

Wang, Xiangrong; Zhang, Xiangxiang; Li, Hengping; Zhang, Mao; Liu, Yang; Li, Xuanpeng

doi:10.1007/s00432-023-04816-w

Application of machine learning algorithm in prediction of lymph node metastasis in patients with intermediate and high-risk prostate cancer

Research
Open access
Published: 02 May 2023

Volume 149, pages 8759–8768, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Cancer Research and Clinical Oncology Aims and scope Submit manuscript

Application of machine learning algorithm in prediction of lymph node metastasis in patients with intermediate and high-risk prostate cancer

Download PDF

Xiangrong Wang¹,
Xiangxiang Zhang¹,
Hengping Li¹,
Mao Zhang¹,
Yang Liu¹ &
…
Xuanpeng Li¹

1709 Accesses
1 Altmetric
Explore all metrics

Abstract

Purpose

This study aims to establish the best prediction model of lymph node metastasis (LNM) in patients with intermediate- and high-risk prostate cancer (PCa) through machine learning (ML), and provide the guideline of accurate clinical diagnosis and precise treatment for clinicals.

Methods

A total of 24,470 patients with intermediate- and high-risk PCa were included in this study. Multivariate logistic regression model was used to screen the independent risk factors of LNM. At the same time, six algorithms, namely random forest (RF), naive Bayesian classifier (NBC), xgboost (XGB), gradient boosting machine (GBM), logistic regression (LR) and decision tree (DT) are used to establish risk prediction models. Based on the best prediction performance of ML algorithm, a prediction model is established, and the performance of the model is evaluated from three aspects: area under curve (AUC), sensitivity and specificity.

Results

In multivariate logistic regression analysis, T stage, PSA, Gleason score and bone metastasis were independent predictors of LNM in patients with intermediate- and high-risk PCa. By comprehensively comparing the prediction model performance of training set and test set, GBM model has the best prediction performance (F1 score = 0.838, AUROC = 0.804). Finally, we developed a preliminary calculator model that can quickly and accurately calculate the regional LNM in patients with intermediate- and high-risk PCa.

Conclusion

T stage, PSA, Gleason and bone metastasis were independent risk factors for predicting LNM in patients with intermediate- and high-risk PCa. The prediction model established in this study performs well; however, the GBM model is the best one.

Clinico-radiological characteristic-based machine learning in reducing unnecessary prostate biopsies of PI-RADS 3 lesions with dual validation

Article 10 June 2020

Improved Prediction of Significant Prostate Cancer Following Repeated Prostate Biopsy by the Random Forest Classifier

Article 05 December 2022

Comparative Analysis of Breast and Prostate Cancer Prediction Using Machine Learning Techniques

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

According to the global cancer statistics in 2020, PCa ranks sixth in incidence rate and seventh in mortality in China. (Cao 2020). Pelvic lymph node metastasis (PLNM) accounts for about 15% of all newly diagnosed PCa patients, which is related to biochemical recurrence (BCR) and distant metastasis (DM) after treatment (von Bodman et al. 2010; Wilczak et al. 2018). Gervasi et al. reported that the 10-year risk of DM in lymph node positive patients was 83%, and the 10 year risk of death from PCa was 57% (Wagner et al. 2008). Extended pelvic lymph node dissection (ePLND) has become an integral part of radical prostatectomy (RP), while the American Association of Urology (AUA) and the European Association of Urology (EAU) recommend that low-risk patients do not need ePLND; ePLND is an option for patients with intermediate- and high-risk PCa whose Briganti nomogram predicts that the probability of LNM is greater than 5% (Engel et al. 2010; Lestingi et al. 2021). Therefore, the clinical staging of PCa is the key to precision medicine, and accurate identification of PLNM of PCa patients is crucial to determine the appropriate treatment plan (Hou et al. 2021; Mottet et al. 2017).

At present, many studies have reported that non-invasive imaging techniques can be used to predict LNM of PCa before treatment. CT and MRI, the most commonly used in clinic, can assess the status of pelvic lymph nodes by examining their size. Both of them have no obvious advantages and disadvantages, with a sensitivity of about 40% and a specificity of about 82% (Créhange et al. 2012; Hövels et al. 2008). Von Below et al. showed that multi parameter MRI (mpMRI) is more sensitive and specific than MRI in detecting tumors and lymph nodes, but it is easy to lose signal or image distortion in DWI sequence (von Below et al. 2016). Similarly, PSMA PET/CT has been widely used to detect PCa in prostate, soft tissue and bone, however, and its detection rate of 2–5 mm lymph node invasion is about 60% (Hofman et al. 2018; van Leeuwen et al. 2017). In addition, new imaging technologies are being developed such as MR lymphography with superparamagnetic iron oxide (SPIO) nanoparticles and targeted positron emission tomography imaging (PET) (Muteganya et al. 2018). Their efficacy of prediction for the NLM is still unclear.

Recently, scientists have made great efforts to explore different methods for more accurately evaluating the risks of LNM. However, due to the complexity of medical data, there are important connections between various factors, and certain differences in the calculation methods of models. Therefore, machine learning (ML) has become a powerful tool for improvement of clinical strategies in the field of medical research (Mirza et al. 2019; Oliveira 2019). Compared with traditional regression analysis, ML algorithm has significant advantages in prediction performance in large databases (Bi et al. 2019; Wang et al. 2020). Tian et al. established RDA model using ML to accurately predict LNM of early gastric cancer (Tian et al. 2021). Li et al. established XGB model to predict LNM of patients with osteosarcoma (Li and Liu et al. 2022). Li et al. established RF model to better predict LNM of Ewing’s sarcoma (Li and Zhou et al. 2022).

To our knowledge, there is no effective ML model for predicting risks of LNM of PCa. Therefore, in this study, we established a new model for predicting risks of LNM in patients with intermediate- and high-risk PCa through 6 ML methods based on the clinical and histopathological parameters that are closely related to the prognosis of the PCa in the SEER database.

Materials and methods

Study population

The training set and test set were recruited from the SEER database for patients diagnosed with intermediate- and high-risk PCa from 2000 to 2019. The patients diagnosed as intermediate- and high-risk PCa by Gansu Provincial Hospital from 2012 to 2018 will be taken as the validation set. Inclusion criteria were as follows: (1) patients with primary prostate cancer confirmed by the case; (2) at least meet one of PSA ≥ 10 ng/ml, Gleason score ≥ 7 or T stage ≥ T2b; (3) The clinical and pathological data and survival period were complete. Exclusion criteria: (1) no complete clinicopathological data and survival period; (2) PSA < 10 ng/ml, Gleason score < 7 and T1–T2a. Since the study was retrospective and the data were from an open database, informed consent was not used. The detailed screening process is shown in Fig. 1.

Establishment of predictive model

In this study, we compared the pathological characteristics selected from SEER database and external validation set, and analyzed the risk factors for predicting LNM using single factor analysis. Multivariate logistic regression analysis was used to evaluate the variables, and independent predictors related to LNM were obtained. Then we selected 6 common prediction models based on ML to predict LNM of intermediate- and high-risk PCa. We have established six models: random forest (RF), naive Bayesian classifier (NBC), xgboost (XGB), gradient boosting machine (GBM), logistic registration (LR) and decision tree (DT). The SEER dataset was divided by a ratio of 70:30. 70% is used for machine algorithm training, 30% is used for testing, and external verification was used as a separate verification set. In the training process of ML algorithm, each model is cross verified for 10 times to maintain the stability of the model, and the best super parameters are selected using random search method. The F1 score, AUROC, sensitivity and specificity of each model are comprehensively evaluated, compared the performance differences of different models, and selected the model with the highest accuracy as the final model according to the comprehensive score. Finally, the accuracy and generalization of the selected best prediction model are further verified using an independent external verification set.

Assessment of prediction model

We used area under curve (AUC) to evaluate the accuracy of each model. Considering the possibility of over fitting or under fitting, we combined the sensitivity and specificity of each model to obtain F1 score. In addition, we use decision curve analysis to test the prediction accuracy of the model.

Statistical analysis

We used SEER * STAT statistical software to extract training sets and test sets from SEER database. Hospital patients as an external validation set. All patient data were analyzed with SPSS V.25.0. Continuous variables are represented by the median of interquartile interval (IQR), and categorical variables are represented by values and proportions. Wilcoxon rank sum test is used for continuous variables, and chi square test or Fisher exact test is used for categorical variables. Univariate and multivariate logistic regression were used to analyze the risk factors of lymph node metastasis in high-risk PCa. P values lower than 0.05 were statistically significant. Adjusted odds ratios (ORs) and corresponding 95% confidence intervals (95% CI) were calculated. The modeling process is implemented through the Sci Kit Learn library (version 0.19.2) in Python (version 3.7.1). Test the training set with RF, NBC, XGB, GBM, LR and DT, and establish a prediction model. The relative importance of each input variable in each model is analyzed. We used 10 times cross validation and ROC curve analysis on the training set to test the performance of the model. Finally, the prediction accuracy of GBM model is further verified by decision curve analysis.

Results

Baseline characteristics

A total of 24,470 patients with intermediate- and high-risk PCa were included in this study, including 24,359 from SEER database and 111 from our hospital’s external validation set. Patients were divided into two groups according to whether they had LNM. There were significant differences between the two groups (patients with or without LNM) in terms of grade (p < 0.001), T stage (p < 0.001), M stage (p < 0.001), Stage (p < 0.001), Gleason (p < 0.001), PSA (p < 0.001), bone metastasis (p < 0.001), liver metastasis and lung metastasis (p < 0.001) (Table 1).

Table 1 Describe the study population according to whether there is lymph node metastasis

Full size table

Univariate and multivariate analyses of potential factors for predicting lymph node metastases

In univariate analysis, race (p = 0.049), grade (p < 0.001), T (p < 0.001), M (p < 0.001), stage (p < 0.001), Gleason score (p < 0.001), PSA (p < 0.001), bone metastasis (p < 0.001), liver metastasis (p < 0.001), and lung metastasis (p < 0.001) were significantly related to the occurrence of lymph node metastasis of intermediate- and high-risk PCa. There was no significant difference in age between the two groups. Multivariate logistic regression analysis showed that T (p = 0.016), Gleason (p = 0.031), PSA (p = 0.033) and bone metastasis (p < 0.001) were independent predictors of LNM (Table 2).

Table 2 Single- and multi-factor logistic regression analysis for the modeling group

Full size table

Screening and validation of the best machine learning model

With lymph node status as a prognostic indicator, four factors (p < 0.05) in the above logistic regression analysis were determined to enter the model as variables. In the training set, ML algorithms including RF, NBC, XGB, GBM, LR and DT are executed to establish the prediction model. We used 10 times cross validation training for patients in the training group to adjust parameter balance and avoid over fitting of the model. The data set was divided into 10 parts, including 9 parts for training and 1 part for rotation test. The final accuracy rate averaged 10 times (Figs. 2–3). We found that RF model has the best prediction ability, AUROC = 0.82 (Fig. 4). AUROC of all models in the test set is > 0.7. F1 score value is suitable for evaluating the prediction performance of unbalanced samples. In the test set, GBM has the best prediction performance, significantly better than RF (F1 value: 0.838, sensitivity (recall): 0.877, specificity: 0.783; F1 value: 0.798, sensitivity (recall): 0.857, specificity: 0.709). Based on the aforementioned results, GBM was selected as the best prediction model for predicting LNM (Table 3). Furthermore, decision curve analysis (Fig. 5) shows the accuracy of GBM model.

Table 3 Performance of the developed models

Full size table

Permutation feature of importance

In the six models, the relative importance order of each input variable is slightly different. T, PSA and Gleason are almost the first three indicators of each model, and bone metastasis is a lower indicator. (Fig. 6) In the GBM model, the order of relative importance of the variables from high to low is T, PSA, Gleason and bone metastasis.

Calculator preliminary model

The GBM model performs best among the six models. Accordingly, we have established a calculator preliminary model to promote the clinical application of this prediction model (Fig. 7).

Discussion

LNM is a paramount prognostic factor for patients with PCa, and has been proved to be an important predictor of BCR survival, metastasis free survival and overall survival of PCa (Engel et al. 2010; Wilczak et al. 2018). Wessels et al. extracted prognostic information from the H&E histology of PCa and used the deep learning method to predict the LN status in PCa patients (Wessels et al. 2021). Hou et al. established PLNM risk calculator by integrating radiologist’s interpretation, clinicopathologic factors and MRIs, and using ML and deep migration learning algorithms (Hou et al. 2021). For the sake of accurately evaluating the risk of LNM, Some studies have designed different prediction models for lymph node prediction of intermediate- and high-risk PCa according to the detection pathway. Diamand R et al. reported and validated the LNM of patients treated with ePLND by nomogram, and provided a more reasonable cut-off value (Diamand et al. 2020). Ferraro DA et al. designed a new model by combining PSA, Gleason score and visual lymph node analysis on 68 Ga-PSMA-11 PET. Compared with the previously used clinical nomograms, this model has a remarkably improved the positive rate of LNM in the patient selecting to perform ePLND (Ferraro et al. 2020). In this study, we used the large sample size of SEER database and ML algorithm to develop six prediction models to predict LNM in the patients with intermediate- and high-risk PCa. Logistic regression analysis showed that T stage, Gleason score, PSA and bone metastasis were independent risk factors for pelvic LNM of intermediate- and high-risk PCa.

Among the six models, the AUC value of GBM model is the highest, and the prediction accuracy of other models for LNM is about 80%. RF model shows the best prediction performance before and after data balancing, with obvious advantages of high precision and fast speed; however, it also has the disadvantage of over fitting. F1 score, which represents the harmonic average of the accuracy rate and recall rate, is the final assessment parameter of the evaluating each model. According to the evaluation results of the test set, the prediction performance of GBM model is better than that of RF model. It can be seen that RF model may show over fitting in the training process, which makes it unsuitable for the data in the test set, while GBM model has the best prediction performance. To increase the application feasibility of this model, we developed a calculator to evaluate the individual probability of LNM in patients with intermediate- and high-risk PCa.

The results of this study showed that T stage, PSA, Gleason score and bone metastasis were the most important predictors in the patients with intermediate- and high-risk PCa. As an important indicator of tumor progression, T stage is positively correlated with LNM in a large number of tumors (Barriera-Silvestrini et al. 2021). A large number of research data in this study show that the level of high PSA will increase the rate of lymph node invasion, which is contrary to the results of the previous studies. The possible reason is PSA may be more meaningful in D'Amico risk stratification. The increase of Gleason score also increases the risk of lymph node invasion (Turk et al. 2018). Bone metastasis is significantly related to LNM of PCa, which can provide some ideas for follow-up research, that is, consider the existence of metastasis of other sites as a factor before patients have LNM.

The EAU guidelines used Briganti’s nomogram prediction model to screen ePLND patients. The advantage of this study is to compare several models head-to-head with the nomogram model. The sensitivity, specificity and AUC of the nomogram are 0.882, 0.705 and 0.80, respectively, while the sensitivity, specificity and AUC of GBM are 0.877, 0.783 and 0.813, respectively. It shows that GBM in the six predictive models has the best predictive value for LNM in the patients with intermediate- and high-risk PCa. To further facilitate clinical application, we designed a preliminary calculator model that can quickly calculate the probability of LNM.

Of course, this study has several limitations. First, this study is a retrospective study, which may have some selection bias. Second, SEER database lacks more data such as tumor volume, percentage of positive tissue cores, testosterone level, and so on. In addition, the external validation set data is small, and more sample sizes need to be included to test the effectiveness of the model. Finally, although we have corrected the sample imbalance problem of SEER dataset as much as possible, this problem will still interfere with the results and affect the generalization ability of the model.

Conclusion

This research has developed and validated six prediction models using ML algorithm, of which GBM model has the best performance. Based on this algorithm, a preliminary model of the calculator is designed, and then the local LNM probability in patients with intermediate- and high-risk PCa can be individually predicted according to the existing clinical characteristics, which can help clinicians quickly and accurately assess the risk of LNM, finally, precise therapy.

Data availability

The data on which the study is based is available from the repository and can be downloaded at the following link (https://seer.cancer.gov). Relevant information will be provided upon reasonable request.

Abbreviations

LNM:: Lymph node metastasis
PCa:: Prostate cancer
ML:: Machine learning
RF:: Random forest
NBC:: Naive Bayesian classifier
XGB:: Xgboost
GBM:: Gradient boosting machine
LR:: Logistic regression
DT:: Decision tree
ROC:: Receiver operating characteristic
AUC:: Area under curve
PLNM:: Pelvic lymph node metastasis
ePLND:: Extended pelvic lymph node dissection
SEER:: Surveillance, epidemiology and end results
BCR:: Biochemical recurrence

References

Barriera-Silvestrini P, Iacullo J, Knackstedt TJ (2021) American joint committee on cancer staging and other platforms to assess prognosis and risk. Clin Plast Surg 48(4):599–606
Article PubMed Google Scholar
Bi Q, Goodman KE, Kaminsky J, Lessler J (2019) What is machine learning? A primer for the epidemiologist. Am J Epidemiol 188(12):2222–2239. https://doi.org/10.1093/aje/kwz189
Article PubMed Google Scholar
Cao W, Chen HD, Yu YW, Li N, Chen WQ (2021) Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin Med J (engl) 134(7):783–791. https://doi.org/10.1097/CM9.0000000000001474
Article PubMed Google Scholar
Créhange G, Chen CP, Hsu CC, Kased N, Coakley FV, Kurhanewicz J, Roach MR (2012) Management of prostate cancer patients with lymph node involvement: a rapidly evolving paradigm. Cancer Treat Rev 38(8):956–967. https://doi.org/10.1016/j.ctrv.2012.05.005
Article PubMed PubMed Central Google Scholar
Diamand R, Oderda M, Albisinni S, Fourcade A, Fournier G, Benamran D, Iselin C, Fiard G, Descotes JL, Assenmacher G, Svistakov I, Peltier A, Simone G, Di Cosmo G, Roche JB, Bonnal JL, Van Damme J, Rossi M, Mandron E, Gontero P, Roumeguère T (2020) External validation of the Briganti nomogram predicting lymph node invasion in patients with intermediate and high-risk prostate cancer diagnosed with magnetic resonance imaging-targeted and systematic biopsies: a European multicenter study. Urol Oncol 38(11):847–849. https://doi.org/10.1016/j.urolonc.2020.04.011
Article Google Scholar
Engel J, Bastian PJ, Baur H, Beer V, Chaussy C, Gschwend JE, Oberneder R, Rothenberger KH, Stief CG, Hölzel D (2010) Survival benefit of radical prostatectomy in lymph node-positive patients with prostate cancer. Eur Urol 57(5):754–761. https://doi.org/10.1016/j.eururo.2009.12.034
Article PubMed Google Scholar
Ferraro DA, Muehlematter UJ, Garcia SH, Rupp NJ, Huellner M, Messerli M, Rüschoff JH, Ter Voert E, Hermanns T, Burger IA (2020) (68)Ga-PSMA-11 PET has the potential to improve patient selection for extended pelvic lymph node dissection in intermediate to high-risk prostate cancer. Eur J Nucl Med Mol Imaging 47(1):147–159. https://doi.org/10.1007/s00259-019-04511-4
Article PubMed Google Scholar
Hofman MS, Hicks RJ, Maurer T, Eiber M (2018) Prostate-specific membrane antigen pet: clinical utility in prostate cancer, normal patterns, pearls, and pitfalls. Radiographics 38(1):200–217. https://doi.org/10.1148/rg.2018170108
Article PubMed Google Scholar
Hou Y, Bao J, Song Y, Bao ML, Jiang KW, Zhang J, Yang G, Hu CH, Shi HB, Wang XM, Zhang YD (2021) Integration of clinicopathologic identification and deep transferrable image feature representation improves predictions of lymph node metastasis in prostate cancer. EBioMedicine 68:103395. https://doi.org/10.1016/j.ebiom.2021.103395
Article PubMed PubMed Central Google Scholar
Hövels AM, Heesakkers RA, Adang EM, Jager GJ, Strum S, Hoogeveen YL, Severens JL, Barentsz JO (2008) The diagnostic accuracy of CT and MRI in the staging of pelvic lymph nodes in patients with prostate cancer: a meta-analysis. Clin Radiol 63(4):387–395. https://doi.org/10.1016/j.crad.2007.05.022
Article PubMed Google Scholar
Lestingi J, Guglielmetti GB, Trinh QD, Coelho RF, Pontes JJ, Bastos DA, Cordeiro MD, Sarkis AS, Faraj SF, Mitre AI, Srougi M, Nahas WC (2021) Extended versus limited pelvic lymph node dissection during radical prostatectomy for intermediate- and high-risk prostate cancer: early oncological outcomes from a randomized phase 3 trial. Eur Urol 79(5):595–604. https://doi.org/10.1016/j.eururo.2020.11.040
Article PubMed Google Scholar
Li W, Liu Y, Liu W, Tang ZR, Dong S, Li W, Zhang K, Xu C, Hu Z, Wang H, Lei Z, Liu Q, Guo C, Yin C (2022a) Machine learning-based prediction of lymph node metastasis among osteosarcoma patients. Front Oncol 12:797103. https://doi.org/10.3389/fonc.2022.797103
Article PubMed PubMed Central Google Scholar
Li W, Zhou Q, Liu W, Xu C, Tang ZR, Dong S, Wang H, Li W, Zhang K, Li R, Zhang W, Hu Z, Shibin S, Liu Q, Kuang S, Yin C (2022b) A Machine learning-based predictive model for predicting lymph node metastasis in patients with Ewing’s Sarcoma. Front Med (lausanne) 9:832108. https://doi.org/10.3389/fmed.2022.832108
Article PubMed Google Scholar
Mirza B, Wang W, Wang J, Choi H, Chung NC, Ping P (2019) Machine learning and integrative analysis of biomedical big data. Genes (basel) 10(2):87. https://doi.org/10.3390/genes10020087
Article CAS PubMed Google Scholar
Mottet N, Bellmunt J, Bolla M, Briers E, Cumberbatch MG, De Santis M, Fossati N, Gross T, Henry AM, Joniau S, Lam TB, Mason MD, Matveev VB, Moldovan PC, van den Bergh R, Van den Broeck T, van der Poel HG, van der Kwast TH, Rouvière O, Schoots IG, Wiegel T, Cornford P (2017) EAU-ESTRO-SIOG guidelines on prostate cancer part 1: screening, diagnosis, and local treatment with curative intent. Eur Urol 71(4):618–629. https://doi.org/10.1016/j.eururo.2016.08.003
Article PubMed Google Scholar
Muteganya R, Goldman S, Aoun F, Roumeguère T, Albisinni S (2018) Current imaging techniques for lymph node staging in prostate cancer: a review. Front Surg 5:74. https://doi.org/10.3389/fsurg.2018.00074
Article PubMed PubMed Central Google Scholar
Oliveira AL (2019) Biotechnology, big data and artificial intelligence. Biotechnol J 14(8):e1800613. https://doi.org/10.1002/biot.201800613
Article CAS PubMed Google Scholar
Tian H, Ning Z, Zong Z, Liu J, Hu C, Ying H, Li H (2021) Application of machine learning algorithms to predict lymph node metastasis in early gastric cancer. Front Med (lausanne) 8:759013. https://doi.org/10.3389/fmed.2021.759013
Article PubMed Google Scholar
Turk H, Ün S, Koca O, Cinkaya A, Kodaz H, Zorlu F (2018) The factors that affect the prediction of lymph node metastasis in prostate cancer. J Cancer Res Ther 14(5):1094–1098. https://doi.org/10.4103/0973-1482.187286
Article PubMed Google Scholar
van Leeuwen PJ, Emmett L, Ho B, Delprado W, Ting F, Nguyen Q, Stricker PD (2017) Prospective evaluation of 68Gallium-prostate-specific membrane antigen positron emission tomography/computed tomography for preoperative lymph node staging in prostate cancer. BJU Int 119(2):209–215. https://doi.org/10.1111/bju.13540
Article CAS PubMed Google Scholar
von Below C, Daouacher G, Wassberg C, Grzegorek R, Gestblom C, Sörensen J, Ahlström H, Waldén M (2016) Validation of 3 T MRI including diffusion-weighted imaging for nodal staging of newly diagnosed intermediate- and high-risk prostate cancer. Clin Radiol 71(4):328–334. https://doi.org/10.1016/j.crad.2015.12.001
Article Google Scholar
von Bodman C, Godoy G, Chade DC, Cronin A, Tafe LJ, Fine SW, Laudone V, Scardino PT, Eastham JA (2010) Predicting biochemical recurrence-free survival for patients with positive pelvic lymph nodes at radical prostatectomy. J Urol 184(1):143–148. https://doi.org/10.1016/j.juro.2010.03.039
Article Google Scholar
Wagner M, Sokoloff M, Daneshmand S (2008) The role of pelvic lymphadenectomy for prostate cancer–therapeutic? J Urol 179(2):408–413. https://doi.org/10.1016/j.juro.2007.09.027
Article CAS PubMed Google Scholar
Wang Z, Li H, Carpenter C, Guan Y (2020) Challenge-enabled machine learning to drug-response prediction. AAPS J 22(5):106. https://doi.org/10.1208/s12248-020-00494-5
Article PubMed Google Scholar
Wessels F, Schmitt M, Krieghoff-Henning E, Jutzi T, Worst TS, Waldbillig F, Neuberger M, Maron RC, Steeg M, Gaiser T, Hekler A, Utikal JS, von Kalle C, Fröhling S, Michel MS, Nuhn P, Brinker TJ (2021) Deep learning approach to predict lymph node metastasis directly from primary tumour histology in prostate cancer. BJU Int 128(3):352–360. https://doi.org/10.1111/bju.15386
Article CAS PubMed Google Scholar
Wilczak W, Wittmer C, Clauditz T, Minner S, Steurer S, Büscheck F, Krech T, Lennartz M, Harms L, Leleu D, Ahrens M, Ingwerth S, Günther CT, Koop C, Simon R, Jacobsen F, Tsourlakis MC, Chirico V, Höflmayer D, Vettorazzi E, Haese A, Steuber T, Salomon G, Michl U, Budäus L, Tilki D, Thederan I, Fraune C, Göbel C, Henrich MC, Juhnke M, Möller K, Bawahab AA, Uhlig R, Adam M, Weidemann S, Beyer B, Huland H, Graefen M, Sauter G, Schlomm T (2018) Marked prognostic impact of minimal lymphatic tumor spread in prostate cancer. Eur Urol 74(3):376–386. https://doi.org/10.1016/j.eururo.2018.05.034
Article PubMed Google Scholar

Download references

Funding

This study was supported by Gansu Natural Science Foundation (22JR5RA670 and 22JR11RA271) and Gansu Provincial Hospital (17GSSY3-4).

Author information

Authors and Affiliations

Department of Urology, Gansu Provincial Hospital, Lanzhou, Gansu, China
Xiangrong Wang, Xiangxiang Zhang, Hengping Li, Mao Zhang, Yang Liu & Xuanpeng Li

Authors

Xiangrong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiangxiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hengping Li
View author publications
You can also search for this author in PubMed Google Scholar
Mao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xuanpeng Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

WXR and ZXX: carried out the research and design. ZM: conducted research and collected and analyzed data. LHP: conceived this research and helped shape language. LXP and LY: provided suggestions. All authors have contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Hengping Li.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

Ethics approval

The SEER database is an open and identifiable public database, so it does not require the approval and informed consent of the agency review committee. For single-center data, this study is a retrospective study. The basic information of patients is not involved in the study, so the approval of the ethics committee is not required.

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Consent to publish

All authors consent to publish this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Zhang, X., Li, H. et al. Application of machine learning algorithm in prediction of lymph node metastasis in patients with intermediate and high-risk prostate cancer. J Cancer Res Clin Oncol 149, 8759–8768 (2023). https://doi.org/10.1007/s00432-023-04816-w

Download citation

Received: 16 March 2023
Accepted: 23 April 2023
Published: 02 May 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s00432-023-04816-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Application of machine learning algorithm in prediction of lymph node metastasis in patients with intermediate and high-risk prostate cancer

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Clinico-radiological characteristic-based machine learning in reducing unnecessary prostate biopsies of PI-RADS 3 lesions with dual validation

Improved Prediction of Significant Prostate Cancer Following Repeated Prostate Biopsy by the Random Forest Classifier

Comparative Analysis of Breast and Prostate Cancer Prediction Using Machine Learning Techniques

Introduction

Materials and methods

Study population

Establishment of predictive model

Assessment of prediction model

Statistical analysis

Results

Baseline characteristics

Univariate and multivariate analyses of potential factors for predicting lymph node metastases

Screening and validation of the best machine learning model

Permutation feature of importance

Calculator preliminary model

Discussion

Conclusion

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Consent to publish

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation