Automated machine learning to predict the difficulty for endoscopic resection of gastric gastrointestinal stromal tumor

Liu, Luojie; Zhang, Rufa; Shi, Dongtao; Li, Rui; Wang, Qinghua; Feng, Yunfu; Lu, Fenying; Zong, Yang; Xu, Xiaodan

doi:10.3389/fonc.2023.1190987

ORIGINAL RESEARCH article

Front. Oncol., 10 May 2023

Sec. Gastrointestinal Cancers: Gastric and Esophageal Cancers

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1190987

This article is part of the Research Topic Precise Diagnosis, Functional Mechanisms, and Therapeutic Potentials in Gastrointestinal Cancers – Volume II View all 36 articles

Automated machine learning to predict the difficulty for endoscopic resection of gastric gastrointestinal stromal tumor

Luojie Liu¹

Rufa Zhang¹

Dongtao Shi²

Rui Li²

Qinghua Wang³

Yunfu Feng³

Fenying Lu⁴

Yang Zong^5*

Xiaodan Xu^1*

¹Department of Gastroenterology, Changshu Hospital Affiliated to Soochow University, Suzhou, China
²Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
³Department of Gastroenterology, No.1 People’s Hospital of Kunshan, Suzhou, China
⁴Department of Gastroenterology, No.2 People’s Hospital of Changshu, Suzhou, China
⁵Department of General Surgery, Changshu Hospital Affiliated to Soochow University, Suzhou, China

Background: Accurate preoperative assessment of surgical difficulty is crucial to the success of the surgery and patient safety. This study aimed to evaluate the difficulty for endoscopic resection (ER) of gastric gastrointestinal stromal tumors (gGISTs) using multiple machine learning (ML) algorithms.

Methods: From December 2010 to December 2022, 555 patients with gGISTs in multi-centers were retrospectively studied and assigned to a training, validation, and test cohort. A difficult case was defined as meeting one of the following criteria: an operative time ≥ 90 min, severe intraoperative bleeding, or conversion to laparoscopic resection. Five types of algorithms were employed in building models, including traditional logistic regression (LR) and automated machine learning (AutoML) analysis (gradient boost machine (GBM), deep neural net (DL), generalized linear model (GLM), and default random forest (DRF)). We assessed the performance of the models using the areas under the receiver operating characteristic curves (AUC), the calibration curve, and the decision curve analysis (DCA) based on LR, as well as feature importance, SHapley Additive exPlanation (SHAP) Plots and Local Interpretable Model Agnostic Explanation (LIME) based on AutoML.

Results: The GBM model outperformed other models with an AUC of 0.894 in the validation and 0.791 in the test cohorts. Furthermore, the GBM model achieved the highest accuracy among these AutoML models, with 0.935 and 0.911 in the validation and test cohorts, respectively. In addition, it was found that tumor size and endoscopists’ experience were the most prominent features that significantly impacted the AutoML model’s performance in predicting the difficulty for ER of gGISTs.

Conclusion: The AutoML model based on the GBM algorithm can accurately predict the difficulty for ER of gGISTs before surgery.

Introduction

Gastric gastrointestinal stromal tumors (gGISTs) are the most common mesenchymal tumors of the gastrointestinal tract (1). Endoscopic resection (ER) is a minimally invasive and effective treatment option for small GISTs, but the procedure can be challenging for larger and more complex tumors (2, 3). To ensure the safety and efficacy of ER, it is essential to predict the difficulty of the procedure beforehand accurately. Traditional methods of predicting difficulty rely on subjective assessment by experienced endoscopists, which can be influenced by interobserver variability and other factors. Su et al. (4) have made the first-ever prediction of the difficulty in ER of gGISTs by constructing a nomogram. The area under the receiver operating characteristic (ROC) curves (AUC) and the accuracy of this model in predicting surgical difficulty were found to be 0.756 and 0.798, respectively. Although the model has demonstrated exemplary performance, finer models could yield even better results.

Machine learning (ML) is becoming increasingly prevalent in medicine because of its efficient computing algorithms, which enable the learning of valuable insights from vast amounts of clinical data (5, 6). Previous studies (7–11) have established the immense potential of ML in developing models for disease diagnosis, predicting prognosis, analyzing survival rates, and other medical applications. Automated machine learning (AutoML), a new type of ML, intelligently chooses from a range of algorithms and hyperparameters to create customized models based on specific target data (12, 13). Compared to traditional ML, AutoML utilizes intelligent early stopping, regularization, hyperparameter optimization, and cross-validation techniques, allowing for the development of more accurate models in less time.

In this study, we aimed to provide a dataset consisting of clinical and endoscopic features of patients with gGISTs from multiple centers. We used this dataset to train, validate, and test a series of machine learning models to predict the difficulty for ER of gGISTs.

Material and methods

Patients

We conducted a retrospective analysis of consecutive patients who underwent ER of gGISTs at the First Affiliated Hospital of Soochow University between December 2010 and December 2022. The patients were randomly divided into training and validation cohorts in a 7:3 ratio. In addition, we gathered information on patients who received ER of gGISTs at Changshu Hospital Affiliated to Soochow University, No.1 People’s Hospital of Kunshan, and No.2 People’s Hospital of Changshu from January 2017 to December 2022. This data was used to create the test cohort for the study. The main inclusion criteria were (1): diagnosis of gGIST through pathological and immunohistochemical examination after surgery (2); regular preoperative blood routine, coagulation tests, and electrocardiogram results (3); absence of lymph node or distant metastasis in patients. Patients who met any of the following criteria were excluded from the study (1): lesions with a high risk of malignancy based on EUS evaluation (2); patients with synchronous lesions in multiple locations (3); patients with multiple lesions in the stomach (4); patients with poor cardiopulmonary function and unable to undergo anesthesia and surgery (5); incomplete medical records of the patient. Our institutions received ethical approval for the clinical research study protocol from the ethics committee. Before the ER procedure, all patients were thoroughly informed about the advantages and potential risks and provided with a signed written consent form. The reporting of this study conforms to STROBE guidelines (14).

Endoscopic equipment and procedures

Based on the nature of the lesion, we employed three distinct ER techniques: endoscopic submucosal dissection (ESD), endoscopic full-thickness resection (EFTR), and submucosal tunnel endoscopic resection (STER). ESD is employed to treat gGISTs that arise from either the muscularis mucosae (MM) or muscularis propria (MP) and protrude into the lumen. If GISTs originate from the deep MP with extraluminal growth or tumors that cannot be separated from the serosal layer during ESD, EFTR can be utilized as a treatment. STER is mainly used for gGISTs that grow in the gastroesophageal junction or greater curvature of the stomach, where a submucosal tunnel can be quickly established. Comprehensive information regarding ER procedures can be found in the previous publication (15–17). Although the endoscopists involved in the procedures had varying degrees of experience with ER of gGISTs, all cases were performed by senior endoscopists with extensive experience. These endoscopists had previously completed over 5,000 gastroscopy and colonoscopy procedures and more than 200 EMR procedures for gastrointestinal polyps before performing ER for gGISTs. In our study, an endoscopist was considered experienced in ER of gGISTs once he or she had carried out a cumulative sum (CUSUM) of 50 such procedures. General anesthesia and endotracheal intubation were administered to all patients. All patients were placed in the left lateral position. The ER procedures utilized either a dual knife (KD-650L; Olympus^®, Japan), an insulated-tip knife (KD-611L; Olympus^®, Japan), or a combination of the two. A single-channel endoscope (GIF-Q260J, Olympus^®, Japan) equipped with a transparent cap on its tip was employed. The energy output was achieved using a High-frequency electric coagulation and electrocautery device (ERBE^® VIO 200D). Other equipment utilized during the procedures included metallic clips, nylon loops (LeClampTM^® Loop-20 and Loop-30; Leo, Changzhou, China), over-the-scope clips (OTSC), injection needles, hot biopsy forceps, and a carbon dioxide insufflator.

Postoperative management

Following surgery, specimens were preserved in a 10% formalin solution, and immunohistochemical staining (including CD117, CD34, and Dog-1, among others) was conducted to confirm the diagnosis. Typically, patients receive nasogastric decompression after surgery to prevent postoperative complications. They are instructed to fast for two days, or three days or more in the case of EFTR patients, depending on their postoperative status. Blood routine, CRP, and/or calcitonin tests were carried out after surgery, and all patients were administered proton pump inhibitors, gastric mucosal protective agents, nutritional support, and fluid replacement. When patients exhibited abdominal pain or muscle tension, a CT or orthostatic X-ray scan was performed to rule out postoperative perforation. Antibiotic therapy or surgical treatment was administered based on their condition. For patients who experienced intraoperative perforation or postoperative infection, antibiotics were prescribed.

Data collection

Patient information, such as gender, age, history of smoking or alcohol consumption, primary symptoms, medical history, American Society of Anesthesiologists (ASA) score (18), body mass index (BMI), tumor size, location, shape, depth of invasion, boundary characteristics, procedure duration, intraoperative and postoperative complications, R0 resection rates, ER technique used, modified National Institutes of Health (NIH) risk criteria (19), number of days of postoperative fasting, and length of hospital stay following surgery, were gathered from electronic medical records of our institutions.

Definitions

A difficult case was defined as meeting one of the following criteria: an operative time ≥ 90 min, severe intraoperative bleeding, or conversion to laparoscopic resection. The operative time was determined from the point at which the submucosal injection began to the completion of the closure of the defect. The origin of the tumor was identified based on preoperative endoscopic ultrasonography (EUS) examination. Tumors with a round, oval, or nodular shape were categorized as having a regular shape, whereas those with a branching shape were designated as having an irregular shape. Severe intraoperative bleeding was characterized by repeated endoscopic hemostasis, a postoperative decrease in hemoglobin levels exceeding 2 g/dL, or necessitating surgical assistance (20, 21). Tumor characteristics, such as tumor size and location, were assessed based on preoperative endoscopic ultrasound examination or abdominal-enhanced computed tomography (CT) scans. Postoperative complications included delayed bleeding, delayed perforation, and postoperative infection. Delayed bleeding was defined as clinical evidence of bleeding that occurred after ER, as evidenced by hematemesis or melena, a decline in hemoglobin levels of more than 2.0 g/dL within 24 hours, or the need for endoscopic therapy (22). Delayed perforation was verified through X-ray or CT. Postoperative infection was determined by a postoperative body temperature exceeding 37.5°C and/or an increase in inflammatory indicators such as blood routine, CRP, or calcitonin (23). R0 resection was defined as the surgical removal of a tumor with no residual cancerous tissue detected in the margins of the excised tissue, as confirmed by histological examination of the specimen’s radial and deep margins (24).

Automated machine learning

AutoML analysis was carried out using the H2O package installed from the H2O.ai platform (www.h2o.ai), which automatically selects and combines suitable algorithms into several ensemble models. The set of algorithms comprises a randomized grid of Gradient Boosting Machines (GBMs), a randomized grid of Deep Neural Networks (DLs), a default Random Forest (DRF), and a fixed grid of Generalized Linear Models (GLMs). Hyperparameter optimization was conducted through a 5-fold cross-validation grid search on the training set, where various combinations of hyperparameters included in the grid search were evaluated based on their AUCs. AutoML visualization was presented through feature importance, SHapley Additive exPlanation (SHAP), and Local Interpretable Model Agnostic Explanation (LIME) techniques. Through SHAP analysis, it was possible to determine the key features that significantly influenced the model predictions and the extent of their contribution to the overall model performance for a specific prediction (25). By randomly selecting examples from the test set, LIME analysis illustrated the contribution of each feature toward predicting the outcome (26).

Statistical analysis

Categorical variables were expressed as frequencies and percentages, and the Chi-square test or Fisher exact test was used to compare groups. Continuous variables were expressed as the median and interquartile ranges (IQR), and a comparison between the two groups was made using the Mann-Whitney U test. To address the issue of multiple collinear relationships among the explanatory variables, a univariate analysis was performed using the least absolute shrinkage and selection operator (LASSO) regression model with the minimum criterion. The model was then further refined using a binary logistic backward stepwise regression analysis. The predictive performance of the resulting model was evaluated using the areas under the receiver operating characteristic curves (AUC), calibration curve, and decision curve analysis (DCA). Furthermore, a nomogram was constructed based on the independent risk factors identified in the multivariate analysis. The statistical significance level was set at P < 0.05. R software (version 4.1.0) was utilized for conducting all the statistical analyses.

Results

Baseline characteristics of patients and lesions

In this study, a total of 555 patients were enrolled, out of which 97 cases (17.5%) experienced difficulty in the whole cohort. Figure 1 illustrates the study protocol in the form of a flow chart, while Table 1 presents the features of 555 gGISTs in the developing and test cohorts. In the developing dataset, there were 195 men (45.2%) and 236 women (54.8%). The proportion of patients aged < 60 years in the difficult group was 43.0%, while in the non-difficult group, it was 51.7%. In the test dataset, the proportion of female patients with gGISTs is higher than that of male patients (62.9% vs. 37.1%). No significant differences were observed between the two groups of three datasets in terms of sex, age, history of smoking or alcohol consumption, medical history, ASA score, and BMI (P > 0.05).

TABLE 1

Table 1 Demographic and clinical characteristics of patients in training, validation and test groups.

FIGURE 1

Figure 1 Flow chart of the study. gGISTs, gastric gastrointestinal stromal tumors; GBM, gradient boost machine; DL, deep neural net; GLM, generalized linear model; DRF, default random forset; LASSO, least absolute shrinkage and selection operator.

Univariate and multivariate logistic regression analysis

By utilizing the LASSO regression model with a minimum criterion attained through 5-fold cross-validation, four variables out of 17 were selected and designated as independent risk factors. This approach was employed to address the issue of multiple collinear relationships among the explanatory variables, as depicted in Supplementary Figure 1. A logistic model comprising of four variables (tumor size, invasion depth, location, and endoscopists’ experience) was ultimately established and presented as both a nomogram and a score system, suitable for clinical utilization (Figure 2). The calibration curves pertaining to the training set, validation set, and test set are depicted in Supplementary Figure 2, and the mean absolute errors being 0.021, 0.035 and 0.043, respectively. The calibration curves provided evidence that the LASSO model’s estimated risk was in close proximity to the actual risk, implying a considerable level of dependability. The DCA plots of the LASSO model in the test set demonstrated that when the threshold probability of a difficult procedure predicted by the LASSO model was between 20% and 100%, an intervention might add more benefit (10% - 80%) (Supplementary Figure 3). The DCA plots of the AutoML models are presented in Supplementary Figure 4, and the net benefit of these models is about 80%.

FIGURE 2

Figure 2 Nomogram of the LASSO model for predicting the difficulty for endoscopic resection of gGIST. LASSO, least absolute shrinkage and selection operator; gGISTs, gastric gastrointestinal stromal tumors.

Automated machine learning analysis

Using four ML algorithms (GBM, DL, GLM, and DRF), 64 models were constructed, with the stacked ensemble models being excluded due to limited interpretability. The GBM model outperformed the other models, exhibiting the highest AUC values and accuracy, and consequently deemed the most optimal model. Figure 3 indicates that tumor size was identified as the most crucial feature, followed by endoscopists’ experience, invasion depth, location (cross-sectional), shape, BMI, location (longitudinal), primary symptom, history of smoking, and sex, in that order of importance. Additionally, tumor size, endoscopists’ experience, invasion depth, and location (longitudinal) were identified as the common important variables shared by the GBM and logistic regression models. Figure 4 displays the SHAP contribution plots generated by GBM algorithms, illustrating the ten most significant variables, namely tumor size, endoscopists’ experience, location (cross-sectional), sex, shape, invasion depth, location (longitudinal), boundary, BMI, and age. As a variable’s value approaches 1, the likelihood of a patient having a difficult procedure increases. For example, the red dots in the SHAP plot corresponding to tumors ≥ 3.0cm are predominantly located on the right side of the zero axis, indicating that patients with tumors larger than 3.0cm are more likely to experience a difficult procedure. As shown in Table 2, the GBM algorithm outperformed the DL, DRF, and GLM algorithms in the validation cohort regarding AUC, with a higher value of 0.894 compared to 0.881, 0.858, and 0.854, respectively. Furthermore, the accuracy values for the GBM algorithm were the highest compared to the DL, DRF, and GLM algorithms, with 0.935, 0.870, 0.854, and 0.878, respectively. Among these 5 models, the DRF model has the highest sensitivity, with values of 1.000 in both the validation and test sets, but the lowest specificity, with values of 0.847 and 0.862, respectively. The LASSO model has the lowest sensitivity in both the validation and test sets, with values of 0.739 and 0.556, respectively. The DL and GLM models have intermediate performance in terms of AUC, sensitivity, specificity, and accuracy among these models. A LIME plot based on the GBM model for the test cohort showcased the impact of various significant variables on the difficulty for ER of gGISTs. For example, based on the GBM model, Figure 5 demonstrates that case 2 had a predicted probability of 0.94 for experiencing a difficult procedure. Tumor size greater than 3.0cm was identified as the most critical predictor for difficult procedures, followed by irregular tumor shape, invasion depth beyond MP, history of alcohol consumption, and tumor location in the upper third of the stomach. Conversely, the effect of the experienced endoscopist and male gender had a mitigating effect on these factors.

TABLE 2

Table 2 Comparison of AutoML models and logistic regression analysis in predicting the difficulty for ER of gGISTs in the validation cohort.

FIGURE 3

Figure 3 Variable importance of the GBM model in the training cohort, showing that tumor size was the most important feature, followed by endoscopists’ experience (CUSUM), invasion depth, etc.

FIGURE 4

Figure 4 SHAP of the GBM model in the training cohort. As a variable’s value approaches 1, the likelihood of a patient having a difficult procedure increases. SHAP, SHapley Additive exPlanation; GBM, gradient boost machine.

FIGURE 5

Figure 5 LIME of the GBM model in the test cohort. LIME, Local Interpretable Model Agnostic Explanation.

Discussion

This study aimed to evaluate the difficulty for ER of gGISTs using multiple ML algorithms. A total of 555 patients with gGISTs were retrospectively studied and assigned to a training, validation, and test cohort. Five algorithms were employed in building models, and the GBM model outperformed other models with an AUC of 0.894 in the validation cohort and 0.791 in the test cohort. The AutoML model based on the GBM algorithm can accurately predict the difficulty for ER of gGISTs before surgery, and tumor size and endoscopists’ experience were identified as the most prominent features that significantly impacted the performance of the AutoML model. This study provides a machine learning-based approach for accurately predicting the surgical difficulty for ER of gGISTs.

Accurate preoperative assessment of surgical difficulty is crucial to the success of the surgery and patient safety. By predicting the difficulty of the surgical procedure before surgery, surgeons can better prepare for the surgery, optimize the surgical plan, and ensure patient safety during the operation (27, 28). Su et al. (4) are the only ones who have predicted the difficulty for ER of gGISTs so far. Their study defined a difficult procedure as an operative time greater than 90 minutes or severe intraoperative bleeding. However, previous studies have suggested that conversion to laparoscopic or open surgery indicates difficult surgery (29–31) because it may increase operative time, blood loss, and postoperative recovery time, thereby increasing the risk to patients. Therefore, meeting one of the following criteria was used to define a difficult case in this study: operative time of 90 minutes or more, severe bleeding during the surgery, or the need to convert to laparoscopic resection or open surgery.

The SHAP analysis revealed that in our study, the most crucial feature of the GBM model is tumor size. The result agreed with the findings of the logistic regression model in our study and aligned with the risk factors for the endoscopic surgical difficulty reported in the literature (4, 32). According to the studies by Su et al. (4) and Jian et al. (32), ER was challenging for tumors larger than 3.0 cm in size. In treating gGISTs with larger tumor sizes, the limited operating space in the ER results in poorer functional space and surgical field of view. Consequently, endoscopists must frequently adjust the angle of the endoscopic incision and the volume of air in the stomach cavity to achieve complete tumor removal. Therefore, for gGISTs with larger tumor sizes, ER should be performed by experienced endoscopists, as this study found that surgical experience is also an essential factor affecting the difficulty of the procedure. Experienced endoscopists may have better technical proficiency and higher success rates, enabling them to adapt better to the surgical environment, accomplish surgical tasks more effectively, and reduce the incidence of surgical complications. Sun et al. (33) reported that the learning curve for ER of gastric submucosal tumors was approximately 32 cases, while Yoshida et al. (34) retrospectively analyzed the learning curve of 7 novice endoscopists in ER of gastric lesions and found that a stable state could be reached after completing around 30 cases. To account for potential variations in the learning curves of different endoscopists, a minimum threshold of 50 GIST excisions was established in this study to ensure that the endoscopists had adequate experience conducting ER for gGISTs. In this study, we divided tumor size into three groups: < 2.0cm, 2.0-3.0cm, and ≥ 3.0cm, and endoscopists’ experience into <50 cases and ≥50 cases. The larger the tumor, and the less experience the endoscopist has, the more difficult the surgery becomes. Therefore, we recommend that endoscopists lacking surgical experience should choose lesions with smaller diameters for surgical intervention.

We utilized five different ML algorithms to construct predictive models with high accuracy. Our models achieved superior AUC and accuracy compared to the nomogram model built by Su et al. (4). Furthermore, by accurately assessing the surgical difficulty for ER of gGISTs, this study can assist doctors in understanding potential challenges prior to surgery, thereby improving the success rate of the operation and patient safety. Additionally, this multi-center research boasts a larger sample size and higher external validity and reduces potential biases caused by the unique circumstances of a single research center. However, our study had some limitations. First, our study may have had selection bias and information bias due to its retrospective nature. Future research could use a prospective study design to more accurately evaluate the effectiveness of different ML algorithms in predicting the difficulty for ER of gGISTs. Second, this study did not consider the postoperative prognosis and patient recovery. Future research could incorporate these factors to comprehensively evaluate the clinical application value of ML models in predicting the difficulty for ER of gGISTs. Third, the advancements in novel medical devices and surgical techniques may affect the difficulty of ER, and the factors influencing surgical difficulty may also change. Therefore, it is crucial to keep pace with the latest developments when studying the difficulty for ER of gGISTs. Fourth, due to differences in procedural steps, the difficulty levels of ESD, EFTR, and STER endoscopic techniques may vary. Conducting more in-depth research on individual endoscopic techniques could aid in identifying and analyzing the specific difficulties associated with each technique. Fifth, due to the low prevalence of gGISTs, our validation cohort consisted of only 123 cases and the test cohort included 124 cases. Adding more samples later would be better.

In conclusion, our study evaluated the difficulty for ER of gGISTs using ML algorithms. The GBM model outperformed others, achieving high accuracy in predicting ER difficulty. Tumor size and endoscopists’ experience were identified as influential factors. The GBM-based AutoML model shows promise for preoperative assessment, but further validation on diverse datasets and consideration of new medical technologies are needed to enhance its clinical applicability.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the ethics committee of the first affiliated hospital of Soochow University. The patients/participants provided their written informed consent to participate in this study.

Author contributions

LL and RZ contributed equally to this work. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the Health personnel training project of Soochow (GSWS2020109), the Changshu Science and Technology Project (CS202116), the Changshu Science and Technology Program (Social Development) Project (CS202120), and the Changshu Clinical Medical Expert Team Introduction Project (CSYJTD202101).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1190987/full#supplementary-material

Supplementary Figure 1 | A chart showing the penalties for predictive factors indicating the level of difficulty for endoscopic resection of gGISTs was derived through LASSO regression analysis. Left: Regression coefficients. With the value of λ increasing, the absolute values of coefficients decrease. Right: Identification of the optimal λ value in the LASSO regression analysis was achieved by 5-fold cross-validation. (The left vertical line is drawn using the minimum criterion and the right vertical line is drawn using the 1_se criterion. In our study, LASSO regression model with minimum criterion was used in the univariate analysis in order to solve such multiple co-linear relationships among the explanatory variables. LASSO: least absolute shrinkage and selection operator; gGIST, gastric gastrointestinal stromal tumors.

Supplementary Figure 2 | Calibration curve of the LASSO model in the training, validation and test set, with the mean absolute errors being 0.021, 0.035 and 0.043, respectively.

Supplementary Figure 3 | Decision curve analysis of the LASSO model in the test set. The DCA plots demonstrated that when the threshold probability of a difficult procedure predicted by the LASSO model was between 20% and 100%, an intervention might add more benefit (10% - 80%).

Supplementary Figure 4 | Decision curve analysis plots of 4 AutoML models in the test set, indicating net benefits of around 80%. (A) DL model; (B) GBM model; (C) GLM model; (D) DRF model.

References

1. Akahoshi K, Oya M, Koga T, Shiratsuchi Y. Current clinical management of gastrointestinal stromal tumor. World J Gastroenterol (2018) 24(26):2806–17. doi: 10.3748/wjg.v24.i26.2806

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Tan Y, Tan L, Lu J, Huo J, Liu D. Endoscopic resection of gastric gastrointestinal stromal tumors. Transl Gastroenterol Hepatol (2017) 2:115. doi: 10.21037/tgh.2017.12.03

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Chen ZM, Peng MS, Wang LS, Xu ZL. Efficacy and safety of endoscopic resection in treatment of small gastric stromal tumors: a state-of-the-art review. World J Gastrointest Oncol (2021) 13(6):462–71. doi: 10.4251/wjgo.v13.i6.462.

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Su W, Wang M, Zhang D, Zhu Y, Lv M, Zhu L, et al. Predictors of the difficulty for endoscopic resection of gastric gastrointestinal stromal tumor and follow-up data. J Gastroenterol Hepatol (2022) 37(1):48–55. doi: 10.1111/jgh.15650

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med (2018) 284(6):603–19. doi: 10.1111/joim.12822

PubMed Abstract | CrossRef Full Text | Google Scholar

6. MacEachern SJ, Forkert ND. Machine learning for precision medicine. Genome (2021) 64(4):416–25. doi: 10.1139/gen-2020-0131

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Rauschert S, Raubenheimer K, Melton PE, Huang RC. Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification. Clin Epigenetics. (2020) 12(1):51. doi: 10.1186/s13148-020-00842-4

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kabade V, Hooda R, Raj C, Awan Z, Young AS, Welgampola MS, et al. Machine learning techniques for differential diagnosis of vertigo and dizziness: a review. Sensors (Basel). (2021) 21(22):7565. doi: 10.3390/s21227565

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Tran KA, Kondrashova O, Bradley A, Williams ED, Pearson JV, Waddell N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med (2021) 13(1):152. doi: 10.1186/s13073-021-00968-x

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Campagnini S, Arienti C, Patrini M, Mannini A, Carrozza MC. Machine learning methods for functional recovery prediction and prognosis in post-stroke rehabilitation: a systematic review. J Neuroeng Rehabil. (2022) 19(1):54. doi: 10.1186/s12984-022-01032-4

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Lynch CM, Abdollahi B, Fuqua JD, de Carlo AR, Bartholomai JA, Balgemann RN, et al. Prediction of lung cancer patient survival via supervised machine learning classification techniques. Int J Med Inform. (2017) 108:1–8. doi: 10.1016/j.ijmedinf.2017.09.013

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Leite D, Martins A Jr, Rativa D, De Oliveira JFL, Maciel AMA. An automated machine learning approach for real-time fault detection and diagnosis. Sensors (Basel). (2022) 22(16):6138. doi: 10.3390/s22166138

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Puri M. Automated machine learning diagnostic support system as a computational biomarker for detecting drug-induced liver injury patterns in whole slide liver pathology images. Assay Drug Dev Technol (2020) 18(1):1–10. doi: 10.1089/adt.2019.919

PubMed Abstract | CrossRef Full Text | Google Scholar

14. von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke J. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Ann Intern Med (2007) 147:573–7. doi: 10.7326/0003-4819-147-8-200710160-00010

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Pimentel-Nunes P, Libânio D, Bastiaansen BAJ, Bhandari P, Bisschops R, Bourke MJ, et al. Endoscopic submucosal dissection for superficial gastrointestinal lesions: European society of gastrointestinal endoscopy (ESGE) guideline - update 2022. Endoscopy (2022) 54(6):591–622. doi: 10.1055/a-1811-7025

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Huberty V, Verset L, Deviere J. Endoscopic full-thickness resection of a gastric GI stromal tumor. VideoGIE (2019) 4(3):120–2. doi: 10.1016/j.vgie.2018.11.003

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Zhang X, Modayil R, Criscitelli T, Stavropoulos SN. Endoscopic resection for subepithelial lesions-pure endoscopic full-thickness resection and submucosal tunneling endoscopic resection. Transl Gastroenterol Hepatol (2019) 4:39. doi: 10.21037/tgh.2019.05.01

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Perez Valdivieso JR, Bes-Rastrollo M. Concerns about the validation of the Berlin questionnaire and American society of anesthesiologist checklist as screening tools for obstructive sleep apnea in surgical patients. Anesthesiology (2009) 110(1):194. doi: 10.1097/ALN.0b013e318190bd8e

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Joensuu H. Risk stratification of patients diagnosed with gastrointestinal stromal tumor. Hum Pathol (2008) 39(10):1411–9. doi: 10.1016/j.humpath.2008.06.025

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Oda I, Suzuki H, Nonaka S, Yoshinaga S. Complications of gastric endoscopic submucosal dissection. Dig Endosc. (2013) 25 Suppl 1:71–8. doi: 10.1111/j.1443-1661.2012.01376.x

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Saito I, Tsuji Y, Sakaguchi Y, Niimi K, Ono S, Kodashima S, et al. Complications related to gastric endoscopic submucosal dissection and their managements. Clin Endosc. (2014) 47(5):398–403. doi: 10.5946/ce.2014.47.5.398

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Li R, Cai S, Sun D, Shi Q, Ren Z, Qi Z, et al. Risk factors for delayed bleeding after endoscopic submucosal dissection of colorectal tumors. Surg Endosc. (2021) 35(12):6583–90. doi: 10.1007/s00464-020-08156-5

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Muro T, Higuchi N, Imamura M, Nakagawa H, Honda M, Nakao K, et al. Post-operative infection of endoscopic submucosal dissection of early colorectal neoplasms: a case-controlled study using a Japanese database. J Clin Pharm Ther (2015) 40(5):573–7. doi: 10.1111/jcpt.12313

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Hermanek P, Wittekind C. The pathologist and the residual tumor (R) classification. Pathol Res Pract (1994) 190(2):115–23. doi: 10.1016/S0344-0338(11)80700-4

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Nohara Y, Matsumoto K, Soejima H, Nakashima N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed (2022) 214:106584. doi: 10.1016/j.cmpb.2021.106584

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Neves I, Folgado D, Santos S, Barandas M, Campagner A, Ronzio L, et al. Interpretable heartbeat classification using local model-agnostic explanations on ECGs. Comput Biol Med (2021) 133:104393. doi: 10.1016/j.compbiomed.2021.104393

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ausania F, Borin A, Martinez-Perez A, Blasi A, Landi F, Colmenero J, et al. Development of a preoperative score to predict surgical difficulty in liver transplantation. Surgery (2022) 172(5):1529–36. doi: 10.1016/j.surg.2022.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Kosaka H, Satoi S, Kono Y, Yamamoto T, Hirooka S, Yamaki S, et al. Estimation of the degree of surgical difficulty anticipated for pancreatoduodenectomy: preoperative and intraoperative factors. J Hepatobiliary Pancreat Sci (2022) 29(11):1166–74. doi: 10.1002/jhbp.1052

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Hazama H, Tanaka M, Kakushima N, Yabuuchi Y, Yoshida M, Kawata N, et al. Predictors of technical difficulty during endoscopic submucosal dissection of superficial esophageal cancer. Surg Endosc. (2019) 33(9):2909–15. doi: 10.1007/s00464-018-6591-4

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Tanaka S, Yoshizaki T, Yamamoto Y, Ose T, Ishida T, Kitamura Y, et al. The risk scoring system for assessing the technical difficulty of endoscopic submucosal dissection in cases of remnant gastric cancer after distal gastrectomy. Surg Endosc. (2022) 36(2):1482–9. doi: 10.1007/s00464-021-08433-x

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Alberici L, Paganini AM, Ricci C, Balla A, Ballarini Z, Ortenzi M, et al. Development and validation of a preoperative “difficulty score” for laparoscopic transabdominal adrenalectomy: a multicenter retrospective study. Surg Endosc. (2022) 36(5):3549–57. doi: 10.1007/s00464-021-08678-6

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Jian G, Tan L, Wang H, Lv L, Wang X, Qi X, et al. Factors that predict the technical difficulty during endoscopic full-thickness resection of a gastric submucosal tumor. Rev Esp Enferm Dig. (2021) 113(1):35–40. doi: 10.17235/reed.2020.7040/2020

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Sun C, Zheng Z, Wang B. Learning curve for endoscopic submucosal dissection of gastric submucosal tumors: is it more difficult than it may seem? J Laparoendosc Adv Surg Tech A. (2014) 24(9):623–7. doi: 10.1089/lap.2014.0122

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Yoshida M, Kakushima N, Mori K, Igarashi K, Kawata N, Tanaka M, et al. Learning curve and clinical outcome of gastric endoscopic submucosal dissection performed by trainee operators. Surg Endosc. (2017) 31(9):3614–22. doi: 10.1007/s00464-016-5393-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: automated machine learning, predictive models, endoscopic resection, gastrointestinal stromal tumors, difficulty

Citation: Liu L, Zhang R, Shi D, Li R, Wang Q, Feng Y, Lu F, Zong Y and Xu X (2023) Automated machine learning to predict the difficulty for endoscopic resection of gastric gastrointestinal stromal tumor. Front. Oncol. 13:1190987. doi: 10.3389/fonc.2023.1190987

Received: 21 March 2023; Accepted: 26 April 2023;
Published: 10 May 2023.

Edited by:

Qun Zhang, Nanjing Medical University, China

Reviewed by:

Gwang Ha Kim, Pusan National University, Republic of Korea
Jie Shen, Nanjing Drum Tower Hospital, China
Zequn Li, The Affiliated Hospital of Qingdao University, China

Copyright © 2023 Liu, Zhang, Shi, Li, Wang, Feng, Lu, Zong and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yang Zong, zongy_0316@aliyun.com; Xiaodan Xu, xxd20@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.