Introduction

Pancreatic neuroendocrine tumors (PNETs) constitute a heterogeneous category of infrequent neoplasms that arise within the diffuse neuroendocrine system of the pancreas1. These tumors possess the capacity to undergo a gradual metamorphosis from a benign to a malignant state. The prevalence rates of PNETs have escalated to an impressive 10% among all pancreatic tumors2. Presently, as the National Comprehensive Cancer Network (NCCN) guidelines (V1.2015), the surgical excision of PNETs exceeding 2 cm in size is advocated3. Nevertheless, a contentious debate persists concerning the role of surgery in the management of small non-functional pancreatic neuroendocrine tumors (NF-PNETs), particularly those characterized by a well-differentiated nature4.

The Chinese guidelines for the diagnosis and treatment of pancreatic neuroendocrine tumors propose that when PNETs measure ≤ 2 cm in size, remain localized within the pancreas, exhibit no signs of vascular invasion, and display a proportion of HPF and Ki-67-positive cells of less than or equal to 2%, they can be deemed as benign tumor. Nonetheless, fulfilling this criterion frequently necessitates confirmation through postoperative pathological assessment5. However, the attainment of this criterion commonly necessitates the confirmation of postoperative pathology. As a result, preoperative diagnostic methods such as imaging fail to comprehensively identify aggressive tumors measuring less than 2 cm in size.

This outcome contributes to the escalation of uncertainty surrounding the necessity of surgical intervention. Several studies propose that NF-PNETs may not require immediate surgical intervention but rather necessitate diligent surveillance6,7,8. However, it is important that these studies have relatively limited median follow-up durations, thus leaving the long-term outcomes in doubt. Conversely, some research endeavors have demonstrated that surgical intervention augments the overall survival rates of NF-PNETs8,9,10. Although surgical resection remains the exclusive curative modality for NF-PNETs, pancreatic resections pose notable perils in terms of morbidity and mortality. While the risks associated with pancreatic surgery may be deemed justifiable for symptomatic hormone-producing PNETs and larger tumors, the optimal approach to managing NF-PNETs is less straightforward. It is important to recognize that pancreatectomy carries a significant risk of complications and death, and thus, the potential benefits of surgical removal for survival in the case of NF-PNETs remain uncertain.

This study endeavors to investigate whether surgical excision of such tumors can enhance the survival duration of patients afflicted with NF-PNETs. Furthermore, propensity scores were employed to evaluate the impact of other factors on the surgical outcomes.

Method

Data source

The data were extracted from the Surveillance, Epidemiology, and End Results (SEER, https://seer.cancer.gov) database using SEER*Stat software Version 8.3.6. Data from patients with Pancreatic neuroendocrine tumors (PNETs) diagnosed in 2004–2015 who had complete information including age, sex, race, primary site, grade, marital status, T stage, N stage, tumor size, histology were included in the study.

The surveillance, epidemiology, and results (SEER) database (https://seer.cancer.gov/seerstat), which contains prospectively collected data on demographics, lesion, the first course of treatment, and survival of all cancer patients from state cancer registries across the United States, was utilized for case extraction.

Inclusion criteria

  1. 1.

    Diagnosed with PNETs as defined by the International Classification of Disease codes (ICD-O-3): (8150/3: Pancreatic endocrine tumor; 8240/3: Carcinoid tumor; 8246/3: Neuroendocrine carcinoma and 8249/3: Atypical carcinoid tumor) and site code (C25.0-C25.4 and C25.7-C25.9) in 2004–2015. The follow up time was 5 years.

  2. 2.

    Patients with tumors sizes ≤ 2 cm (T1) without regional lymph node involvement (N0) or distant metastasis (M0).

  3. 3.

    Based on grade, patients with tumor moderately well-differentiated and well-differentiated were included

  4. 4.

    Patients had completed follow-ups.

Exclusion criteria

  1. 1.

    Patients diagnosed by the International Classification of Disease codes (ICD-O-3) not (8150/3: Pancreatic endocrine tumor; 8240/3: Carcinoid tumor; 8246/3: Neuroendocrine carcinoma and 8249/3: Atypical carcinoid tumor) were excluded.

  2. 2.

    Patients with poorly differentiated (grade 3) or undifferentiated/anaplastic (grade 4) tumors were excluded.

  3. 3.

    Patients with multiple cancers and autopsy/death certificate only and missing staging data were also excluded; no patients were found to have missing receipt of surgery data.

Definition of variables and endpoint

Our analysis involved extracting clinicopathologic variables such as age at diagnosis, race (categorized as white, Black, and other), sex (male or female), tumor differentiation grade (well-differentiated or moderately differentiated), tumor size, marital status (categorized as married, single, or unknown), and primary tumor size (categorized as head, body, tail, or other). The predictor variable was receipt of surgery, defined by codes 25–90 in the RX Sum Surg Prim Site 1998. The primary outcome variable was overall survival (OS). Overall survival was calculated from the date of disease detection until the occurrence of death from any cause.

Statistical analysis

Crude analysis

The basic clinical characteristics of patients in the surgical and non-surgical groups were compared using either the t test or chi-square test. The continuous predictor, age, was categorized following an assessment using restricted cubic splines. This approach was employed to relax the assumption of a linear relationship between age and the risks of death11. Cox multivariate regression analysis was conducted to examine the association between variables and overall survival (OS), yielding hazard ratios (HR) and 95% confidence intervals. The clinicopathological features of patient survival were assessed through Kaplan–Meier curves and compared using the log-rank test.

Propensity-score analyses

To help account for the nonrandomized treatment administration of surgery, we used propensity-score methods to reduce confounding effects. The individual propensities for receiving surgical treatment were estimated using a multivariable cox regression model that included the same covariates as the Cox regression model. Associations between surgery treatment and death were then evaluated by multivariable Cox regression models using two propensity-score methods.

Inverse probability weighting

The primary analysis employed inverse probability weighting. In this analysis, the propensity-score model12 was utilized to estimate predicted probabilities, which were then used to compute stabilized inverse probability weighting weights13. Kaplan–Meier curves and Cox models incorporating the inverse probability weighting weights were utilized in the analysis.

Propensity-score matching

We conducted a secondary analysis that used propensity-score matching. Patients in the surgical and non-surgical groups underwent 1:1 Propensity Score Matching (PSM) using the PSM method. In order to alleviate selection bias, confounding factors in non-randomized studies were harmonized in a manner akin to randomization. The matching factors included sex, age, ethnicity, tumor location, TNM stage, and tumor grade. In the propensity-score matching analysis, the greedy matching method was applied to create a matched control sample, the effect value was 0.05.

Sensitivity analysis

We used sensitivity analysis in propensity-score matching results. Sensitivity analysis proceeds as follows. First, a series of G values starting 1 is selected. Then, the Wilcoxon signed-rank test (for continuous outcomes) or the McNemar's test (for binary outcomes) is conducted for each G. Lastly, the upper bound of the P value of the test statistic is inspected for changes over the series of G values. The treatment effect estimate is considered sensitive if a small increase in G from 1 (no bias to small bias) changes its significance value from significant (e.g., P < 0.05) to non-significant (e.g., P > 0.05). If statistical significance does not change until a very large G, then the effect estimate is considered very robust to hidden bias14. Cox multivariate regression analysis was conducted using the R software (version 3.6.1 R Project for Statistical Computing) to enable an analysis of overall survival (OS). For each analysis, P ≤ 0.05 is the threshold for significance.

Result

Clinicopathologic characteristics of patients

A total of 1102 patients diagnosed with NF-PNETs were identified between the years 2004 and 2015. The baseline characteristics of these patients are presented in Table 1. Among them, 965 patients (87%) underwent surgical treatment, while 137 patients (13%) did not undergo surgical intervention. A secondary analysis was conducted, employing propensity-score matching. The greedy matching method was employed to generate a matched control sample within the propensity-score matching analysis, as presented in Table 1.

Table 1 Characteristics of stage I PNETs (n = 1102).

Crude analysis

We then used univariate and multivariate analysis to evaluate the predictive value of the included variables (Table 2). Using the restricted cubic spline method, we defined older patients as those aged older than 65 years, while younger patients were classified as those aged 65 years and younger (Supplement Fig. 1). In our analysis, we observed that surgical treatment exhibited a protective effect [HR = 0.41, 95% CI (0.26–0.65), P < 0.001], indicating an increased overall survival rate. Conversely, factors such as age > 65 [HR = 3.99, 95% CI (2.66–5.98), P < 0.001], male gender [HR = 1.82, 95% CI (1.23–2.68), P = 0.003], and moderate grade [HR = 1.71, 95% CI (1.04–2.80), P = 0.034] were associated with a decrease in the patient's survival time. Furthermore, in the multivariate analysis, male gender [HR = 1.73, 95% CI (1.15–2.61), P = 0.009], moderate grade [HR = 0.53, 95% CI (0.33–0.84), P = 0.007], and old [HR = 3.52, 95% CI (2.33–5.31), P < 0.001] were identified as influential predictors with higher hazard ratios. However, surgical treatment remained a significant factor associated with increased overall survival [HR = 0.53, 95% CI (0.33, 0.84), P = 0.007] in the multivariate analysis (Table 2).

Table 2 Cox proportional hazards regression model for overall survival.

Propensity score analysis

Propensity scores were employed and the matching variables were visualized. We observed a substantial overlap in the distribution of scores between patients in the surgery group and those in the observation group (Fig. 1A). Following the matching process, each patient's score demonstrated a tendency towards consistency, with no instances of mismatch (Fig. 1B). Univariate and multivariate analysis after matching is shown in Table 3. In univariate analysis, we observed that surgical treatment exhibited a protective effect [HR = 0.44, 95% CI (0.22–0.85), P = 0.015]. Conversely, factors such as old [HR = 2.52, 95% CI (1.25–5.08), P = 0.01] and male gender [HR = 2.02, 95% CI (1.04–3.91), P = 0.037] were associated with a decrease in the patient's survival time. Furthermore, in the multivariate analysis, the old patients [HR = 2.15, 95% CI (1.05–4.41), P = 0.037] were identified as influential predictors with higher hazard ratios. However, surgical treatment remained a significant factor associated with increased overall survival (HR = 0.41,95% CI (0.20–0.82), P = 0.011) in the multivariate analysis (Table 3). In general, advanced age and the implementation of surgical intervention are predictive factors for patient survival.

Figure 1
figure 1

Propensity-score step. (A) The propensity score distribution: the frequency distribution of the surgical cohort and the observation cohort, with the surgical group depicted in shades of green and the observation group represented by hues of red. (B) Jitter plot of propensity score: each circle symbolizes an individual's propensity score (PS), with the size of the circle indicating their weight. The absence of circles in the highest and lowest strata signifies the absence of unmatched cases within each subgroup.

Table 3 Cox proportional hazards regression model for overall survival after PSM.

Given the influence of the propensity score on reducing the patient count in the observation group, we employed the inverse probability weighting method. By augmenting the number of patients in the observation group, we ensured consistency with the number of patients in the surgical group. Following this adjustment, we conducted univariate regression analysis to investigate the impact of surgical factors. In the univariate analysis of surgical treatment, both the weight group [HR = 0.48, 95% CI (0.29–0.78), P = 0.003] and matched group [HR = 0.44, 95% CI (0.22–0.85), P = 0.015] demonstrated a protective effect, as presented in Table 4. These findings consistently identified surgical treatment as a protective factor.

Table 4 Associations between surgical treatment and the death in the crude analysis, univariate analysis, and propensity-score analyses.

Survival analysis

The survival analysis which including crude analysis, propensity-score matching analysis and Inverse probability weighting analysis, identified that the surgical treatment could increase the overall time (log-rank, P < 0.05) (Fig. 2A–C). In order to evaluate surgical outcomes across different age groups, subgroup analyses were conducted. The overall survival (OS) results are illustrated in Fig. 2D–F. Among the various subgroups analyzed, it was observed that younger individuals who underwent surgery experienced the most significant improvements in survival. Conversely, older individuals who did not undergo surgery exhibited the poorest prognosis in terms of survival.

Figure 2
figure 2

Overall survival analysis of PNETs patients in different method group: (A) unmatch; (B) propensity-score match; (C) inverse probability weighting. Kaplan–Meier estimate of overall survival by subgroup analysis: (D) unmatch; (E) propensity-score match; (F) inverse probability weighting.

Sensitivity analysis

The results displayed in Supplement Table 1 indicate that the treatment effect, in terms of improving survival time, remains statistically significant even under the assumption of no hidden bias (Gamma = 1), as evidenced by the upper bound of the P value being less than 0.001. Furthermore, this statistical significance persists even if the treated individuals were up to four times more likely to receive the treatment than their matched counterparts, as indicated by the upper bound of the P value being 0.0048 at Gamma = 3. In other words, the treatment effect estimate for this outcome is robust to hidden bias from unmeasured covariates up to Gamma3.

Discussion

The rarity of small non-functional pancreatic neuroendocrine tumors (NF-PNETs) poses a challenge in accurately assessing their prognosis. Moreover, the treatment of NF-PNETs is complex and lacks consistency, which further contributes to the unpredictability of optimal treatment and prognostic factors. However, the SEER database offers researchers a valuable resource with a large sample size, enabling the identification of factors associated with patient survival. This is particularly advantageous when studying rare tumors, especially when utilizing surgical data. The present study utilized propensity score matching (PSM) and multivariate regression analysis, which revealed that surgery positively impacted overall survival (OS) in patients with SN-PNETs.

Some guidelines suggested surgery as the default management strategy for small NF-PNETs15,16, and several studies have demonstrated the benefit of surgery on OS for small, non-functional PNETs4,9. For example, a population-based survey of data from the National Cancer Database examined 1854 patients with nonfunctioning PNETs < 2 cm. This study found that monitored patients had nearly three times the risk of mortality compared with those who underwent surgical treatment10. Meanwhile, the meta-analysis of 714 patients showed that surgical treatment of non-functional PNETs smaller than 2 cm was improved os rates17. In our study, the survival analysis showed that surgical treatment could improve overall survival time. However, some studies showed adverse pinons, for example, in a study reviewing 464 patients to observe a patient who has finally developed metastases or died from nonfunctioning PNETs < 3 cm. This study reported that observation was not an increase in death after a median follow-up of 44 months8. Another study showed that nonoperative management might be advocated when serial imaging demonstrates minimal or no growth without suspicious features7.

In our initial analysis, we identified sex, age, race, and pathological grade as risk factors influencing patient survival. However, even after propensity score matching, age remained a significant risk factor for survival. Traditionally, small non-functional pancreatic neuroendocrine tumors (NF-PNETs) have been considered relatively slow-growing tumors, typically occurring in younger patients, and are generally associated with a favorable prognosis following surgical resection. Our study further demonstrates that younger patients who undergo surgery exhibit the highest survival rates. Pancreatic surgery is a more invasive procedure and is not commonly recommended for older patients. In the surgical group, a higher proportion of younger patients were observed, reflecting the consideration of the surgeon in weighing the benefits and drawbacks of performing surgery in elderly patients, often presenting with symptoms such as tumor obstruction of the pancreatic duct or abdominal pain. Notably, in the older age group, the survival rates were better in the surgery group compared to the observation group. These findings suggest that small non-functioning endocrine tumors may pose a greater risk to long-term survival compared to potential damage to pancreatic function resulting from surgery. Therefore, based on our study, we recommend surgical intervention for small non-functional endocrine tumors to improve patient outcomes.

This study had certain limitations. Firstly, the SEER database lacked specific information on surgical details such as the types of surgeries performed and their timing. Secondly, important factors like Ki-67 and mitotic index, which are critical for tumor classification in the SEER database, were not taken into account in the tumor classification used in this study. Additionally, several significant prognostic factors including disease-free survival, microvascular invasion, vascular resection, R0/R1 resection, anti-tumor immune response, organ specificity, and peptide receptor radionuclide therapy were not considered. The surgical management of neuroendocrine tumors involves various approaches such as radical excision, debulking, and palliative resection, each carrying its own implications. Due to limited data, these procedures were not evaluated in our study, potentially introducing bias. The nature of surgeries also varied, including cases involving obstruction or bleeding, therapeutic excision, palliation, or unmeasured confounding factors, which could introduce bias when using propensity score matching (PSM). Lastly, the SEER database did not provide information on comorbidities, thus precluding an assessment of their impact on overall survival outcomes, which could introduce bias.

Conclusion

Patients diagnosed with small, non-functioning pancreatic neuroendocrine tumors who undergo surgical intervention exhibit improved overall survival (OS) outcomes. Therefore, surgery is strongly recommended for this patient population.