Introduction

Primary liver cancer is the sixth most common malignant tumor and the fourth leading cause of cancer-related deaths worldwide [1, 2]. In 2020, 907,100 new cases of primary liver cancer were reported, and 8.3% of patients died of it [3]. Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer [4]. It usually occurs in those patients who suffer from chronic inflammation and fibrosis caused by viral hepatitis, alcohol, and metabolic-related fatty liver disease [5]. HCC’s 5-year overall survival (OS) rate is often less than 20% [6]. Tumor resection is the most effective treatment for early stage HCC, but the recurrence and metastasis rates are high and the prognosis of patients with HCC is usually poor [7, 8].

Improvements in the treatment of chronic liver diseases and the extension of life expectancy have resulted in an increase in the number of elderly patients with HCC. Although the elderly constitute more than a third of HCC patients, they have not been adequately represented in treatment and prognosis studies. Thus, there is not enough evidence to guide the treatment of such patients [9, 10]. The incidence rate of HCC is estimated to increase by approximately 59% by 2030, whereas individuals aged 65 or above are expected to constitute over 50% HCC patient by then [11].

Nomogram is considered a widely used predictive model for evaluating the prognosis of cancer patients [12,13,14]. In this study, we aim to identify the prognostic factors of elderly patients with HCC and construct a new prognostic model for predicting their OS, which would facilitate the provision of therapy suggestions and assist clinical decision-making.

Materials and methods

Subject selection

The Surveillance, Epidemiology, and End Results (SEER) is a dominant cancer statistics database in the United States (US) [15]. The database contains the diagnosis, treatment, and survival data of millions of cancer patients in US and other countries. The data regarding HCC patients during 2010–2015 were selected and handled by SEER*Stat 8.3.9 (https://seer.cancer.gov/) in this study. The inclusion and exclusion criteria are outlined in Supplementary Fig. 1. To be included, the patients must match the criteria: (1) diagnosed with HCC between 2010 and 2015 and (2) 65 years or older. Contrarily, we excluded: (1) patients for whom liver cancer was not their first primary tumor, (2) those with unknown alpha-fetoprotein (AFP) level, (3) those without follow-up time and (4) those with missing survival data. Moreover, we retrospectively enrolled elderly patients diagnosed with HCC to construct the external validation set between 2008 and 2017 from the First Affiliated Hospital of Zhejiang University School of Medicine in light of the selection criteria. The protocol of this research was approved by the Ethics Committee of the First Affiliated Hospital of Zhejiang University School of Medicine (Ethical approval No. IIT20230048B).

Variables selection

Fourteen variables were selected in this study: age at diagnosis, race, sex, marital status, T stage, N stage, surgery, radiotherapy, chemotherapy, tumor size, AFP level, bone metastasis, lung metastasis, and grade. Fibrosis score (FS) was also adopted in this study. Surgery was divided into five categories: no surgery, local destruction (LD), segmental resection (SR), larger resection (LR), and liver transplantation (LT). For LD patients, there exists a large range of therapeutic options including photodynamic therapy (PDT), electrocautery / fulguration (includes the use of hot forceps for tumor destruction), cryosurgery, laser, alcohol (percutaneous ethanol injection [PEI]), heat-radio-frequency ablation (RFA) and other methods (e.g., ultrasound, acetic acid). The above therapeutic options have been defined in SEER database. Considering that age and tumor size were continuous variables, X-tile was used to determine the optimal cutoff values for them [16]. The results showed that the best cutoff values for age were 74 and 80, whereas the best cutoff values for tumor size were 5.6 and 8.5 cm. OS was the outcome of the model proposed in this study, which was defined as the time from randomization until death from any cause or the date of the last follow-up.

Statistical analysis

Statistical analysis was performed using SPSS 22.0 and R (version 4.0.2, Vienna, Austria). Values were considered statistically significant at p < 0.05. Univariate cox regression analysis and multivariate cox regression analysis were adopted to identify independent prognostic factors. The risk ratio (HR) and 95% confidence interval (CI) were used to show the impact of the variables on the patients’ survival. A nomogram was constructed based on these independent prognostic factors. The discriminatory values of the models were evaluated based on the concordance index (C-index). The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was used to evaluate the prognostic accuracy of the nomogram. Calibration curve was generated to evaluate the calibration of the nomogram. In addition, a decision curve analysis (DCA) was performed to demonstrate the clinical benefit of the nomogram [17]. To streamline the power calculation estimation, we produced PowerTools—an interactive open-source web application that was written in R using the Shiny framework (http://www.shinyapps.io/).

In addition, we developed a risk stratification model based on the total score of each patient, as calculated by the nomogram. Then, X-Tile was used to determine the best cutoff value to divide patients into low-risk, intermediate-risk, and high-risk groups. Kaplan–Meier curves and log-rank tests were used to analyze and compare the OS of patients in different subgroups.

Results

Patient characteristics

A total of 2,721 patients extracted from the SEER database were divided into a training set (N = 1904) and an internal validation set (N = 817). The chi-square test showed that there was no significant difference between the two sets (Table 1). In the training set, 58.4% (1112/1904), 27.2% (518/1,904), and 14.4% (274/1904) of the patients were aged < 74, 74–80, and > 80, respectively. In addition, 52.0% (990/1904), 21.8% (415/1904), and 26.2% (499/1904) of the patients’ tumor sizes were < 5.6 cm, 5.6–8.5 cm, and > 8.5 cm, respectively. Furthermore, 62.9% (1197/1904) of the patients had elevated AFP levels, whereas 37.1% (707/1,904) of them had normal AFP levels. Patient characteristics of the external validation set (N = 101) was listed in Supplementary Table 1.

Table 1 Baseline: clinicopathological characteristics of the subjects

Univariate and multivariate analysis for prognosis

To identify OS-related variables, 15 variables were used in the univariate cox analysis. The univariate cox regression analysis showed that all variables were statistically significant (p < 0.05). The multivariate cox analysis indicated that higher age, sex (male), being unmarried, higher T stage, higher N stage, no surgery, no chemotherapy, larger tumor size, elevated AFP, higher fibrosis score, bone metastasis, lung metastasis, and higher grade were all independently associated with poor OS of HCC patients, as shown in Table 2 and Fig. 1.

Table 2 Univariate and multivariate analyses for overall survival of the subjects
Fig. 1
figure 1

Forest plot showing the results of multivariate analysis for overall survival

Prognostic nomogram model

A nomogram was constructed based on all independent OS-related factors from the training set (as shown in Fig. 2), and then scores were assigned to the clinical variables in each subgroup (as shown in Table 3). Meanwhile, the time-dependent ROC curves showed that the AUC values at 1, 3, and 5 years were 0.823(95%CI 0.803–0.845), 0.813 (95%CI 0.790–0.837), and 0.839 (95%CI 0.806–0.872), respectively. This suggested favorable discrimination of the nomogram (as shown in Fig. 3). In the internal validation set, the AUC values at 1, 3, and 5 years were 0.847 (95%CI 0.818–0.876), 0.844 (95%CI 0.812–0.876), and 0.800 (95%CI 0.751–0.849), respectively. In the external validation set, the AUC values at 1, 3, and 5 years were 0.732 (95%CI 0.521–0.943), 0.780 (95%CI 0.674–0.887), and 0.821 (95%CI 0.727–0.914), respectively (as shown in Supplementary Fig. 2).

Fig. 2
figure 2

Nomogram for predicting 1-year, 3-year and 5-year overall survival of older HCC patients

Table 3 Scores of clinical variables in each subgroup
Fig. 3
figure 3

Receiver operating characteristic (ROC) curves of nomogram in training set A & internal validation set B

Moreover, the calibration curves (bootstraps = 1000) of the training set and internal validation set indicated that the nomogram had a strong calibration (as shown in Fig. 4). In addition, we compared the clinical practicability of the nomogram and the TNM staging system using DCA. The results indicated that the nomogram had a better clinical benefit and a larger threshold probability range, which confirmed that the nomogram can be used as an effective tool in clinical practice (as shown in Fig. 5).

Fig. 4
figure 4

Calibration curves of nomogram at 1 year, 3 years, and 5 years for training set A and internal validation set B

Fig. 5
figure 5

Decision curve analysis at 1 year, 3 years, and 5 years for training set A–C and internal validation set D–F

Risk stratification model

Based on the constructed prognostic nomogram model, the patients were divided into the low-risk group (1,553/2,721, 57.1%; total score < 83), intermediate-risk group (846/2,721, 31.1%; total score 83–113), and high-risk group (322/2,721, 11.8%; total score > 113). The results of the Kaplan–Meier survival analysis with log-rank tests showed that there were different survival patterns among patients in the three groups. The prognosis of patients in the low-risk group was significantly better than that of those in the high-risk group (p < 0.0001) (as shown in Fig. 6).

Fig. 6
figure 6

Survival curves showed the survival status classified by our nomogram of the training set A and internal validation B in HCC patients

Risk stratification for subgroup analysis

Although the constructed nomogram model worked well in the training and internal validations, its effectiveness in the subgroups was unclear. Therefore, we divided these patients into different subgroups according to the age, tumor size, T stage, grade, AFP level and surgery, to further confirm the effectiveness of the nomogram. The results indicated that in both the training and internal validations, the risk stratification of all subgroups was statistically significant (p < 0.05). This implied that the nomogram was effective for the distinction of the prognosis of different subgroups of HCC patients (as shown in Figs. 7, 8, and 9).

Fig. 7
figure 7

Subgroup analysis of aged < 74, 74–80, and > 80 in training set A–C and internal validation set D–F Subgroup analysis of tumor size < 5.6 cm, 5.6–8.5 cm, and > 8.5 cm in training set G–I and internal validation set J–L

Fig. 8
figure 8

Subgroup analysis of T1–2 and T3–4 in training set A, B and internal validation set C, D; Subgroup analysis of grade I–II and grade III–IV in training set E, F and internal validation set G, H

Fig. 9
figure 9

Subgroup analysis of normal AFP and elevated AFP in training set A, B and internal validation set C, D Subgroup analysis of surgery and no surgery in training set E, F and internal validation set G, H

Stratification analysis of surgery

To further determine whether more radical surgeries should be taken for the older patients, we also performed surgical stratification analysis based on the univariate OS Cox model (age, T stage, tumor size, fibrosis score and grade). The results showed that LT had the best survival duration in the < 74 age band: HR 0.165 (95%CI 0.116–0.236), and unexpectedly, LR did not show better survival duration than SR in all ages ranges [< 74: HR 0.257 (95%CI 0.202–0.328) vs. HR 0.191 (95%CI 0.151–0.241); 74–80: HR 0.316 (95%CI 0.237–0.423) vs. HR 0.244 (95%CI 0.178–0.335); > 80: HR 0.322 (95%CI 0.205–0.507) vs. HR 0.169 (95%CI 0.092–0.312)]. Meanwhile, SR had the best survival duration in the 0–4 FS band: HR 0.215 (95%CI 0.132–0.351), LT had the best survival duration in the 5–6 FS band: HR 0.126 (95%CI 0.073–0.218) (as shown in Table 4).

Table 4 Stratification analysis of surgery

Moreover, a more detailed plot was conducted on OS. It was shown that both in training set and internal validation, the surgery group had a higher survival curve than the non-surgery group did, and LT had the highest survival curve (p < 0.0001) (as shown in Supplementary Fig. 3). LR had a better survival curve than LD but a worse one than SR and LT (p < 0.0001).

Online service of the constructed nomogram model

To provide easily obtained access of the proposed model, we developed an online website (https://juntaotan.shinyapps.io/dynnomapp_hcc/) to provide the service of nomogram model for medical staff. The provided function could automatically receive and calculate a patient’s survival probability. The scoring module enables the early identification of high-risk patients, which could further facilitate the appropriate treatment to prolong the survival time.

Discussion

A total of 2,822 older patients with HCC were selected in this study. The univariate and multivariate cox regression analysis successfully identified 13 independent prognostic factors including age, sex, marital status, T stage, N stage, surgery, chemotherapy, tumor size, AFP level, fibrosis score, bone metastasis, lung metastasis, and grade. The prognostic nomogram model was constructed based on these factors. The C-index, calibration curve, and DCA evaluated the model based on discrimination, calibration, and clinical usefulness, respectively. In addition, we constructed a risk stratification model based on the total score of each patient provided by the nomogram.

In 2020, through univariate and multivariate Cox analyses, Liu and colleagues developed a full age spectrum prognosis model that included 6 predictors to evaluate the prognosis of HCC patients [18]. However, we considered that it is impossible to accurately evaluate the prognosis of elderly patients with HCC by constructing a full age spectrum prognosis model of patients with HCC, because the clinical and pathological characteristics of young and elderly patients with HCC are different. Previous studies have shown that it is necessary to build a prognostic model to precisely assessing of the prognosis of elderly patients with HCC [19, 20]. He et al. showed that age, race, T stage, histological grade, surgery, radiotherapy, and chemotherapy were independent predictors of cancer-specific survival in elderly patients with HCC [21]. Building upon their research, we have introduced two more important indicators (AFP level and fibrosis score) to evaluate the prognosis of elderly patients with HCC. The value of AFP level and fibrosis score in evaluating the prognosis of patients with HCC has been widely verified [22,23,24]. In addition, we also enrolled patients from other center to perform external validation of the prognostic model built in this study. In conclusion, this study has constructed a more accurate and applicable prediction model for evaluating the prognosis of elderly patients with HCC.

Being older, male, and unmarried, and with bone metastasis and lung metastasis were independently associated with poor OS of HCC patients. These results were consistent with those of previous research [25, 26]. Elderly patients usually suffer from comorbidities, such as malnutrition, decreased immune function, and cognitive impairment, which limit: their treatment options and thereby results in a worse prognosis [27, 28]. The results indicated that the older the patient, the shorter their OS and the worse their prognosis. Interestingly, we found that the prognosis of unmarried patients was worse than the married did. This may be caused by the limited ability of affording the costs of continued treatment, which forces them to shorten their treatment. In addition, single patients are unable to receive the emotional support that a spouse would provide, which adversely affects their prognosis. Bone metastasis is a typical metastatic pattern in patients with HCC. Literatures reported that incidence rate of bone metastasis of HCC patients ranged from 3 to 20%, and it was on an upward trend [29, 30]. Although the management of HCC patients was improved in recent years, the prognosis of those with bone metastasis remains weak. This study found that the prognosis of HCC patients with bone metastases was worse than the ones without bone metastases (HR: 1.511; 95%CI 1.077–2.120).

Tumor size, T stage, N stage, AFP level, fibrosis score and grade were proved as independent prognostic factors of HCC patients. AFP level and tumor size have been used in many HCC prognoses models and have been proven to have good predictive ability and evaluation effects [31,32,33]. AFP is a glycoprotein synthesized from embryonic liver cells. It is the first tumor marker discovered in HCC and is widely used for the diagnosis of HCC. For example, Bai et al. found that AFP-elevated was associated with inferior survival compared with AFP-normal in patients with HCC [34]. Wu et al. indicated that the survival rate of patients with HCC decreased with an increase in tumor size. The 5-year survival rate of patients with a tumor diameter of ≤ 2 cm was 21.9%, whereas the 5-year survival rate of patients with tumor sizes of 2.1–5.0, 5.1–10.0, and 10.1–20.0 cm decreased to 14.3%, 9.2%, and 7.7%, respectively [35]. The results in this study showed that compared to those with a tumor size of < 5.6 cm, the risk of HCC patients with tumor sizes of 5.6–8.5 cm and > 8.5 cm increased by 1.186 times and 1.573 times, respectively.

Surgery and adjuvant chemotherapy can significantly improve the prognosis of patients with HCC [36, 37]. With the development of medical technology, surgery has become a standard treatment for improving the prognosis of patients with HCC. Studies have shown that HCC patients with regional lymph node infiltration or multiple metastases may benefit from surgery [38, 39]. A cohort study reported that the median overall survival (OS) of patients treated with surgery was significantly longer than that of patients treated without surgery (MOS: 20 months vs. 7 months, p < 0.001) [40]. Another study provided evidence that adjuvant chemotherapy after hepatectomy was beneficial for patients with operable HCC [41]. Oxaliplatin, sorafenib, and 5-fluorouracil are common chemotherapeutic drugs. Two randomized, placebo-controlled phase III trials demonstrated a significant improvement in OS of patients with advanced HCC [42, 43]. A recent study demonstrated that hepatic artery infusion chemotherapy combined with sorafenib can improve the OS of patients with HCC [44]. The results in this study also showed that surgery and chemotherapy can significantly improve the prognosis of patients with HCC.

To reflect the clinical value of our research, we further performed a surgical stratification analysis of age, T stage, tumor size, fibrosis score and grade. In our study, all of the elderly patients with HCC treated with surgery, had longer survival duration than non-surgical patients, which was consistent with the previous studies [45, 46]. It was shown that SR was usually a better option for older patients with HCC over the age of 74 when LT was not available. Interestingly, we found that SR and LT were a better option for elderly patients with HCC in the 0–4 FS band and 5–6 FS band, respectively. To our knowledge, this finding was reported for the first time in the SEER database. It was also shown that both in training set and internal validation, LR did not show better survival than SR does, which was consistent with other findings [47]. When surgeons were faced with the dilemma of how to choose the best treatment solution for older patients with HCC who could not undergo LT, the finding in this study may provide valuable suggestion.

Nevertheless, this study has some limitations. First, this was a retrospective study, which might result in the introduction of a partial selection bias. Second, some potential prognostic factors such as ALBI grade for liver reserve, specific chemotherapy regimens and multigene signature assessment were not included in the SEER database. In addition, we lack information on clinical indicators other than AFP level and fibrosis score. More indicators included in the study could have helped to identify clinical indicators with high specificity and sensitivity in elderly HCC patients.

Conclusion

In summary, age, sex, marital status, T stage, N stage, surgery, chemotherapy, tumor size, AFP level, fibrosis score, bone metastasis, lung metastasis, and grade were independent prognostic factors for older patients with HCC. The constructed nomogram model based on the above factors could accurately predict the prognosis of such patients. Besides, the developed online web interface of the predictive model provides easily obtained access for clinicians.