Prognostic and predictive value of a pathomics signature in gastric cancer

Chen, Dexin; Fu, Meiting; Chi, Liangjie; Lin, Liyan; Cheng, Jiaxin; Xue, Weisong; Long, Chenyan; Jiang, Wei; Dong, Xiaoyu; Sui, Jian; Lin, Dajia; Lu, Jianping; Zhuo, Shuangmu; Liu, Side; Li, Guoxin; Chen, Gang; Yan, Jun

doi:10.1038/s41467-022-34703-w

Article
Open access
Published: 12 November 2022

Nature Communications volume 13, Article number: 6903 (2022)

Abstract

The current tumour-node-metastasis (TNM) staging system alone cannot provide adequate information for prognosis and adjuvant chemotherapy benefits in patients with gastric cancer (GC). Pathomics, which is based on the development of digital pathology, is an emerging field that might improve clinical management. Herein, we propose a pathomics signature (PS_GC) that is derived from multiple pathomics features of haematoxylin and eosin-stained slides. We find that the PS_GC is an independent predictor of prognosis. A nomogram incorporating the PS_GC and TNM staging system shows significantly improved accuracy in predicting the prognosis compared to the TNM staging system alone. Moreover, in stage II and III GC patients with a low PS_GC (but not in those with a high PS_GC), satisfactory chemotherapy benefits are observed. Therefore, the PS_GC could serve as a prognostic predictor in patients with GC and might be a potential predictive indicator for decision-making regarding adjuvant chemotherapy.

Introduction

Despite the remarkably reduced incidence and mortality of gastric cancer (GC), it remains an important contributor to the global burden of cancer¹. Currently, the tumour-node-metastasis (TNM) staging system is considered the cornerstone for prognosis prediction and treatment decision-making in GC². However, prognostic stratification of patients with GC according to the latest TNM staging system is often poor³. Adjuvant chemotherapy is recommended for advanced GC because of the improvement of oncological outcomes, but large variations in survival benefits from adjuvant chemotherapy have been reported even in patients with the same stage of disease and receiving similar treatment regimens^3,4. These findings suggest that the present TNM staging system provides inadequate prognostic information and cannot accurately identify patients who are more likely to benefit from adjuvant chemotherapy, which highlights the urgent need for discovering new biomarkers that are associated with prognosis and adjuvant chemotherapy benefits in GC.

To understand the heterogeneous prognoses and adjuvant chemotherapy benefits seen in the clinic, several subtyping algorithms based on gene expression data have been investigated^5,6,7. Although these methods have greatly improved our knowledge regarding GC and the potential of subgroup-specific rational treatment strategies has been enumerated by several studies, the cost and complexity of transcriptomic analyses, including expression microarray and RNA-seq analyses, prevent their active utilization in clinical practice^8,9.

Evaluation of haematoxylin and eosin (H&E)-stained slides by experienced pathologists is indispensable for determining the TNM stage and histological classification of GC cases in the clinic. Full digitalization of the stained tissue sections has become feasible because of advances in slide scanning technology and reductions in the cost of digital storage¹⁰. Recently, the term “pathomics” has attracted increased attention. Pathomics embodies a wide variety of data that are captured from digital pathology image analyses to generate quantitative features for characterizing diverse phenotypes of tissue samples, and these data are subsequently analysed to determine diagnosis or predict survival outcomes^11,12,13. Therefore, we hypothesized that analyses of the automatic digital pathomics features extracted from H&E-stained slides could predict the prognosis and survival benefits associated with adjuvant chemotherapy in patients with GC.

Integration of multiple features into a single signature, rather than individual analyses, might improve the performance of the prognostic prediction^14,15. The least absolute shrinkage and selection operator (LASSO)-Cox regression model is a state-of-the-art machine learning method for regression analysis of the relationships between high-dimensional features and survival^16,17,18. Here, we propose a pathomics signature of GC (PS_GC) that was developed with multiple pathomics features extracted from H&E-stained sections using a LASSO-Cox regression model. Thus, in this study, we intended to assess the prognostic value of the PS_GC for overall survival (OS) and disease-free survival (DFS) and explore whether the PS_GC could identify patients with stage II and III diseases who might benefit from adjuvant chemotherapy.

Results

Participants

Table 1 lists the clinicopathological characteristics of patients in the training (n = 264) and validation (n = 216) cohorts. Of the 480 patients included in this study, 69.4% (333/480) were male, and the median [interquartile range (IQR)] age was 58 (49–65) years. The majority of the patients (76.3%, 366/480) were diagnosed with stage II or III disease. No significant difference in clinicopathological characteristics between the training and validation cohorts was found. The clinicopathological characteristics of patients with and without complete data were similar (Supplementary Table 1). The median (IQR) follow-up duration in the training cohort was 64 (27–72) months, with 5-year OS and DFS rates of 58.7% and 55.3%, respectively (Supplementary Fig. 1a, b). In the validation cohort, the median (IQR) follow-up duration was 55 (22.25–92) months. The 5-year OS and DFS rates were 47.7% and 45.4%, respectively (Supplementary Fig. 1c, d).

Table 1 Characteristics of patients in the training and validation cohorts

Construction of the PS_GC

The framework for constructing the PS_GC is presented in Fig. 1. In the training cohort, a LASSO-Cox regression model with 10-fold cross-validation was used to construct the PS_GC. The final PS_GC included 12 pathomics features (Supplementary Fig. 2). The PS_GC calculation formula is presented in the Supplementary Note, from which the PS_GC of the validation cohort was acquired directly. No statistically significant difference in the distribution of the PS_GC [median (IQR)] was found between the training [1.211 (0.857–1.593)] and validation [1.196 (0.902–1.734)] cohorts [median difference: −0.050; 95% confidence interval (CI): −0.151 to 0.051; P = 0.340]. In particular, compared with the intestinal type, a significantly higher PS_GC was detected in the diffuse and mixed types, with a median difference of 0.376 (95% CI: 0.250–0.497; P < 0.001) in the training cohort and 0.393 (95% CI: 0.258–0.541; P < 0.001) in the validation cohort (Supplementary Fig. 3). In terms of the tumour grade, the PS_GC in patients with grade 3 and grade 4 tumours was significantly higher than that in patients with grade 1 and grade 2 tumours in both the training (median difference: 0.415; 95% CI: 0.271–0.553; P < 0.001) and validation (median difference: 0.463; 95% CI: 0.307–0.630; P < 0.001) cohorts (Supplementary Fig. 4). In addition, the distribution of the PS_GC was similar across the tumour size subgroups in both the training (median difference: 0.099; 95% CI: −0.036 to 0.235; P = 0.158) and validation (median difference: 0.119; 95% CI: −0.031 to 0.268; P = 0.125) cohorts (Supplementary Fig. 5).
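For orientation, the general form of a LASSO-penalized Cox model can be written as below. This is the textbook formulation (no tied event times assumed), not the authors' exact formula, which is given only in the Supplementary Note and may include scaling or a constant. The symbols are assumptions introduced here: x_i is the pathomics feature vector of patient i, δ_i the event indicator, R(t_i) the set of patients still at risk at time t_i, and λ the penalty weight, which in this setting would be chosen by 10-fold cross-validation.

```latex
\hat{\beta} = \arg\max_{\beta}
  \left[ \sum_{i:\,\delta_i = 1}
     \left( x_i^{\top}\beta - \log \sum_{j \in R(t_i)} e^{x_j^{\top}\beta} \right)
     - \lambda \sum_{k=1}^{p} \lvert \beta_k \rvert \right],
\qquad
\mathrm{PS}_i = x_i^{\top}\hat{\beta} = \sum_{k:\,\hat{\beta}_k \neq 0} \hat{\beta}_k \, x_{ik}
```

The L1 penalty shrinks most coefficients to exactly zero, which is how a large candidate feature set can be reduced to a short signature such as the 12 features retained here.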

**Fig. 1: Schematic illustration of PS_GC construction.**

Association of the PS_GC with prognosis

An optimum cutoff value of 1.16, which provided the highest standardized log-rank statistic, was determined with the training cohort (Supplementary Fig. 6). Accordingly, patients in both the training and validation cohorts were classified into high- and low-PS_GC groups. The distribution of the PS_GC across survival statuses as well as select pathomics features is shown in Supplementary Fig. 7, which revealed that a higher PS_GC was associated with a higher risk of recurrence or death.
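The cutoff search described above can be sketched as a scan over candidate thresholds that keeps the one with the largest absolute standardized log-rank statistic. The following is a minimal pure-Python illustration on toy data; the function names, the `min_group` guard and the data are hypothetical, and the authors' own software and settings may differ.

```python
import math

def logrank_z(times, events, in_high):
    """Standardized log-rank statistic for the high group versus the low group."""
    event_times = sorted({t for t, e in zip(times, events) if e})
    obs_minus_exp = 0.0
    variance = 0.0
    for t in event_times:
        at_risk = [i for i, ti in enumerate(times) if ti >= t]
        n = len(at_risk)
        n_high = sum(1 for i in at_risk if in_high[i])
        d = sum(1 for i in at_risk if times[i] == t and events[i])
        d_high = sum(1 for i in at_risk
                     if times[i] == t and events[i] and in_high[i])
        # Observed minus expected events in the high group at this time
        obs_minus_exp += d_high - d * n_high / n
        if n > 1:
            # Hypergeometric variance of the high-group event count
            variance += d * (n_high / n) * (1 - n_high / n) * (n - d) / (n - 1)
    return obs_minus_exp / math.sqrt(variance) if variance > 0 else 0.0

def best_cutoff(scores, times, events, min_group=2):
    """Return (cutoff, |z|) maximizing the standardized log-rank statistic.

    min_group is a hypothetical guard that skips splits leaving a tiny group.
    """
    best = (None, 0.0)
    for c in sorted(set(scores)):
        in_high = [s > c for s in scores]
        n_high = sum(in_high)
        if n_high < min_group or len(scores) - n_high < min_group:
            continue
        z = abs(logrank_z(times, events, in_high))
        if z > best[1]:
            best = (c, z)
    return best

# Toy data (not from the study): signature score, follow-up in months, event flag
scores = [0.5, 0.7, 0.9, 1.0, 1.3, 1.5, 1.8, 2.0]
times = [70, 65, 60, 72, 20, 15, 10, 30]
events = [0, 0, 1, 0, 1, 1, 1, 1]
cutoff, z = best_cutoff(scores, times, events)
print(cutoff, round(z, 3))
```

Because the cutoff is chosen to maximize the statistic, the resulting P value in the same cohort is optimistic, which is one reason the cutoff is then applied unchanged to an independent validation cohort.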

In the training cohort, the 5-year OS and DFS rates were 83.7% and 80.5% in low-PS_GC patients, respectively, which were significantly reduced to 36.2% and 33.3% in high-PS_GC patients (Fig. 2a, b, both log-rank P < 0.001). We subsequently performed the same analyses in the validation cohort. Among low-PS_GC patients, the 5-year OS and DFS rates were 73.6% and 71.7%, respectively, and significantly worse 5-year OS and DFS rates of 22.7% and 20.0% were found in high-PS_GC patients (Fig. 2c, d, both log-rank P < 0.001). The PS_GC remained a significant prognostic indicator after stratification by clinicopathological variables, indicating the independent association of the PS_GC with the prognosis (Supplementary Figs. 8–11).
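The survival rates quoted above are the kind of quantity read off a Kaplan–Meier curve at a fixed horizon. A minimal sketch of the product-limit estimator follows, assuming right-censored data with times in months; the helper names and the toy data are hypothetical and are not taken from the study.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time (product-limit estimator)."""
    surv = 1.0
    curve = []
    for t in sorted({t for t, e in zip(times, events) if e}):
        n_at_risk = sum(1 for ti in times if ti >= t)
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e)
        surv *= 1 - deaths / n_at_risk
        curve.append((t, surv))
    return curve

def survival_at(curve, horizon):
    """Estimated survival probability at the given horizon (step function)."""
    s = 1.0
    for t, surv in curve:
        if t > horizon:
            break
        s = surv
    return s

# Toy data: follow-up in months and event flag (1 = death, 0 = censored)
times = [12, 30, 45, 60, 60, 72]
events = [1, 0, 1, 1, 0, 0]
curve = kaplan_meier(times, events)
print(survival_at(curve, 60))  # survival estimate at 60 months (5 years)
```

Computing this separately for the low- and high-PS_GC groups and comparing the two curves with a log-rank test corresponds to the analysis summarized in Fig. 2.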

**Fig. 2: Kaplan–Meier survival curves according to the PS_GC level.**

Development and validation of the pathomics nomogram for prognosis

In the univariate Cox regression analysis, the PS_GC, carcinoembryonic antigen (CEA) level, carbohydrate antigen (CA) 19-9 level, tumour location, tumour size, Lauren type, depth of invasion (T stage), lymph node metastasis (N stage) and distant metastasis (M stage) were significantly associated with OS in the training cohort (Table 2). The backwards stepwise multivariate Cox regression analysis showed that the PS_GC, depth of invasion, lymph node metastasis and distant metastasis were independent predictors of OS. The same results were found in the Cox regression analysis for DFS. The proportional hazards (PH) assumption tests for the Cox regression models were valid (Supplementary Figs. 12 and 13). No interaction effects were observed between PS_GC and the TNM staging system for OS and DFS (Supplementary Tables 2–4). Therefore, two pathomics nomograms were developed to predict OS and DFS by incorporating the four independent predictors (Fig. 3). Lymph node metastasis had the most important contribution to the prognostic prediction in the pathomics nomograms, followed by the PS_GC (Supplementary Fig. 14).

Table 2 Univariate and multivariate Cox regression analyses of the PS_GC and clinicopathological characteristics for overall survival and disease-free survival in the training cohort

**Fig. 3: Pathomics nomograms for the prediction of OS and DFS.**

In the training cohort, the pathomics nomogram yielded a concordance index (C-index) of 0.809 (95% CI: 0.741–0.878) for OS and 0.792 (95% CI: 0.718–0.866) for DFS. In addition, the time-dependent receiver operating characteristic (ROC) curve of the pathomics nomogram at 5 years produced an area under the receiver operating characteristic curve (AUROC) of 0.901 (95% CI: 0.863–0.939) for OS and 0.891 (95% CI: 0.850–0.932) for DFS (Supplementary Fig. 15a, b). Furthermore, the calibration curves showed good agreement between the nomogram-predicted survival and actual survival (Supplementary Fig. 16a, b). The good discrimination, with a C-index of 0.784 (95% CI: 0.706–0.862) for OS and 0.794 (95% CI: 0.709–0.873) for DFS, was externally validated in the validation cohort. The AUROCs for OS and DFS were 0.887 (95% CI: 0.842–0.931) and 0.888 (95% CI: 0.844–0.933), respectively (Supplementary Fig. 15c, d). The favourable agreement between the nomogram-predicted survival and actual survival in the calibration curves was also confirmed in the validation cohort (Supplementary Fig. 16c, d). Finally, the decision curve analysis indicated that using the pathomics nomograms to predict OS and DFS provided more net benefits than using the treat-all or treat-none scheme in both the training and validation cohorts (Supplementary Fig. 17), indicating that the pathomics nomograms were clinically applicable.
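To make the discrimination metric concrete, the C-index can be illustrated with Harrell's pairwise definition: among pairs in which the patient with the shorter follow-up had an event, it is the fraction in which that patient also received the higher predicted risk. The sketch below is a plain O(n²) illustration on toy data; the function name is hypothetical, and it does not reproduce the confidence intervals or the exact estimator used in the study.

```python
def harrell_c_index(times, events, risks):
    """Harrell's concordance index for right-censored survival data."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a pair is comparable only if the earlier time is an event
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1      # earlier event had the higher risk
                elif risks[i] == risks[j]:
                    concordant += 0.5    # tied predictions count as half
    return concordant / comparable

# Toy data: follow-up (months), event flag, model-predicted risk score
times = [10, 20, 30, 40]
events = [1, 1, 0, 1]
risks = [0.9, 0.4, 0.6, 0.1]
print(harrell_c_index(times, events, risks))
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which gives a scale for reading the values of roughly 0.78 to 0.81 reported for the nomograms.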

Incremental value of the PS_GC added to the TNM stage model

Two TNM stage models for OS and DFS were built based on multivariate Cox regression analyses without the PS_GC to elucidate the incremental value of the PS_GC added to clinicopathological variables for predicting the prognosis (Supplementary Table 5). In the training cohort, the C-index of the PS_GC for the prediction of OS and DFS was 0.727 (95% CI: 0.641–0.813) and 0.712 (95% CI: 0.622–0.802), respectively, and the TNM stage models showed a C-index of 0.782 (95% CI: 0.709–0.855) for OS and 0.770 (95% CI: 0.694–0.846) for DFS. Compared with the TNM stage models, the pathomics nomograms, which were based on the combination of the PS_GC and the TNM staging system, displayed a significantly improved C-index of 0.809 (95% CI: 0.741–0.878; P = 0.002) for OS and 0.792 (95% CI: 0.718–0.866; P = 0.022) for DFS (Supplementary Table 6). Similarly, the AUROCs of the PS_GC for OS and DFS were 0.798 (95% CI: 0.744–0.852) and 0.794 (95% CI: 0.739–0.848), respectively, and the TNM stage models yielded an AUROC of 0.868 (95% CI: 0.825–0.910) for OS and 0.859 (95% CI: 0.814–0.904) for DFS. Compared with the TNM stage models, the pathomics nomograms exhibited a significantly higher AUROC of 0.901 (95% CI: 0.863–0.939; P = 0.004) for OS and 0.891 (95% CI: 0.850–0.932; P = 0.005) for DFS (Supplementary Fig. 15a, b). The decision curve analysis indicated that compared with the TNM stage models, the pathomics nomograms showed greater net benefits across most of the range of reasonable threshold probabilities (Supplementary Fig. 17a, b). Moreover, the pathomics nomograms showed a net reclassification improvement (NRI) of 0.177 (95% CI: 0.021–0.319; P = 0.026) for OS and 0.218 (95% CI: 0.048–0.344; P = 0.012) for DFS compared to the TNM stage models (Supplementary Table 7). The abovementioned results were well validated in the validation cohort.
In the validation cohort, the PS_GC demonstrated a C-index of 0.725 (95% CI: 0.627–0.823) for OS and 0.738 (95% CI: 0.642–0.834) for DFS, respectively, and the TNM stage models presented a C-index of 0.742 (95% CI: 0.656–0.828) for OS and 0.748 (95% CI: 0.660–0.836) for DFS. Compared with the TNM stage models, a significantly increased C-index of 0.784 (95% CI: 0.706–0.862; P < 0.001) for OS and 0.794 (95% CI: 0.709–0.873; P < 0.001) for DFS was observed in the pathomics nomograms (Supplementary Table 6). Meanwhile, the AUROCs of the PS_GC for OS and DFS were 0.774 (95% CI: 0.710–0.837) and 0.775 (95% CI: 0.711–0.839), respectively, and the TNM stage models exhibited an AUROC of 0.848 (95% CI: 0.797–0.900) for OS and 0.846 (95% CI: 0.794–0.898) for DFS. Compared with the TNM stage models, a significantly enhanced AUROC of 0.887 (95% CI: 0.842–0.931; P = 0.003) for OS and 0.888 (95% CI: 0.844–0.933; P = 0.003) for DFS was also confirmed in the pathomics nomograms (Supplementary Fig. 15c, d). In addition, higher net benefits across most of the range of reasonable threshold probabilities in the pathomics nomograms compared to the TNM stage models were detected (Supplementary Fig. 17c, d). Finally, an NRI of 0.318 (95% CI: 0.147–0.497; P = 0.010) for OS and 0.380 (95% CI: 0.141–0.556; P = 0.028) for DFS in the pathomics nomograms compared to the TNM stage models was found in the validation cohort (Supplementary Table 7). Herein, the PS_GC could provide additional prognostic value to the TNM staging system for GC.
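The NRI reported above summarizes how often a new model moves patients in the right direction relative to a reference model. The sketch below shows the categorical form for a binary outcome at a fixed horizon, under the simplifying assumptions that every patient's outcome at that horizon is known (no censoring) and that risk has been binned into ordered categories. The text does not state which NRI variant was used, so this is an illustration of the concept only, with hypothetical names and toy data.

```python
def net_reclassification_improvement(old_cat, new_cat, outcome):
    """Categorical NRI: net upward moves in events plus net downward moves in non-events."""
    up_e = down_e = n_e = 0
    up_n = down_n = n_n = 0
    for old, new, y in zip(old_cat, new_cat, outcome):
        if y:  # patient had the event: moving to a higher category is correct
            n_e += 1
            up_e += new > old
            down_e += new < old
        else:  # patient was event-free: moving to a lower category is correct
            n_n += 1
            up_n += new > old
            down_n += new < old
    return (up_e - down_e) / n_e + (down_n - up_n) / n_n

# Toy data: risk category under the reference and the new model (0 = low, 1 = high)
old_cat = [0, 0, 1, 1, 0, 1, 1, 0]
new_cat = [1, 1, 1, 0, 0, 0, 0, 1]
outcome = [1, 1, 1, 1, 0, 0, 0, 0]
print(net_reclassification_improvement(old_cat, new_cat, outcome))
```

In this toy example the events contribute (2 − 1)/4 and the non-events contribute (2 − 1)/4, so a positive NRI reflects net correct reclassification in both groups.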

Predictive value of the PS_GC for adjuvant chemotherapy response

To assess the predictive value of the PS_GC for adjuvant chemotherapy response, we evaluated the association between the PS_GC and survival among GC patients with stage II and stage III disease who either received or did not receive postoperative adjuvant chemotherapy. Patient information after stratification according to adjuvant chemotherapy status is listed in Supplementary Table 8. For the low-PS_GC patients, adjuvant chemotherapy was significantly associated with improved OS and DFS in the training, validation and total cohorts; however, the improved prognosis was not observed in high-PS_GC patients (Fig. 4). Similar results were obtained from the subgroup analyses of patients with stage II and III tumours (Supplementary Figs. 18, 19). No difference in the performance status of patients with a high PS_GC to tolerate the full course of chemotherapy was observed (Supplementary Table 9). Subsequently, a test of the interaction between the PS_GC and adjuvant chemotherapy indicated that patients with a low PS_GC had superior adjuvant chemotherapy benefits compared to patients with a high PS_GC, with the P for interaction <0.05 for OS and DFS (Table 3, Supplementary Table 10). Taken together, these results indicated that the PS_GC could identify stage II and III GC patients who might obtain survival benefits from adjuvant chemotherapy.
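One common way to formalize such an interaction test is a Cox model with a product term, sketched below. This is a generic formulation offered as an assumption, since the exact model specification is not spelled out in this section; PS denotes the PS_GC group indicator (1 = high, 0 = low) and CT the adjuvant chemotherapy indicator (1 = received, 0 = not received).

```latex
h(t \mid \mathrm{PS}, \mathrm{CT}) = h_0(t)\,
  \exp\!\left( \beta_1 \mathrm{PS} + \beta_2 \mathrm{CT} + \beta_3\, \mathrm{PS} \times \mathrm{CT} \right),
\qquad
\mathrm{HR}_{\mathrm{CT}}^{\text{low}} = e^{\beta_2},
\quad
\mathrm{HR}_{\mathrm{CT}}^{\text{high}} = e^{\beta_2 + \beta_3}
```

Under this formulation, the P for interaction tests the null hypothesis β₃ = 0, that is, whether the hazard ratio for chemotherapy is the same in the low- and high-PS_GC groups.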

**Fig. 4: Association between the PS_GC and survival benefits from adjuvant chemotherapy in stage II and stage III GC.**

Table 3 Association of the PS_GC with overall survival and disease-free survival in stage II and III patients receiving adjuvant chemotherapy

Discussion

Accurate prediction of prognosis and adjuvant chemotherapy benefits is integral to the risk stratification and management of GC patients in the clinic. In this study, we constructed the PS_GC to predict the prognosis of patients with GC and found that the PS_GC successfully stratified patients into high- and low-PS_GC groups with significant differences in terms of OS and DFS. Furthermore, by combining the PS_GC and TNM staging systems, we developed and validated two pathomics nomograms with significantly improved prognostic predictions compared with the TNM staging system alone. These results indicate that the PS_GC might provide complementary information about the prognosis of GC.

Adjuvant chemotherapy is a standard treatment for nonmetastatic advanced GC^3,19. However, the variations in survival outcomes even in patients with the same TNM stage who receive the same regimens indicate that a considerable number of patients do not benefit from adjuvant chemotherapy. Individualized biomarkers that can distinguish patients who are likely to benefit from adjuvant chemotherapy could improve tailored therapy²⁰. Our results revealed that patients with a low PS_GC were predicted to benefit from adjuvant chemotherapy, but in patients with a high PS_GC, limited benefits were observed. Patient performance status is an important factor that might affect tolerance to adjuvant chemotherapy. In this study, we did not observe a difference in the performance status of patients with a high PS_GC to tolerate the full course of chemotherapy, indicating that the predictive value of the PS_GC for adjuvant chemotherapy benefits might also be applicable to patients with a poor physical condition. Although patients receiving neoadjuvant therapy were excluded, most patients included in this study were still diagnosed with locally advanced GC, so this exclusion did not imply that the cohort was restricted to patients with a potentially lower risk. To our knowledge, this is the first study to demonstrate the utility of fully quantitative imaging features extracted from H&E-stained slides to predict prognosis and benefits from adjuvant chemotherapy in GC.

Two factors were critical for the construction of the PS_GC. The first factor is the use of a convenient image-processing approach to extract quantitative pathomics features. To date, a consensus about the extraction of pathomics features has not been reached^12,13. CellProfiler is a free, open-source software that automatically measures phenotypes from biological images and has been used in digital pathology analysis with satisfactory performance^21,22. Therefore, CellProfiler is an easy-to-use and reproducible platform that allows clinicians to extract quantitative pathomics features. The second factor is a practical machine learning method for the selection of prognostic features²³. For this purpose, the LASSO-Cox regression model was employed because of its ability to deal with high-dimensional data^16,17.

Despite its limited performance, the TNM staging system remains the cornerstone for predicting the prognosis of patients with GC. To date, some investigators have explored potential biomarkers that might provide additional prognostic information in GC. Based on the gene expression data, several molecular classifications have been proposed. For example, according to The Cancer Genome Atlas project, GC was divided into four subtypes based on molecular classification, including tumours positive for Epstein-Barr virus, microsatellite unstable tumours, genomically stable tumours and tumours with chromosomal instability, which might aid in patient stratification and tailoring therapy^5,7. However, the cost and complexity of gene expression data analyses prevent their clinical application, especially in developing countries. In addition, a radiomics analysis of radiological images has also shown a favourable ability for predicting the prognosis of GC^24,25. Other studies have also revealed that the stromal immune cells, such as tumour-associated macrophages, cytotoxic T cells and neutrophils, are indicators of the GC prognosis^26,27. Prospective studies and further evaluations are needed to better clarify their impacts on the prognosis of GC. Beyond the pathological diagnosis, the evaluation of H&E-stained sections has so far provided limited information on patient prognosis and chemotherapy response. To date, the prognostic value of pathomics analysis in GC has not been reported. In this study, we discovered that the PS_GC could also contribute to the prediction of prognosis and identify patients who are more likely to benefit from adjuvant chemotherapy in GC. Because the PS_GC was derived from the H&E-stained sections routinely used in the clinic, the PS_GC might be conveniently applied in clinical practice without additional financial burden and might favour the development of tailored therapy for GC. We expect that these biomarkers, including molecular subtypes, radiomics, stromal immune cells, and pathomics, will be utilized together to improve the prediction of prognosis and chemotherapy response of GC in the future.

In Western countries, patients with locally advanced GC are recommended to receive neoadjuvant chemotherapy because of the prolonged survival^28,29; however, radical gastrectomy followed by adjuvant chemotherapy remains the standard of care for these patients in Eastern Asia^2,30, and patients with locally advanced GC are treated with this therapeutic strategy in our medical centre³¹. Patients receiving neoadjuvant chemotherapy were excluded from our study, as neoadjuvant chemotherapy would result in morphological changes in the H&E-stained sections, including tumour regression and fibrosis; thus, the prediction models developed in this study might not be applicable to patients with GC who receive neoadjuvant chemotherapy. However, our results revealed that the pathomics analysis reflected tumour heterogeneity, which was a potential indicator of the prognosis and chemotherapy response of patients with GC. Thus, the pathomics analysis might also be suitable for evaluating the response and outcomes of neoadjuvant chemotherapy. Further investigations are required in this specific setting.

Pathomics is a novel method that has been utilized to explore tumour heterogeneity since different degrees of disease progression, clinical outcomes, and treatment responses correspond to a range of histologic features in different tumour cells¹¹. Traditional pathological examination is performed by experienced pathologists at multiple magnifications to evaluate the characteristics of tumour cells; however, pathologists do not and cannot routinely characterize more detailed information for every slide. Thus, pathomics can serve as a useful method to complement traditional pathological evaluation³². Our results revealed significant differences in the Lauren type and tumour grade subgroups, indicating that the local image features would change according to the Lauren type and tumour grade, and the Lauren type and tumour grade might drive the PS_GC. Moreover, a similar distribution of PS_GC was found between the tumour size subgroups, which implied no bias in the selection of regions of interest due to tumour size.

The PS_GC was found to be a potential predictor of prognosis and adjuvant chemotherapy benefits for GC patients. However, it remains unclear whether the predictive value of the PS_GC is determined by tumour intrinsic factors or tumour microenvironment effects. Currently, the integrative analysis of pathomics features and genomics data provides a feasible way to explore the underlying mechanisms of PS_GC with prognosis and adjuvant chemotherapy benefits^33,34. Thus, further investigations should focus on the relationship between pathomics features and genomics data.

The tumour grade is a common term used to diagnose GC in the clinic, which evaluates the progression of tumour cells. Our data showed that the PS_GC substantially outperformed the tumour grade in predicting the OS and DFS (Supplementary Table 11 and Supplementary Fig. 20). The prognostic performance of the TNM staging system was not increased when the tumour grade was added; conversely, significantly improved prognostic performance was detected when the PS_GC was added to the TNM staging system (Supplementary Table 12 and Supplementary Fig. 21). Thus, the PS_GC performed better when it was included in the pathomics nomogram than when it was replaced by the tumour grade.

The improvement in AUROC ranged from 3.2% to 4.2% when the PS_GC was added to the TNM staging system, and the corresponding improvement in the C-index ranged from 2.2% to 4.6%, which might seem small. In this study, the pathomics nomograms were developed based on the depth of invasion, lymph node metastasis, distant metastasis and PS_GC. In terms of the individual variables, the prognostic value of the PS_GC was comparable to that of the lymph node metastasis (Supplementary Tables 13 and 14). However, in the pathomics nomograms, lymph node metastasis made the most important contribution to predicting the prognosis, followed by the PS_GC. Based on these results, lymph node metastasis exerted more powerful effects on the AUROCs and C-indexes of the pathomics nomograms than the PS_GC, although the individual prognostic performance of the two variables was comparable, which explained the small incremental value of adding the PS_GC to the TNM staging system. Several prognostic biomarkers with statistically significant but numerically small incremental values have also been reported previously^35,36,37. Although the incremental value of adding the PS_GC to the TNM staging system for predicting the prognosis was small, it did provide additional prognostic information. Meanwhile, the ability of the PS_GC to predict response to adjuvant chemotherapy was valuable, which might avoid the toxic effects of chemotherapy in those patients least likely to benefit. Thus, from the perspective of clinicians, the PS_GC was clinically relevant and worth further investigation.
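The quoted ranges can be reproduced directly from the point estimates reported in the Results. The short check below uses only those published values; the variable and function names are illustrative.

```python
# (TNM stage model, pathomics nomogram) point estimates reported in the Results,
# in the order: training OS, training DFS, validation OS, validation DFS
auroc = [(0.868, 0.901), (0.859, 0.891), (0.848, 0.887), (0.846, 0.888)]
c_index = [(0.782, 0.809), (0.770, 0.792), (0.742, 0.784), (0.748, 0.794)]

def improvement_range(pairs):
    """Smallest and largest absolute gain, in percentage points."""
    gains = [round((new - old) * 100, 1) for old, new in pairs]
    return min(gains), max(gains)

print("AUROC gain (percentage points):", improvement_range(auroc))
print("C-index gain (percentage points):", improvement_range(c_index))
```

These are absolute differences in the metric expressed as percentage points, which matches the 3.2% to 4.2% and 2.2% to 4.6% figures in the text.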

The AUROCs of the pathomics nomograms were acquired from the multivariate analysis of potential prognostic factors. Currently, a unanimous consensus has not been reached about the calculation of sample size in the multivariate analysis for developing a prediction model. According to the TRIPOD Statement, at least 10 events per variable are needed³⁸. Thus, according to the 10 events per variable criterion, the minimum number of patients with recurrence needed to assess the difference in AUROCs between the pathomics nomograms and TNM stage models is 100. In the training cohort, 122 patients suffered from recurrence. Thus, our sample size in the training cohort was adequate to conduct the multivariate analysis. For the sample size in the validation cohort, Lei et al.³⁹ suggested a ratio of 7:3 between the training and validation cohorts. In our study, the validation cohort contained 216 patients, which was also sufficient.

Considering the survival differences between the training and validation cohorts (OS: 58.7% vs. 47.7%; DFS: 55.3% vs. 45.4%), we speculate that they might be due to socioeconomic differences, despite the similar clinicopathological characteristics between the two cohorts. The training cohort and validation cohort came from Guangdong and Fujian in China, respectively. Guangdong is the most economically advanced province in China; most patients in the training cohort live in urban areas, and the local medical insurance covers more examinations and therapies. Conversely, the economic level of Fujian is moderate, considerable numbers of patients in the validation cohort reside in rural areas, and the local medical insurance covers the costs of fewer examinations and therapies⁴⁰. Therefore, despite the rigorous follow-up after surgery, early detection and treatment of recurrent diseases are limited for patients in the validation cohort, thus resulting in differences in survival.

According to the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2017 Stomach Cancer Collaborators, the estimated 5-year OS of GC is approximately 20%, with the exceptions of 65% in Japan and 71.5% in South Korea, where population screening has led to the effective diagnosis of tumours at early stages¹. In this study, all included GC patients had received radical gastrectomy; however, the GBD 2017 data were derived from patients diagnosed with GC, regardless of whether the disease was resectable. Thus, the survival outcomes of the included patients are high by global standards. In addition, the 5-year OS of GC patients receiving radical gastrectomy generally ranges from 45% to 60% in China, indicating that the survival outcomes of patients with GC included in this study are similar to those in other parts of China³⁰.

Our results revealed that elevated CEA and CA 19-9 levels were significantly associated with a worse prognosis. In clinical practice, CEA and CA 19-9 levels are the most common tumour markers measured before surgery and during follow-up for GC. CEA and CA 19-9 levels have been used as diagnostic markers and are apt to rise 2–3 months before metastatic lesions become detectable by imaging modalities². Several studies have also reported that elevated preoperative CEA and CA 19-9 levels are associated with a worse prognosis for patients with resectable GC^41,42. Currently, the intrinsic mechanisms of elevated CEA and CA 19-9 levels for worse survival are still unclear. One possible explanation might be that CEA and CA 19-9, which are a ligand of E-selectin and an intercellular adhesion molecule, respectively, play critical roles in the intercellular adhesion of tumour cells to vascular endothelial cells and contribute to tumour invasion and metastasis^43,44. Thus, the prognostic values of CEA and CA 19-9 levels for GC need to be further investigated.

In general, there are two main types of artificial intelligence-based computational approaches for pathomics analysis: deep neural network-based approaches and handcrafted feature-based approaches¹². The method used in this study is a handcrafted feature-based approach that was developed based on the close collaboration between pathologists and oncological surgeons, and thus could be complex and time-consuming⁴⁵. Deep neural network-based approaches are developed through unsupervised feature learning, which depends on the existence of learning sets and annotated exemplars from the categories of interest, and the network design usually focuses on fine-tuning the algorithm to maximize accuracy while minimizing processing time⁴⁶. In addition, deep neural network-based approaches trained on a particular disease subtype could be applied to other subtypes as well¹². However, because handcrafted feature-based approaches are more interpretable than deep neural network-based approaches, they might be more likely to be used for high-level decision-making, such as that regarding oncological prognosis or prediction of benefit from therapy; in contrast, deep neural network-based approaches might be more appropriate in situations where the need to “explain the decision” is reduced, such as low-level tasks including object detection or segmentation^12,47,48. Considering the application scenario of this study and that oncologists and pathologists are the primary end users, we selected the handcrafted feature-based approach.

There are some limitations in our study. First, given the retrospective design, our study was not free from inherent biases. Second, all enrolled participants came from two medical centres in China. Thus, further validation in prospective randomized trials incorporating diverse populations is warranted to test the clinical utility of the PS_GC for individualized decision-making.

In conclusion, our study constructed the PS_GC and found that the PS_GC was significantly associated with the prognosis of patients with GC. By integrating the PS_GC with the TNM staging system, we developed and validated two pathomics nomograms, which improved the prediction of the GC prognosis compared to the TNM staging system alone. Moreover, the PS_GC could distinguish patients with stage II and III diseases who were likely to derive benefits from adjuvant chemotherapy.

Methods

This study was approved by the Institutional Review Boards of Nanfang Hospital of Southern Medical University and the Fujian Cancer Hospital of Fujian Medical University. Written informed consent was obtained from all patients before surgery, which contained a statement on the formalin-fixed, paraffin-embedded samples and clinicopathological data for scientific research. All procedures involving human participants were in accordance with the Declaration of Helsinki.