Abstract
T-cell exhaustion (Tex) is considered to be a reason for immunotherapy resistance and poor prognosis in lung adenocarcinoma. Therefore, we used weighted correlation network analysis to identify Tex-related genes in the cancer genome atlas (TCGA). Unsupervised clustering approach based on Tex-related genes divided patients into cluster 1 and cluster 2. Then, we utilized random forest and the least absolute shrinkage and selection operator to identify nine key genes to construct a riskscore. Patients were classified as low or high-risk groups. The multivariate cox analysis showed the riskscore was an independent prognostic factor in TCGA and GSE72094 cohorts. Moreover, patients in cluster 2 with high riskscore had the worst prognosis. The immune response prediction analysis showed the low-risk group had higher immune, stromal, estimate scores, higher immunophenscore (IPS), and lower tumor immune dysfunction and exclusion score which suggested a better response to immune checkpoint inhibitors (ICIs) therapy in the low-risk group. In the meantime, we included two independent immunotherapy cohorts that also confirmed a better response to ICIs treatment in the low-risk group. Besides, we discovered differences in chemotherapy and targeted drug sensitivity between two groups. Finally, a nomogram was built to facilitate clinical decision making.
Similar content being viewed by others
Introduction
Lung cancer remains one of the deadliest cancers in the world, and 85% of them are non-small cell lung cancer (NSCLC)1. According to statistics, lung adenocarcinoma (LUAD) is a major subtype of NSCLC, accounting for approximately 40% of all patients2. Despite significant advances in the treatment of LUAD, the 5-year overall survival (OS) rate for LUAD is less than 20%3.
In recent years, with the rapid development of immunotherapy, especially the application of immune checkpoint inhibitors (ICIs), the prognosis of lung cancer patients has been improved significantly4,5. However, not all patients respond to ICIs, and a minority of patients do not benefit from ICIs6. In addition, it is worth noting that similar to chemotherapy or targeted therapy, a majority of patients who initially respond to ICI therapy and then develop primary or secondary resistance.
A growing number of studies suggest that tumor microenvironment (TME) plays an important role in cancer development and antitumor processes and may involve in immunotherapy resistance7. TME contains various immune cells, stromal cells and extracellular matrix molecules, of which tumor-infiltrating lymphocytes (TILs) are the main immune cells exerting anti-tumor activity8,9. TILs have been reported as predictive immune efficacy biomarkers for lung cancer10. However, TILs, especially CD8 + T cells, eventually developed a state of dysfunction known as T-cell exhaustion during long-term tumor fight11. The exhausted T cells have been demonstrated to exhibit overexpression of inhibitory receptors, metabolic dysregulation, epigenetic reprogramming and loss of effector functions12,13,14. And studies have proven that exhausted T cells can induce immune tolerance15. Immunotherapy depends on the activation of T cells to eliminate tumors. However, the emergence of T-cell exhaustion impedes it capacity of anti-tumor and facilitates immune escape. The formation of exhausted T cells is regulated by multiple factors, and a deep understanding of key genes that regulated T cell exhaustion might help to interpret the underlying mechanism.
Therefore, our study aimed to explored the crucial molecules of T cell exhaustion in LUAD, and attempted to construct a prognostic signature and nomogram model to predict prognosis and immunotherapy response for LUAD patients.
Methods
Publicly available datasets
We downloaded the normalized RNA sequencing data (transcripts per million, TPM) and raw counts data of 539 LUAD patients from the cancer genome atlas (TCGA) (https://portal.gdc.cancer.gov/) database. The normalized matrix files of GSE72094 cohort from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/), which had included 442 LUAD patients, used as an independent external cohort for risk model validation. The patients’ characteristics and other information of TCGA-LUAD and GSE72094 cohorts were shown in Supplementary file 1: Table S1. In addition, immunotherapy cohorts including GSE135222 and GSE91061 from GEO database were included for validation of immunotherapy response. Probe IDs were converted to gene symbols according to the platform annotation file. If multiple probes matched the same gene symbol, their median expression values were selected. The information of TCGA-LUAD and GEO datasets was summarized in Supplementary file 1: Table S2.
Weighted correction network analysis (WGCNA)
Weighted correction network analysis (WGCNA) was used to construct co-expression networks of scale-free distributions of genes based on correlations to gene expression, from which the most relevant modules to clinical features were identified16. In our study, we used the "pickSoftThreshold" method to choose an appropriate soft threshold parameter. Then, the adjacency was transformed into a topological overlap matrix (TOM) and genes with similar expression patterns were divided into the same module. Finally, genes from modules that had high correlation coefficients with T-cell exhaustion and dysfunction were identified for subsequent analysis.
Consensus clustering analysis
The “ConsensusClusterPlus” R package was used to perform the unsupervised consensus clustering analysis17. We used 80% of the items for subsampling and divided each subsample into groups by the k-means algorithm, repeating this clustering process 1000 times. Then, The cumulative distribution function (CDF) curve and consensus matrix are used to identify the optimal k-value. Based on the optimal k-value, patients were clustered into sub-clusters. In addition, the principal component analysis (PCA) and principal coordinate analysis (PCoA) were performed to show the distribution difference of sub-clusters.
Functional enrichment analysis
Enrichment analysis in Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) were based on the “Clusterprofiler” R package, where “adjusted p < 0.05” was considered significant18. The gene sets of “hallmark” from the Molecular Signatures Database (MSigDB) were analyzed through the gene set enrichment analysis software (v4.3.2). P.adjust < 0.05 and q values < 0.25 were considered statistically significant19,20.
Tumor immune microenvironment evaluation
The ESTIMATE algorithm tool was used to was used to calculate stromal, immune and estimated scores21. The CIBERSORTx (https://cibersort.stanford.edu/) was used to evaluate the composition of 22 kinds of tumor-infiltrating immune cells based on the principle of linear support vector regression22. Single sample gene set enrichment analysis (ssGSEA) based on the R package GSVA was used to quantify single sample immune cell infiltration scores. The immune cell abundance identifier (ImmuCellAI) (http://bioinfo.life.hust.edu.cn/ImmuCellAI) was utilized to obtain the scores of T cell exhaustion23. Moreover, we compared the mRNA expression levels of immune checkpoint suppressor molecules including programed cell death protein 1 (PD1), programmed death ligand 1 (PD-L1), cytotoxic T-lymphocyte associated protein 4 (CTLA4), lymphocyte activating 3 (LAG3), T cell immunoreceptor with Ig and ITIM domains (TIGIT), hepatitis A virus cellular receptor 2 (HAVCR2) and other immunosuppressive molecule in two clusters.
Development and verification of prognostic signature
Firstly, differential genes between cluster 1 and cluster 2 in TCGA cohort were analyzed with the “DESeq2” R package based on the mRNA expression of raw counts data24. The threshold of adjusted P value less than 0.05 (P < 0.05) and the absolute log2 fold-change greater than 1 (|log2FC|> 1) were define differential genes. Next, univariate cox regression analysis was utilized to screen differential genes that were associated with OS in TCGA and GSE72094 cohort using the “survival” R package (P < 0.05). Then, we performed machine learning algorithms including the least absolute shrinkage and selection operator (LASSO) regression analysis25 and random forest (RF) algorithm26 to improve the accuracy and reliability of prognostic signature using the “glmnet” and “randomForestSRC” R packages, respectively. Furthermore, we used the intersection of LASSO and the RF to determine candidate genes, followed by multivariate cox regression analysis. According to the results of the multivariate cox analysis, the prognostic signature was established as follows: Riskscore = \(\sum_{i=1}^{n}\left(Coef\times Ni\right)\), where Coef referred the coefficient of gene i and Ni represented the expression value of gene i.
To further validate the predictive power of the prognostic signature, we performed Kaplan–Meier survival analysis in the TCGA-LUAD and GSE72094 cohorts, where the cutoff value was set to the median riskscore. Time-dependent receiver operating characteristic curve (ROC) were plotted using the “timeROC” R package to predict OS at 1, 2 and 3 years in the training dataset (TCGA-LUAD) and validation cohort (GSE72094). Finally, we further analyzed the relationship between the prognostic signature and clinical characteristics (including age, gender and stage) to clarify the independent prognostic value of riskscore using multivariate Cox regression analyses.
Development and evaluation of the nomogram
We performed the univariate and multivariate Cox regression analyses for clinical parameters and riskscore. In the multivariate Cox model, variables with P < 0.05 were included in the construction of the nomogram. The calibration analysis and time-dependent ROC curves were used to assess the prognostic accuracy of the nomogram model. Decision curve analysis (DCA) curves used to evaluate the net benefit of the nomogram.
Immunotherapeutic response prediction
Tumor Immune Dysfunction and Exclusion (TIDE) was a computational method to predict ICIs response27. Based on transcriptomic data from TCGA cohort, T-cell dysfunction scores and TIDE scores were calculated through an online website (http://tide.dfci.harvard.edu). The Cancer Immunome Atlas (TCIA) database describes the immune landscape of 20 solid tumors and developed a novel score called “Immunophenscore” (IPS), which also predicted the response to ICIs. We calculated each patients’ IPS scores through an online website (https://tcia.at/)28. Tumor mutational load (TMB) was a valuable biomarker to assess the efficiency of immunotherapy29. We utilized the “TCGAbiolinks” R package to download somatic mutation data, and then calculated TMB values for each LUAD patient in the TCGA cohort based on the “Maftools” R package30. In addition, the mRNA expression of PD-1, PD-L1 and CTLA4 was analyzed in low or high-risk groups.
Chemotherapeutic drug sensitivity prediction
We utilized the “pRophetic” R package to estimate the half-maximal inhibitory concentrations (IC50) of drugs to predict the sensitivity of LUAD patients to chemotherapy and targeted therapies31.
Statistical analysis
All statistical analyses were performed using R software (version 4.2.2). Wilcoxon test was used to compare the differences between groups. The log-rank test was used to compare Kaplan–Meier survival curves. Univariate and multivariate Cox analyses were performed to establish independent prognostic factors. All P values were two-sided and less than 0.05 were considered statistically significant.
Results
Identification of Tex-related genes and functional analysis
WGCNA analysis was conducted according to the expression of LUAD mRNA in the TCGA cohort (Fig. 1A). We choose a soft threshold of 4, which met the scale-free network rule (Fig. 1B). Figure 1C–E showed that the purple module was the most significantly related to T-cell exhaustion (Tex) (cor = 0.5, P = 7.3e-25) and T cell dysfunction (cor = 0.75, P = 3.2e-68). Therefore, the purple module (n = 371) was characterized as Tex-related genes. GO and KEGG enrichment analysis revealed possible biological processes involved in Tex-related genes (Supplementary file 2: Fig. S1).
Identification of Tex subgroups
Tex subgroups were identified by using an unsupervised clustering approach based on the expression levels of Tex related genes in TCGA. Based on the consensus CDF curve (Fig. 2A), the relative change in area under the CDF curve (Fig. 2B), and the consensus matrix (Fig. 2C), we finally selected k = 2 as the best cluster. Two different subgroups were identified, including 375 patients in cluster 1 and 155 patients in cluster 2. It is clear that patients in cluster 1 had a better OS than those in cluster 2 (P = 0.039, Fig. 2D). Two significantly different subgroups were further confirmed by the results of PCA as well as PCoA analysis (Fig. 2E–F). In addition, we compared the differences in clinical characteristics between cluster 1 and cluster 2. The results in Table 1 showed that patients in cluster 2 had a higher proportion of female and older patients (> 65 years), and there was no difference in the tumor stage between two subgroups.
Immunological landscape of Tex subtypes
First, we calculated the composition of immune cells between clsuter 1 and cluster 2 by the CIBERSORT algorithm (Fig. 3A). Comparing with cluster 1, cluster 2 had significantly more abundance of CD8 T cell and CD4 memory activated T cell. However, despite having higher CD4/CD8 T cell levels, patients in cluster 2 had worse prognosis than those in cluster 1. This was contradictory to the fact that high levels of tumor infiltrating CD4/CD8 T cells were generally associated with better prognosis. Therefore, we hypothesized that these CD4/CD8 T cells in cluster 2 were prone to exhaustion. Next, we compared the mRNA expression levels of immune checkpoint suppressor molecules in the two clusters. The results showed that PDCD1 (PD1), CD274 (PD-L1), CTLA4, LAG3, TIGIT, HAVCR2 and other immunosuppressive molecules were upregulated in cluster 2 (Fig. 3B). Then, we assessed the TME score using an estimation algorithm. The results displayed that cluster 2 had higher immune, stromal, and estimate scores than those in cluster 1 (Fig. 3C). In addition, we also found that T-cells exhaustion as well as T-cell dysfunction scores were significantly higher in cluster 2 than in cluster 1 (Fig. 3D–E). Finally, we performed GSEA analysis based on the Tex-related genes identified by WGCNA, and the results showed that cluster 2 was enriched in IL6/JAK/STAT3 signaling, interferon_gamma response, TNFA signaling via NFKB and inflammatory response (Fig. 4). Based on the above results, we inferred that cluster 2 mediated the suppressive immune microenvironment associated with T-cell exhaustion.
Machine learning identifies hub genes and functional analysis
First of all, we performed differential analysis of mRNA expression in cluster1 and cluster 2 based on the TCGA cohort to identify differential genes (Fig. 5A). Then, univariate cox regression analysis was utilized to screen differential genes that were associated with OS in TCGA and GSE72094 cohort. Venn diagram showed a total of 35 genes were identified as Tex related prognostic differential genes (Fig. 5B). Next, top 15 genes were selected using RF (Fig. 5C) and 11 genes were selected using LASSO (Fig. 5D–E). Finally, 9 intersected genes (including CPS1, FOSL1, GJB3, HLA-DOB, IGF2BP1, IGFBP1, KLK11, KRT6A, KRT81) were identified as hub genes (Fig. 5F). GO and KEGG analysis showed that these nine hub genes were involved in response to cAMP, carbon–nitrogen ligase activity, MHC class II receptor activity, gap junction channel activity and so on (all adjusted P < 0.05, Supplementary file 2: Fig. S2).
Construction of prognostic signature
The hub genes were enrolled in multivariate cox analysis to construct the prognostic signature: riskscore = (CPS1 × 0.0376) + (FOSL1 × 0.0731) + (GJB3 × 0.0641)—(HLA-DOB × 0.1590) + (IGF2BP1 × 0.0732) + (IGFBP1 × 0.1072)—(KLK11 × 0.0445) + (KRT6A × 0.0460) + (KRT81 × 0.0439)—0.2897. Patients in the TCGA cohort were classified as low- or high-risk group when the cutoff value was defined as the median riskscore (Fig. 6A). The Kaplan–Meier survival curves showed showed that the median OS of the low-risk group was longer than that of the low-risk group (P < 0.001, Fig. 6B). The time-dependent ROC curve displayed that the risk score had a good predictive performance, with an area under the curve (AUC) of 0.718, 0.703, 0.692 at 1, 2, and 3 years, respectively (Fig. 6C). In addition, when the riskscore was included in multivariate cox analysis combined with age, gender, and stage, it still was demonstrated as an independent prognostic factor for OS (HR = 1.904; 95% CI 1.389–2.612; P < 0.001; Fig. 6D). To further confirmed the reliability of the riskscore, we used the same formula to calculate the riskscore for patients in the GSE72094 cohort and used it for external validation. Similar to the previous results (Fig. 6E–H), the riskscore was an independent prognostic factor (HR = 1.990; 95% CI: 1.321–2.997; P < 0.001) as well and had good predictive power with an AUC at 1-, 2-, 3-year of 0.692, 0.709, 0.715. Moreover, we further stratified Tex subgroups based on riskscore and found that cluster 2 with high riskscore had the worst prognosis (Supplementary file 2: Fig. S3).
Correlation between the riskscore and clinical features
We explored relationships between the riskscore and clinical features in TCGA cohort and found that gender and TNM stage were significant correlated with the riskscore. Male, T3-4 stage, more lymph node metastases, distant metastases, and advanced stage patients tended to have higher riskscores (Fig. 7).
Construction and evaluation of nomogram model
The above results suggested that both tumor stage and risk score were independent prognostic factors. Thus, a nomogram was constructed to predict survival combined pathological stage with the riskscore (Fig. 8A). Calibration curves showed substantial agreement between the expected and actual probabilities of nomgram in predicting 1, 2, and 3-year survival (Fig. 8B). DCA curves showed the nomogram model had a greater net benefit in predicting the 3‐year OS (Fig. 8C). The time-dependent ROC curves showed a higher accuracy of the nomogram in predicting survival, with AUC values of 0.739, 0.716, 0.714 at 1, 2, and 3 years OS (Fig. 8D). Meanwhile, the model was validated using the GSE72094 cohort, and the results were similar to the results in TCGA cohort (Fig. 8E–G).
Immunotherapeutic response prediction
To assess ability of the riskscore as a biomarker for predicting immunotherapeutic response, we estimated the distribution of TME scores, IPS score, TIDE scores, TMB, immune cell infiltration scores and the mRNA expression of HLA and immune checkpoint inhibitors in different risk groups. The results of the TME score analysis indicated that the low-risk group had higher immune score (P < 0.001; Fig. 9A), stromal score (P = 0.05; Fig. 9B), and estimate score (P < 0.01; Fig. 9C). Further, TCIA database analysis revealed the low-risk group had higher IPS (P < 0.001; Fig. 9D). We also found that TIDE scores and TMB were higher in the high-risk group (both P < 0.001; Fig. 9E–F). In addition, our study results showed that the mRNA expression of PD-1 and PD-L1 were not significantly different between the two groups, except for CTLA4, which was significantly upregulated in the low-risk group. (Fig. 9G–I). We also observed that the low-risk group had significantly higher immune cell infiltration than the high-risk group, including activated CD8T cells, cytotoxic cells, dendritic cells, T helper cells, and so on (Supplementary file 2: Fig. S4). Lastly, we investigated the relationship between high- and low-risk groups and human leukocyte antigens (HLA). Except for HLA-A/B/C/F/G, the expression of HLA appeared to be higher in low-risk group (Supplementary file 2: Fig. S5).
To further validate that the riskscore could effectively predict immune efficacy response, we also included two immunotherapy cohorts to evaluate the riskscore model. GSE91061, which including 109 melanoma samples with anti-CTLA4 and anti-PD1 therapy demonstrated patients with complete response (CR) or partial response (PR) had a lower riskscore than those with stable disease (SD) or progressive disease (PD) (Fig. 10A). Moreover, GSE135222, which including 27 advanced NSCLC samples with anti-PD-1/PD-L1 confirmed that the low-risk group had better PFS compared to the high-risk group (Fig. 10B).
Chemotherapeutic drug sensitivity prediction
The IC50 values of common chemotherapy and targeted drugs for LUAD were calculated to further explored the drug sensitivity between low- and high-risk groups. By comparing the difference in IC50 values in two risk groups (Fig. 11), we found that patients in the low-risk group had a higher sensitivity to paclitaxel, docetaxel, doxorubicin, gefitinib, lapatinib, and tipifarnib while patients in high-risk group had a higher sensitivity to axitinib and methotrexate. Besides, patients in the high and low risk groups showed no significant difference in the sensitivity of cisplatin, etoposide, erlotinib and gefitinib.
Discussion
T-cell exhaustion is considered to be one of the most crucial reasons for immune resistance32. However, it is still not known what regulates T-cell exhaustion in the tumor microenvironment. Therefore, an in-depth exploration of key regulatory genes of T-cell exhaustion could help to understand the potential mechanisms of immune resistance and accordingly provide a theoretical basis for clinical decision making.
In the present study, we used WGCNA to identify the module of genes most related to T-cell exhaustion and T-cell dysfunction. Next, unsupervised clustering approach based on the expression levels of these genes divided patients into two different subgroups. Obviously, patients in Cluster 1 had better OS compared with patients in cluster 2. It was intriguing that patients in cluster 2 had higher abundance of CD8 T cell and CD4 memory activated T cell compared to group 1. This is contradictory to the fact that high levels of tumor-infiltrating CD4/CD8 T cells usually associated with good outcome33,34. Therefore, we hypothesized that these CD4/CD8 T cells in cluster 2 maybe exhausted T cells. Research had revealed that exhausted T cells often exhibit persistent high expression of multiple suppressive receptors, such as PD-1, CTLA-4, and LAG-313. In our study, we found that PD-1, CTLA4, LAG3, TIGIT, HAVCR2 and other immunosuppressive molecules were upregulated in cluster 2. In addition, GSEA analysis cluster 2 was enriched in IL6/JAK/STAT3 signaling, interferon_gamma response, TNF_alpha signaling via NFKB and inflammatory response. Studies had demonstrated that JAK/STAT signaling pathway35,36 the IFN-γsignaling pathway37 and TNF-ɑ signaling via NFKB38 caused immune escape by upregulating PD-L1 expression. Therefore, we inferred that cluster 2 might mediate the suppressive immune microenvironment.
Next, we further analyzed the significantly differentially expressed genes between the two groups and used machine learning algorithms to finally identify nine key regulatory genes, including CPS1, FOSL1, GJB3, IGF2BP1, HLA-DOB, IGFBP1, KRT6A, KLK11, KRT81, that may be involved in regulating T cell exhaustion. The enzyme carbamoyl phosphate synthetase 1 (CPS1) is a key rate-limiting enzyme in the urea cycle, involving in ammonium conversion and mediated arginine metabolism and pyrimidine metabolism39. T cell is highly sensitive to extracellular levels of arginine40, and arginine is essential for T cell function41,42. It has been shown that low arginine mediates an immunosuppressive microenvironment, which may suppress T-cell responses by providing a brake on T-cell proliferation43. In addition, CPS1 deficiency could lead to hyperammonemia, which impair mitochondrial function, reduce ATP synthesis, and increase free radical formation, leading to oxidative stress44, thereby possibly inducing T-cell exhaustion. FOSL1 encodes Fra-1, which is initially found to be highly expressed in solid tumors and is the member of the FOS family in activator protein (AP-1). Studies indicate that the AP-1 family members, including Fra-1, have essential effects in T cell development45. In melanoma, Fra-1 suppress the conversion of Treg cells into effector T cells under the regulation of Ubc13-IKK signaling axis46. In addition, Fra-1 is strongly associated with Epithelial-to-Mesenchymal Transition (EMT) in cancer cells46,47. It has been recently shown that EMT-related pathways could harm CD8 + T cell function, leading to immune evasion48. Moreover, a study from Lee et al. found a significant correlation between Fra-1 and PD-L1 expression, and high Fra-1 expression was associated with poorer overall survival49. Connexin 31 encoded by gap junction protein Beta 3 (GJB3), is one of the major members of the connexin family. Studies have indicated that connexin proteins mainly serve as channels to transport metabolites such as nucleotides, glutamate and glucose50. There is increasing evidence to support that metabolic changes in tumor cells affect the function of immune cells52. A study from Huo et al. reported that GJB3 promoted neutrophil survival and polarization by forming a channel between pancreatic tumor cells and neutrophils, transferring cyclic adenosine monophosphate (cAMP) from cancer to neutrophils53. Meanwhile, some studies suggested that cAMP could induce T-cell senescence54,55. Therefore, we hypothesized that GJB1 might be able to transfer cAMP to T cells in the same way which might cause T-cell exhaustion. The RNA modification N6-methyladenosine (m6A) has a significant value in the immune system56, and its dysregulation is associated with poor prognosis57. For example, a study showed that the m6A reader insulin-like growth factor-2 mRNA-binding protein 1 (IGF2BP1) could recognized the 30-UTR of PD-L1 mRNA and thus mediated stable PD-L1 expression58. The major histocompatibility complex, class II, DO beta (HLA-DOB) belonged to the HLA class II beta chain paralogue. It is demonstrated that HLA-II has antigen-presenting functions, participates in T-cell differentiation, and mediates the activation of T cells to provoke immune responses59. The insulin-like growth factor binding protein 1 (IGFBP1) has a vital action in regulating cell growth60. It had shown that ICFBP1 was expressed in T cells61. A research from Han et al. suggested that IGFBP1 had effects on CD4 + T cell immunomodulation62. Moreover, high expression of IGFBP1 is demonstrated to be closely associated with unfavorable OS in NSCLC63. Kallikrein related peptidase 11 (KLK11) is a member of the human KLK gene family, which is known to perform in a number of physiological processes, including extracellular matrix (ECM) remodeling, cell proliferation and differentiation64. Many studies have shown that KLK11 is aberrantly expressed in tumors and significantly correlated with survival65,66,67. The keratin 6A (KRT6A) and keratin 81 (KRT81) are the member of the keratin gene family. KRT6A gene overexpression in LUAD promotes lung cancer cell proliferation by EMT68. KRT81 has been identified as a promising biomarker for the identification of squamous cell lung cancer69.
The nine key genes described above may involved in T-cell exhaustion. Although the specific mechanisms have not yet been reported, we constructed a prognostic signature based on these genes that was confirmed to have some clinical significance. In the TCGA cohort, the prognostic signature was associated with OS, and patients in the low-risk group had a good prognosis. Then we performed external validation using the GSE72094 cohort to further confirm the general applicability of the prognostic signature. Moreover, we found that cluster 2 with high risk scores had the worst prognosis.
Immunotherapy had improved prognosis in lung cancer, but the benefit was limited. Only a minority of patients did benefit from ICIs6. One of the more widely utilized immune efficacy markers in clinical practice was PD-L1 expression. Numerous studies demonstrated that the expression level of PD-L1 is closely related to the efficacy of immunotherapy70,71. However, PD-L1 was not completely accurate in predicting immune efficacy. Hence, it was necessary to discover a signature that could predict the efficacy of immunotherapy. In our study, the prognostic signature we constructed may be useful to predict the immune response. Patients in the low-risk group had higher immune, stromal, and estimate scores, higher IPS, lower TIDE score which suggested a better response to immune checkpoint inhibitors (ICIs) therapy in the low-risk group. In the meantime, we included two separate immunotherapy cohorts, GSE135222 and GSE91061, which demonstrated better immune efficacy in the low-risk group. Besides, chemotherapy and targeted therapy were both the major strategies for the treatment of advanced LUAD. We found differences in the sensitivity of chemotherapeutic and targeted agents between high and low risk groups, suggesting that this prognostic signature might provide assistance in the selection of clinically sensitive agents for LUAD. Finally, the nomogram model combined with tumor stage and the riskscore could effectively predict the prognosis of LUAD patients.
Although our study had been successfully validated in an external cohort. However, there are several limitations in our study. Firstly, the study was based on publicly available databases and its training cohort (TCGA) and validation cohort (GEO) were retrospective. Therefore prospective studies are necessary to validate our conclusions. Secondly, there was a lack of molecular mechanism studies to investigate the functional role of candidate key genes, and in the future we will conduct further in vitro and in vivo experiments to confirm the potential regulatory mechanisms. Thirdly, in lack of a large cohort of lung cancer immunotherapy patients, our findings still need to be further validated by future prospective studies with larger samples.
Conclusions
In summary, we identified nine key genes (including CPS1, FOSL1, GJB3, HLA-DOB, IGF2BP1, IGFBP1, KLK11, KRT6A, KRT81) that may involved in the regulation of T-cell exhaustion and constructed a riskscore that could help predict immunotherapy response and the selection of chemotherapeutic and targeted agents. In addition, the nomogram built in combination with tumor stage and the riskscore may be a powerful tool for LUAD survival prediction.
Data availability
The datasets generated and analyzed during the current study are available in The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/), and Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/). Access to both databases is not required accession number. The datasets analyzed during the current study are available from the corresponding author on reasonable request.
References
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics, 2020. CA Cancer J. Clin. 70(1), 7–30 (2020).
Dela Cruz, C. S., Tanoue, L. T. & Matthay, R. A. Lung cancer: Epidemiology, etiology, and prevention. Clin. Chest Med. 32(4), 605–644 (2011).
Imielinski, M. et al. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell 150(6), 1107–1120 (2012).
Rittmeyer, A. et al. Atezolizumab versus docetaxel in patients with previously treated non-small-cell lung cancer (OAK): A phase 3, open-label, multicentre randomised controlled trial. Lancet 389(10066), 255–265 (2017).
Gettinger, S. N. et al. Overall survival and long-term safety of nivolumab (anti-programmed death 1 antibody, BMS-936558, ONO-4538) in patients with previously treated advanced non-small-cell lung cancer. J. Clin. Oncol. 33(18), 2004–2012 (2015).
Sui, H. et al. Anti-PD-1/PD-L1 therapy for non-small-cell lung cancer: Toward personalized medicine and combination strategies. J. Immunol. Res. 2018, 6984948 (2018).
Jin, M. Z. & Jin, W. L. The updated landscape of tumor microenvironment and drug repurposing. Signal Transduct. Target Ther. 5(1), 166 (2020).
Menares, E. et al. Tissue-resident memory CD8(+) T cells amplify anti-tumor immunity by triggering antigen spreading through dendritic cells. Nat. Commun. 10(1), 4401 (2019).
Hanahan, D. & Coussens, L. M. Accessories to the crime: Functions of cells recruited to the tumor microenvironment. Cancer Cell 21(3), 309–322 (2012).
Bremnes, R. M. et al. The role of tumor-infiltrating lymphocytes in development, progression, and prognosis of non-small cell lung cancer. J. Thorac. Oncol. 11(6), 789–800 (2016).
Thommen, D. S. & Schumacher, T. N. T cell dysfunction in cancer. Cancer Cell 33(4), 547–562 (2018).
Franco, F., Jaccard, A., Romero, P., Yu, Y. R. & Ho, P. C. Metabolic and epigenetic regulation of T-cell exhaustion. Nat. Metab. 2(10), 1001–1012 (2020).
McLane, L. M., Abdel-Hakeem, M. S. & Wherry, E. J. CD8 T cell exhaustion during chronic viral infection and cancer. Annu. Rev. Immunol. 37, 457–495 (2019).
Wherry, E. J. & Kurachi, M. Molecular and cellular insights into T cell exhaustion. Nat. Rev. Immunol. 15(8), 486–499 (2015).
Boyero, L. et al. Primary and acquired resistance to immunotherapy in lung cancer: Unveiling the mechanisms underlying of immune checkpoint blockade therapy. Cancers (Basel) 12(12), 3729 (2020).
Langfelder, P. & Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 9, 559 (2008).
Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics 26(12), 1572–1573 (2010).
Yu, G. W. L., Han, Y. & He, Q. Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 15(5), 284–287 (2012).
Liberzon, A. B. C., Thorvaldsdóttir, H., Ghandi, M., Mesirov, J. P. & Tamayo, P. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst. 1(6), 417–425 (2015).
Subramanian, A. T. P. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A. 102(43), 15545–15550 (2005).
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612 (2013).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12(5), 453–457 (2015).
Miao, Y. R. et al. ImmuCellAI: A unique method for comprehensive T-cell subsets abundance prediction and its application in cancer immunotherapy. Adv. Sci. (Weinh.) 7(7), 1902880 (2020).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15(12), 550 (2014).
Tibshirani, R. The lasso method for variable selection in the Cox model. Stat. Med. 16(4), 385–395 (1997).
Taylor, J. M. Random survival forests. J. Thorac. Oncol. 6(12), 1974–1975 (2011).
Jiang, P. et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat. Med. 24(10), 1550–1558 (2018).
Charoentong, P. et al. Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep. 18(1), 248–262 (2017).
Chan, T. A. Y. M. et al. Development of tumor mutation burden as an immunotherapy biomarker: Utility for the oncology clinic. Ann. Oncol. 30(1), 44–56 (2019).
Mayakonda, A., Lin, D. C., Assenov, Y., Plass, C. & Koeffler, H. P. Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28(11), 1747–1756 (2018).
Geeleher, P., Cox, N. & Huang, R. S. pRRophetic: An R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS ONE 9(9), e107468 (2014).
Wang, Z. & Wu, X. Study and analysis of antitumor resistance mechanism of PD1/PD-L1 immune checkpoint blocker. Cancer Med. 9(21), 8086–8121 (2020).
Schalper, K. A. et al. Objective measurement and clinical significance of TILs in non-small cell lung cancer. J. Natl. Cancer Inst. 107(3), dju435 (2015).
Al-Shibli, K. I. et al. Prognostic effect of epithelial and stromal lymphocyte infiltration in non-small cell lung cancer. Clin. Cancer Res. 14(16), 5220–5227 (2008).
Xiang, Z. et al. Dexamethasone suppresses immune evasion by inducing GR/STAT3 mediated downregulation of PD-L1 and IDO1 pathways. Oncogene 40(31), 5002–5012 (2021).
Arumuggam, N., Bhowmick, N. A. & Rupasinghe, H. P. A review: Phytochemicals targeting JAK/STAT signaling and IDO expression in cancer. Phytother. Res. 29(6), 805–817 (2015).
Qian, J. et al. The IFN-gamma/PD-L1 axis between T cells and tumor microenvironment: Hints for glioma anti-PD-1/PD-L1 therapy. J. Neuroinflamm. 15(1), 290 (2018).
Ju, X., Zhang, H., Zhou, Z., Chen, M. & Wang, Q. Tumor-associated macrophages induce PD-L1 expression in gastric cancer cells through IL-6 and TNF-a signaling. Exp. Cell Res. 396(2), 112315 (2020).
Morris, S. M. Jr. Regulation of enzymes of the urea cycle and arginine metabolism. Annu. Rev. Nutr. 22, 87–105 (2002).
Tarasenko, T. N., Gomez-Rodriguez, J. & McGuire, P. J. Impaired T cell function in argininosuccinate synthetase deficiency. J. Leukoc. Biol. 97(2), 273–278 (2015).
Ochoa, J. B. et al. Effects of L-arginine on the proliferation of T lymphocyte subpopulations. JPEN J. Parenter Enteral. Nutr. 25(1), 23–29 (2001).
Zea, A. H. et al. L-Arginine modulates CD3zeta expression and T cell function in activated human T lymphocytes. Cell Immunol. 232(1–2), 21–31 (2004).
Mussai, F. et al. Neuroblastoma arginase activity creates an immunosuppressive microenvironment that impairs autologous and engineered immunity. Cancer Res. 75(15), 3043–3053 (2015).
Kosenko, E., Venediktova, N., Kaminsky, Y., Montoliu, C. & Felipo, V. Sources of oxygen radicals in brain in acute ammonia intoxication in vivo. Brain Res. 981(1–2), 193–200 (2003).
Jochum, W. P. E. & Wagner, E. F. AP-1 in mouse development and tumorigenesis. Oncogene 20(19), 2401–2412 (2001).
Chang, J. H. X. Y. et al. Ubc13 maintains the suppressive function of regulatory T cells and prevents their conversion into effector-like T cells. Nat. Immunol. 13(5), 481–490 (2012).
Dhillon, A. S. & Tulchinsky, E. FRA-1 as a driver of tumour heterogeneity: A nexus between oncogenes and embryonic signalling pathways in cancer. Oncogene 34(34), 4421–4428 (2015).
Talotta, F., Casalino, L. & Verde, P. The nuclear oncoprotein Fra-1: A transcription factor knocking on therapeutic applications’ door. Oncogene 39(23), 4491–4506 (2020).
Chai, A. W. Y., Lim, K. P. & Cheong, S. C. Translational genomics and recent advances in oral squamous cell carcinoma. Semin. Cancer Biol. 61, 71–83 (2020).
Lee, M. H. Y. J. et al. FRA1 contributes to MEK-ERK pathway-dependent PD-L1 upregulation by KRAS mutation in premalignant human bronchial epithelial cells. Am. J. Transl. Res. 12(2), 409–427 (2020).
Sanchez, A., Castro, C., Flores, D. L., Gutierrez, E. & Baldi, P. Gap junction channels of innexins and connexins: Relations and computational perspectives. Int. J. Mol. Sci. 20(10), 2476 (2019).
Chang, C. H. et al. Metabolic competition in the tumor microenvironment is a driver of cancer progression. Cell 162(6), 1229–1241 (2015).
Huo, Y. et al. GJB3 promotes pancreatic cancer liver metastasis by enhancing the polarization and survival of neutrophil. Front Immunol. 13, 983116 (2022).
Ye, J. & Peng, G. Controlling T cell senescence in the tumor microenvironment for tumor immunotherapy. Oncoimmunology 4(3), e994398 (2015).
Ye, J. et al. TLR8 signaling enhances tumor immunity by preventing tumor-induced T-cell senescence. EMBO Mol. Med. 6(10), 1294–1311 (2014).
Shulman, Z. & Stern-Ginossar, N. The RNA modification N(6)-methyladenosine as a novel regulator of the immune system. Nat. Immunol. 21(5), 501–512 (2020).
Huang, H., Weng, H. & Chen, J. m(6)A modification in coding and non-coding RNAs: Roles and therapeutic implications in cancer. Cancer Cell 37(3), 270–288 (2020).
Ni, Z. et al. JNK signaling promotes bladder cancer immune escape by regulating METTL3-mediated m6A modification of PD-L1 mRNA. Cancer Res. 82(9), 1789–1802 (2022).
Adams, E. J. & Luoma, A. M. The adaptable major histocompatibility complex (MHC) fold: Structure and function of nonclassical and MHC class I-like molecules. Annu. Rev. Immunol. 31, 529–561 (2013).
Baxter, R. C. IGF binding proteins in cancer: Mechanistic and clinical insights. Nat. Rev. Cancer 14(5), 329–341 (2014).
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347(6220), 1260419 (2015).
Han, K. et al. Identification and validation of nutrient state-dependent serum protein mediators of human CD4(+) T cell responsiveness. Nutrients 13(5), 1492 (2021).
Wang, J., Hu, Z. G., Li, D., Xu, J. X. & Zeng, Z. G. Gene expression and prognosis of insulin-like growth factor-binding protein family members in non-small cell lung cancer. Oncol. Rep. 42(5), 1981–1995 (2019).
Borgono, C. A. & Diamandis, E. P. The emerging roles of human tissue kallikreins in cancer. Nat. Rev. Cancer 4(11), 876–890 (2004).
Sasaki, H. et al. Decreased kallikrein 11 messenger RNA expression in lung cancer. Clin. Lung Cancer 8(1), 45–48 (2006).
Patsis, C., Yiotakis, I. & Scorilas, A. Diagnostic and prognostic significance of human kallikrein 11 (KLK11) mRNA expression levels in patients with laryngeal cancer. Clin. Biochem. 45(9), 623–630 (2012).
Kolin, D. L. et al. Prognostic significance of human tissue kallikrein-related peptidases 11 and 15 in gastric cancer. Tumour Biol. 37(1), 437–446 (2016).
Yang, B. Z. W., Zhang, M., Wang, X., Peng, S. & Zhang, R. KRT6A promotes EMT and cancer stem cell transformation in lung adenocarcinoma. Technol. Cancer Res. Treat. 19, 1533033820921248 (2020).
Campayo, M. et al. A dual role for KRT81: A miR-SNP associated with recurrence in non-small-cell lung cancer and a novel marker of squamous cell lung carcinoma. PLoS ONE 6(7), e22509 (2011).
Garon, E. B. et al. Five-year overall survival for patients with advanced non-small-cell lung cancer treated with pembrolizumab: Results from the phase I KEYNOTE-001 study. J. Clin. Oncol. 37(28), 2518–2527 (2019).
Borghaei, H. et al. Nivolumab versus docetaxel in advanced nonsquamous non-small-cell lung cancer. N. Engl. J. Med. 373(17), 1627–1639 (2015).
Funding
The study was supported by the Joint Funds for the Innovation of Science and Technology of Fujian Province (2018Y9063).
Author information
Authors and Affiliations
Contributions
Y.H.W., B.D. and J.H.L. designed the study. M.Q.L., X.H.J. and C.L.L. participated in data collection and analysis. Y.H.W. and B.D. performed data analysis and wrote the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wu, Y., Du, B., Lin, M. et al. The identification of genes associated T-cell exhaustion and construction of prognostic signature to predict immunotherapy response in lung adenocarcinoma. Sci Rep 13, 13415 (2023). https://doi.org/10.1038/s41598-023-40662-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-40662-z
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.