Characterization of butyrate-metabolism in colorectal cancer to guide clinical treatment

Luo, Qinghua; Zhou, Ping; Chang, Shuangqing; Huang, Zhifang; Zeng, Xuebo

doi:10.1038/s41598-023-32457-z

Download PDF

Article
Open access
Published: 29 March 2023

Characterization of butyrate-metabolism in colorectal cancer to guide clinical treatment

Qinghua Luo¹,
Ping Zhou²,
Shuangqing Chang¹,
Zhifang Huang¹ &
…
Xuebo Zeng³

Scientific Reports volume 13, Article number: 5106 (2023) Cite this article

1653 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Colorectal cancer (CRC) is the third most prevalent one in the world among the most common malignant tumors. Numerous studies have shown that butyrate has demonstrated promise as an antitumor agent in a variety of human cancer types. However, butyrate remains understudied in CRC tumorigenesis and progression. In this study, we explored therapeutic strategies to treat CRC by examining the role of butyrate metabolism. First, from the Molecular Signature Database (MSigDB), we identified 348 butyrate metabolism-related genes (BMRGs). Next, we downloaded 473 CRC and 41 standard colorectal tissue samples from The Cancer Genome Atlas (TCGA) database and the transcriptome data of GSE39582 dataset from Gene Expression Omnibus (GEO) database. Then we evaluated the expression patterns of butyrate metabolism-related genes with difference analysis in CRC. Through univariate Cox regression and least absolute shrinkage and selection operator (LASSO) analysis, a prognostic model was constructed based on differentially expressed BMRGs. In addition, we discovered an independent prognostic marker for CRC patients. According to the expression levels and coefficients of identified BMRGs, the risk scores of all CRC samples were calculated. Utilizing differentially expressed genes in the high- and low-risk groups, we also constructed a Protein–Protein Interaction (PPI) network to visualize the interactions between proteins. Through the results of PPI network, we screened out differentially expressed target butyrate metabolism-related genes from ten hub genes. Finally, we performed clinical correlation analysis, immune cell infiltration analysis, and mutation analysis for these target genes. One hundred and seventy three differentially expressed butyrate metabolism-related genes were screened out in all the CRC samples. The prognostic model was established with univariate Cox regression and LASSO regression analysis. CRC patients’ overall survival was significantly lower in the high-risk group than in the low-risk group for both training and validation set. Among the ten hub genes identified from the PPI network, four target butyrate metabolism-related genes were identified containing FN1, SERPINE1, THBS2, and COMP, which might provide novel markers or targets for treating CRC patients. Eighteen butyrate metabolism-related genes were used to develop a risk prognostic model that could be helpful for doctors to predict CRC patients’ survival rate. Using this model, it is beneficial to forecast the response of CRC patients to immunotherapy and chemotherapy, thus making it easier to custom tailor cancer chemotherapy and immunotherapy to the individual patient.

Construction and validation of a prognostic model for colon adenocarcinoma based on bile acid metabolism-related genes

Article Open access 05 August 2023

Construction and validation of a metabolic risk model predicting prognosis of colon cancer

Article Open access 25 March 2021

Pan-cancer and multi-omics analyses revealed the diagnostic and prognostic value of BAZ2A in liver cancer

Article Open access 04 March 2024

Introduction

In 2020, a total of 1,148,515 new colon cancer patients were diagnosed around the world, or 6% of new cancer cases; 576,858 new colon cancer deaths were reported, representing 5.8% of all cancer deaths all over the world according to the World Health Organization report¹. With the rapid advancement of science and technology, medical treatment methods are changing rapidly. In addition to the two conventional faecal occult blood testing and colonoscopy, faecal genetic testing is now employed in pre-colorectal-cancer screening². In terms of common colorectal surgical procedures, Laparoscopic assisted surgery and da Vinci robot-assisted surgery are replacing orthodox open surgery³. In terms of postoperative chemotherapy regimens, patients with metastatic or advanced disease are often treated with biologic agents that reinforce cytotoxic therapy including bevacizumab and cetuximab, in addition to FOLFOX6 and CapeOX regimens⁴. Nevertheless, the prognosis of each CRC patient is highly heterogeneous, as their genetic characteristics and different risk factors can lead to inconsistent disease progression and varying therapy outcomes, particularly in cases of recurrent postoperative CRC, where surgery and chemotherapy are not beneficial^5,6. Besides, a poor prognosis is associated with CRC in its advanced stages⁷. More importantly, as a result of the rapid growth in developing countries, Arnold et al. have forecast that there will be 2.5 million cases of CRC in 2035⁸. Therefore, it is indispensable for us to hunt for a novel method to better predict the prognosis of CRC patients.

A short-chain fatty acid (SCFA), butyrate is formed by bacteria in the colon during fermentation of fibre, and is used by colonocytes for energy⁹. By promoting colonic motility, butyrate accelerates blood flow to the colon and reduces colonic anastomosis healing time indirectly¹⁰. The use of butyrate reduces intestinal permeability, improving intestinal mucosal barrier function and improving immune function, which is very beneficial for ulcerative colitis and other conditions that affect the intestinal wall¹¹. Butyrate plays an essential role in intestinal cancer by providing nutrition to healthy mucosa, promoting its proliferation, and maintaining good intestinal barrier condition by accelerating mucus production. In metabolomic and proteomic studies, researchers found that butyrate inhibited CRC cell proliferation by directly targeting pyruvate kinase M2 and subsequently reprogramming metabolism¹². Combining butyrate with the G protein-coupled receptor GPR109A can also suppress tumors in the colon¹³. Metabolic pathway analyses in CRC can provide us with a better understanding of the molecular mechanisms involved and provide us with new and more effective therapeutic approaches¹⁴. Currently, there are no systematic studies on butyrate metabolism-related genes in CRC.

In the present study, we constructed a prognostic model based on butyrate metabolism-related genes using univariate Cox regression and LASSO regression analysis, which was validated by the GEO-CRC cohort. CRC patients’ overall survival outcomes were comprehensively predicted and analyzed using this prognostic model. We also analyzed the differences between high-risk and low-risk CRC patients regarding immune cell infiltration, gene mutation, chemotherapeutic drug sensitivity, and immunotherapy efficacy. Next, we constructed a PPI network to identify ten hub genes utilizing differentially expressed genes in the high- and low-risk groups. Four target butyrate metabolism-related genes were identified from 10 hub genes, which might provide novel markers or targets for treating CRC patients. Further, we performed an in-depth comparison of these four genes in terms of survival prognosis, clinical characteristics and immune cells. The prognostic model has the potential to guide prognostic prediction and clinical medication of CRC patients.

Materials and methods

Data collection and collation

First, we identified a total of 348 butyrate metabolism-related genes from the Molecular Signature Database (MSigDB, https://www.gsea-msigdb.org/gsea/msigdb/index.jsp) by using “butyric acid” as a search term¹⁵. Subsequently, through the TCGA database (https://portal.gdc.cancer.gov/), we performed the download of raw RNA sequencing (RNA-seq) data profiles and obtained relevant data (survival status, follow-up time, sex, age, pathological stage, and TNM stage) for 473 CRC and 41 standard colorectal tissue samples. Through the GEO database (https://www.ncbi.nlm.nih.gov/geo/), we downloaded the transcriptional data and corresponding clinical data of CRC samples (gene ID: GSE39582). The gene IDs of the samples were converted to the related gene symbols using human gene annotation files.

Identification of differentially expressed butyrate metabolism-related genes

We set |Log2fold change (FC)|> 0.585 and false discovery rate (FDR) < 0.05 as a threshold to screen out differentially expressed butyrate metabolism-related genes through performing the “limma” package in R software.

Functional enrichment analysis of differentially expressed butyrate metabolism-related genes

To better investigate the biological characteristics and functional cellular pathways of differentially expressed butyrate metabolism-related genes, we performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis with p-value < 0.05 for statistical significance^16,17. Finally, we used two R packages “ggplot 2” and “goplot” to visualize the enrichment analysis results. GO enrichment analysis is divided into three major functional aspects, called GO types, including biological process (BP), cellular component (CC), molecular function (MF).

Identification of prognostic genes

We used TCGA-CRC cohort samples as a training set. In order to identify the prognostic differentially expressed genes associated with butyrate metabolism, we set a p-value lower than 0.05. The “survival” package was used to identify the association between gene expression levels and patients’ overall survival using univariate Cox regression analysis on differentially expressed butyrate metabolism-related genes of the training set. The number of genes was further reduced and gene collinearity was eliminated using LASSO Cox regression. Moreover, we analyzed the intrinsic relationship between mutation frequency and mutated genes in the training set using the “maftools” R package.

Principal component analysis

We used the “limma” package in R software for principal component analysis to better differentiate CRC patients between high and low-risk groups. We used the “ggplot2” package to obtain two-dimensional principal component analysis plots of the two principal components based on the expression profiles of genes related to butyrate metabolism and the gene expression profiles of the predictive risk score model, respectively.

Construction and validation of a prognostic model

The risk scores of all samples were calculated using the following equation based on the results of LASSO Cox regression: risk score = Coef₁ × ExpGene₁ + Coef₂ × ExpGene₂ + Coef₃ × ExpGene₃ + … + Coef_i × ExpGene_i.

“Coef” corresponds to the non-zero regression coefficient obtained by LASSO Cox regression analysis, and “ExpGene” corresponds to the expression value of the gene in the prognostic risk score model. All the CRC patients were divided into low-risk and high-risk groups based on the median risk score. Kaplan–Meier survival plots were calculated using the R package “survival”. In order to evaluate the prognostic ability of the model, a receiver operating characteristic (ROC) curve was generated using the timeROC package. We validated the results using samples from the GEO database simultaneously. The same formula used in the training set was applied to patient risk scores from the GEO cohort. Next, we investigated whether CRC risk score was an independent predicator of overall survival using univariate and multivariate independent prognostic analysis. Statistical significance was determined by a p-value less than 0.05.

Construction and evaluation of the nomogram of CRC patients

To build a predictive model based on independent clinical parameters that can be used intuitively to study the overall survival of individual CRC patients, we used the “nomogram” package in R for this process. The relationship between risk score and clinical information was explored using the “limma” package, which included gender, age, pathological stage, and TNM stage. Calibration plots and ROC curves were used to assess the predictive performance of this model. To determine whether nomogram could be a reliable independent prognostic indicator for CRC patients, univariate and multivariate independent analyses were performed. P-value of < 0.05 was used to show statistical significance.

Characteristics of patients in high- and low-risk groups

Using the “ggpubr” package, we obtained a correlation between patients’ risk score and tumor mutation frequency. To estimate the relationship between immune cell infiltration and risk score, we downloaded the immune cell infiltration files from Timer2.0 (http://timer.cistrome.org/), and then used the “limma” and “pheatmap” R packages for differential analysis. The “GSVA” and “GSEABase” packages were used to analyze the differences in immune-related functions between the high-risk and low-risk groups. To predict the effect of immunotherapy in the high and low-risk groups, we used the TIDE online database (http://tide.dfci.harvard.edu/). P-values of less than 0.05 indicated a statistically significant difference.

Identification of potential drugs for CRC patients

The “PRRophetic” R package was used to predict the semi-inhibitory concentrations of drugs in the high-risk and low-risk groups and identify drugs with differential efficacy.

Protein–protein interaction network and target gene characteristics

Using the STRING online database (https://cn.string-db.org/), the interaction network of these differentially expressed genes between high and low-risk groups were mapped (medium confidence > 0.40). Using the cubHubba plugin of Cytoscape software (version 3.9.1), we screened out the top 10 network core genes. Using the GEPIA (http://gepia.cancer-pku.cn) online database, differentially expressed target genes in tumor tissues and normal tissues were filtered from the top 10 core genes by setting threshold log2FC > 1 and p-value < 0.05. To analyze the infiltration of the target genes in 22 tumour-infiltrating lymphocytes in the microenvironment of high-risk and low-risk CRC patients, the CIBERSORTx online database (https://cibersortx.stanford.edu/index.php) was used, and then “reshape2 “reshape2” and “ggpubr” R packages were used to map out the differential results. The “limma” and “ggpubr” R packages were used to analyze the relationship between target gene expression and clinical characteristics (age, sex, grade, TNM stage) in CRC patients. Finally, we performed an in-depth study of the correlation between target genes using the GEPIA online database with Spearman test.

Results

Enrichment analysis of differentially expressed butyrate metabolism-related genes

Gene expression levels were compared between normal colorectal and cancerous tissue samples related to butyrate metabolism. We screened out a total of 173 differentially expressed butyrate metabolism-related genes between CRC and normal colorectal tissues, including 132 upregulated genes and 41 down-regulated genes. The distribution of these differentially expressed butyrate metabolism-related genes in normal and tumor samples were visualized by the heatmap (Fig. 1A). According to the visualization chart of GO enrichment analysis, we can see that in terms of BP, the differentially expressed butyrate metabolism-related genes were mostly gathered in nuclear-transcribed mRNA catabolic process, nuclear-transcribed mRNA catabolic process, deadenylation- dependent decay. In terms of CC, the differentially expressed butyrate metabolism-related genes were mostly gathered in nuclear exosome (RNase complex), cytoplasmic exosome (RNase complex), and exosome (RNase complex). In terms of MF, the differentially expressed butyrate metabolism-related genes were mostly gathered in 3' − 5' − exoribonuclease activity, exoribonuclease activity, producing 5' − phosphomonoesters, and exoribonuclease activity (Fig. 1B). According to the visualization of KEGG enrichment analysis, differentially expressed butyrate metabolism-related genes were mostly gathered in nuclear − transcribed mRNA catabolic process exonucleolytic 3' − 5', nuclear − transcribed mRNA catabolic process exonucleolytic, nuclear RNA surveillance (Fig. 1C). From the above results, we found that these differentially expressed butyrate metabolism-related genes in CRC are closely related to RNA metabolism.

Construction of a prognostic model in the train set

Our training set was made up of TCGA-CRC cohort samples. We selected 27 butyrate metabolism-associated genes associated with prognosis from the 173 differentially expressed butyrate metabolism-associated genes with p value < 0.05 using univariate Cox regression analysis (Fig. 2A). The somatic mutation profiles of CRC patients were drawn for 27 genes linked to butyrate metabolism. There were 112 mutations in butyrate metabolism-related genes in 447 CRC samples, resulting in a 25.06 percent frequency (Fig. 2B). The more apparent mutations could be observed in DNAH17, CDK5RAP2, IGF2BP1, CLCN3, SLC2A2, and PLCB2, while the corresponding mutation frequencies were 12%, 6%, 4%, 4%, 3% and 3%, respectively. The visualization of co-mutations revealed a mutational positive relationship between DNAH17 and most of the other genes (Fig. 2C). Then, eighteen genes (PTGDS, STC2, CDK5RAP2, ETS2, CALCOCO1, DNAH17, ENKD1, SLC2A2, IGF2BP1, FABP4, GDI1, TIMP1, ALAD, CLCN3, PRKAR2A, PYGL, CDK10, and HSPB1) were screened out to construct a prognostic model using LASSO Cox regression analysis (Fig. 2D,E). The prognostic model was built according to the following risk score formula: Riskscore = (0.0744 * PTGDS) + (0.1617 * STC2) + (0.3586 * CDK5RAP2) + (− 0.2655 * ETS2) + (0.0244 * CALCOCO1) + (0.2413 * DNAH17) + (0.0420 * ENKD1) + (− 0.7597 * SLC2A2) + (0.0762 * IGF2BP1) + (0.0308 * FABP4) + (0.0344 * GDI1) + (0.1029 * TIMP1) + (0.2256 * ALAD) + (− 0.0333 * CLCN3) + (− 0.4618 * PRKAR2A) + (0.0428 * PYGL) + (0.1531 * CDK10) + (0.0426 * HSPB1). The CRC samples in TCGA were distinguished between the low-risk and high-risk groups by this risk model (Fig. 2F,G).

The relationship between risk score and clinical features

In the constructed risk model, we divided CRC patients into two groups on the basis of their median risk scores in the training set and test set. The clinical prognosis of the low-risk group was found to be better than that of the high-risk group in both the training and test groups (p-value < 0.05) (Fig. 3A,B). Univariate and multivariate independent prognostic analysis were performed on the training set to investigate whether risk score could be an independent factor to predict the overall survival of CRC patients. The results indicated that age, T stage, and risk score could be independent factors to predict the overall survival of CRC patients (Fig. 3C,D). In assessing the reliability of the risk score of the prognostic model, we plotted receiver operating characteristic (ROC) curves and the area under the curves (AUCs) of CRC patients were 0.739, 0.753 and 0.727 for 1-, 3- and 5-year, respectively (Fig. 3E). The AUC value of the risk score was only slightly lower than the AUC value of the stage, which could indicate that the accuracy of the risk model is excellent (Fig. 3F). Furthermore, the prognostic model was validated in GSE39582 dataset. The result demonstrated that risk score and tumor stage were strongly correlated with overall survival of CRC patients based on univariate and multivariate Cox analyses (Fig. 4A,B). AUC values of CRC patients were 0.780, 0.816, 0.776 for 1-, 3- and 5-year, respectively (Fig. 4C). Meanwhile, Fig. 4D indicating that our prognostic model could have a higher prognostic value than traditional models. Considering the above results, we are able to conclude that risk score is a reliable independent indicator for predicting the overall survival rate of patients with CRC.

Construction and evaluation of nomogram

A nomogram including risk score, grade, TNM stage, age, and gender can predict the 1-year, 3-year, and 5-year survival rates of patients with CRC (Fig. 5A). A calibration curve at 1 year, 3 years, and 5 years demonstrated that the nomogram accurately predicted the overall survival of CRC patients (Fig. 5B). It was evident from the ROC curve that AUC value of nomogram (AUC = 0.816) was superior to any single indicator (Fig. 5C). The results of univariate and multivariate independent prognostic analysis suggest our nomogram could be an independent factor to predict the overall survival of CRC patients (p-value < 0.05) (Fig. 5D,E). In conclusion, the predictive accuracy of the presently constructed prognostic model is confirmed by the above aspects.

Immune-related characteristics between the low- and high-risk groups

The immune cells upregulated in the low-risk group were Plasma cells, CD4 memory resting T cells, resting Dendritic cells, activated Dendritic cells, and Eosinophils (p-value < 0.05), while upregulated immune cells in the high-risk group were CD8 T cells, Macrophages M0 (p-value < 0.05) (Fig. 6A). Immune function analysis showed that the high-risk group was active in HLA, Type II IFN Reponse immune-related functions (p-value < 0.05) (Fig. 6B). In the graph of the Tumor Immune Dysfunction and Exclusion (TIDE) score, the score in the high-risk group was higher than in the low-risk group (Fig. 6C), which could indicate that the high-risk group was more prone to immune escape, and their immunotherapy was less effective.

Potential drugs for CRC patients

In the treatment of patients with CRC, potential drugs may be special targets. In this study, we obtained a total of 58 drugs with statistical difference including 19 drugs more sensitive to the low-risk group and 39 drugs more sensitive to the low-risk group (Supplementary Table 1). The top 3 drugs more sensitive to the low-risk group were Erlotinib, GSK591, and AZD3759 (Fig. 7A,C,E), while the top 3 drugs more sensitive to the high-risk group were IGF1R, AZ960, and AZD1332 (Fig. 7G,I,K). Interestingly, the relationship between risk score and 3 drugs more sensitive to the low-risk group were positively correlated (Fig. 7B,D,F), while the relationship between risk score and 3 drugs more sensitive to the high-risk group were negatively correlated (Fig. 7H,J,L). Oxaliplatin, a commonly used chemotherapeutic agent in CRC patients, was more sensitive in the high-risk group (p-value < 0.05) (Fig. 7M), and the relationship between risk score and its chemotherapeutic drug sensitivity was negatively correlated (Fig. 7N). In conclusion, these drugs can be beneficial in treating patients with CRC by providing new targets.

PPI network of differentially expressed genes in low- and high-risk groups

We constructed a PPI network using differentially expressed genes in low- and high-risk groups (Fig. 8A). We used Cytoscape software with degree algorithm to identify ten hub genes including FN1, COL1A1, ACTC1, ACTG2, MYH11, SERPINE1, KRT5, THBS2, COMP, APOE (Fig. 8B). From ten hub genes, four differentially expressed target genes (FN1, SERPINE1, THBS2, COMP) were identified using the GEPIA database, which had significant statistical differences in the survival analysis and expression analysis (p-value < 0.05). The expression levels of these four genes were higher in tumor tissues than in normal tissues (Fig. 9A,C,E,G), and the high expression of these genes had poor prognosis (Fig. 9B,D,F,H). Further analysis of these four genes was carried out next. According to the results of immune cell infiltration, the immune cells that were upregulated in the gene COMP high expression group were Macrophages M2 and resting Dendritic cells, and the downregulated immune cells were memory activated T cells CD4 and activated Dendritic cells (Supplementary Fig. 1A); the immune cells that were upregulated in the gene FN1 high expression group were Macrophages M1, and the downregulated cells were naive B cells, memory B cells, and Plasma cells (Supplementary Fig. 1B); the immune cells that were upregulated in the SERPINE1 high expression group were Macrophages M0, Macrophages M0, Macrophages M2, activated Mast cells, Eosinophils Neutrophils, memory B cells, follicular helper T cells, resting Mast cells (Supplementary Fig. 1C); THBS2 high expression group The upregulated immune cells in the expression group were Macrophages M2 and resting Dendritic cells, and the downregulated immune cells were naive B cells, memory B cells, Plasma cells, activated Dendritic cells (Supplementary Fig. 1D). Next, we explored the relationship between gene expression and clinical features of these four genes (Supplementary Figs. 2–5). Clinical correlation analysis showed that in COMP, FN1, and SERPINE1 genes, the expression levels were significantly lower in stage I patients than in stage II, III and IV patients (p-value < 0.05) (Supplementary Fig. 2E, Supplementary Fig. 3E, Supplementary Fig. 5E). Mutation analysis revealed that only FN1 has significant differences between mutant and wild types (Supplementary Fig. 6). In the target gene correlation analysis, THBS2 and other three genes (COMP, FN1, SERPINE1) were positively correlated with each other with statistically significant differences (p-value < 0.001). The relationship between COMP and FN1, SERPINE1 were also positively correlated with each other with statistically significant differences (p-value < 0.001) (Supplementary Fig. 7). Based on the above findings, it can be concluded that the high expression group had a high level of immune cell infiltration and high levels of activity, which makes it suitable for immunotherapy.

Discussion

Colorectal cancer ranks second in mortality and third in morbidity worldwide according to statistics in 2021, which seriously affects human health^1,18. Current treatment options for CRC are mainly surgery, chemotherapy, radiotherapy and targeted therapy, which can improve the prognosis to some extent but also have limitations, especially for patients with advanced CRC with recurrence or distant metastases. Besides, it has been reported that immunotherapy-related therapeutic strategies had the ability to overcome the limitations of classical treatment¹⁹. Standard conventional treatments including chemotherapy and radiotherapy have many side effects owing to their cytotoxicity and unspecificity toward any cells are growing and dividing. Immunotherapy overcomes the issue of specificity which is the major problem in chemotherapy and radiotherapy. Cancer immunotherapy targets cancer antigens of tumor cells specifically, alerts the immune systems to the presence of foreign substances and eradicates cancer through the concert of immune responses. The normal cells with no cancer antigens are not affected²⁰. The principle of immunotherapy treatment is to inhibit cancer progression by activating the natural immune molecular components of the tumor microenvironment. According to reports, the cells with antitumor characteristics in the tumor microenvironment are CD8 + cytotoxic T cells, Th1 helper cells and their associated cytokines such as interferon²¹. In vivo, there is a high proportion of Th1, CD8 + T and effector memory T cells, indicating better prognosis²². A number of cytokines can be produced by Treg cells to enhance their ability to fight tumors²³. In this study, the results indicated that the TIDE score in the high-risk group was higher than in the low-risk group, which could indicate that the high-risk group was more prone to immune escape, and their immunotherapy was less effective. Previous studies showed that a high TIDE score indicates a high likelihood of antitumor immune escape²⁴. This is consistent with our above results. Thus, the treatment of patients with CRC may be improved by immunotherapy in the low-risk group, which is expected to have good efficacy.

Different types of cancer have different mechanisms for butyrate’s anticancer properties²⁵. By activating MCT4, butyrate enhances the antitumor activity of 3-bromopyruvate in vivo and inhibits breast cancer growth²⁶. The butyrate also inhibited bladder cancer cell migration by inducing autophagy and apoptosis²⁷. Furthermore, butyrate inhibits cellular activity, which results in hepatocellular carcinoma and glioblastoma carcinoma that progress more slowly²⁸. Butyrate has a strong relationship with CRC, and its anticancer and anti-inflammatory effects are evident in this disease^29,30,31. Research on the intrinsic oncogene pathways in cancer is in full swing. However, no predictive model for butyrate metabolism-related genes has been developed in CRC so far. In the present work, the TCGA-CRC cohort was used as the training set, and the GEO-CRC cohort was used as the test set for accuracy validation. AUC value of the risk model was 0.739, 0.753 and 0.727 for 1-, 3- and 5-year respectively, exhibiting superior performance than some other signatures in predicting the prognosis of CRC. In previous studies, ferroptosis-related gene signature was constructed in CRC, and AUC was 0.64, 0.64, and 0.71 for 1-, 3- and 5-year respectively³². A necroptosis-related risk score model was also identified, and AUC was 0.665, 0.712, and 0.758 for the 1-, 3-, and 5-year OS³³.

Past studies have found the intimate relationship between butyrate metabolism-related genes in the risk model and tumorigenesis and progression of CRC. As a secretory glycoprotein hormone, Stanniocalcin 2 (STC2) was involved in the progression and development of CRC through activating the Wnt/β-catenin signaling pathway and promoting CRC cell proliferation and migration³⁴. Previous study have demonstrated that RNA-binding protein CELF1 targeted ETS2 in colorectal cancer, contributing to tumor cell migration, invasion and promotion of chemoresistance³⁵. Insulin-like growth factor 2 RNA binding protein 1 (IGF2BP1/IMP1) has the ability to shape extracellular vesicles cargo in human CRC, and higher expression level of IGF2BP1 is correlated with poor clinical outcome³⁶. FABP4 expression was associated with E-cadherin and Snail expression in CRC tissue, indicating that FABP4 may promote CRC development related to epithelial-mesenchymal transition (EMT)³⁷. TIMP1 was found to be overexpressed in colon tissue and lymph node metastasis specimens, and suppression of TIMP1 expression inhibited proliferation, and metastasis but promote apoptosis through inducing FAK-PI3K/AKT and MAPK pathway³⁸.

This new risk prediction model can aid in predicting the treatment outcome for patients and analyze the relationship between target genes and immune cell infiltration, clinical characteristics. In immune cell infiltration analysis, the low-risk group was enriched in Plasma cells, CD4 memory resting T cells, resting Dendritic cells, activated Dendritic cells, and Eosinophils. With their potent antigen presenting ability, dendritic cells have long been considered a key component of antitumor immunity³⁹. Activated dendritic cells are key to the development of long-term and effective anticancer immunity⁴⁰. It is well documented that eosinophils infiltrate tumors, and in most cases, this translates into a better prognosis⁴¹. The high-risk group is active in HLA, Type_II_IFN_Reponse and there is a study that Type_II_IFN_Reponse is essential for immune editing of tumors⁴², which indicates that the high-risk group is suitable for immunotherapy. In conclusion, targeting different targets, immunotherapy may be beneficial for both high-risk and low-risk CRC patients.

Relying on the differentially expressed genes between high and low-risk groups, we cleverly mapped the PPI network, from which we filtered the top 10 core genes of the network. We finally obtained four target genes (FN1, SERPINE1, THBS2, COMP), which were significantly different in survival analysis and expression analysis (p-value < 0.05). It is thought that these four genes are high-risk genes since they are highly expressed in tumors, and a high level of expression of these genes is associated with poor prognoses. FN1 can promote embryogenesis and host defense, as well as inhibit apoptosis while promoting the growth of CRCs in combination with ITGA5⁴³. CRC aggressiveness is triggered by SERPINE1 activation by ARNTL2 and circadian rhythm variations in circulating PAI-1 levels⁴⁴. CRC cells migrate and invade more rapidly when THBS2 regulates the Wnt/β-linked protein signaling pathway⁴⁵. It has been reported that COMP promotes cell proliferation during the early stages of colon cancer tumorigenesis⁴⁶. According to these studies, the results are generally consistent with ours, confirming the accuracy of the model and its scientific validity. Moreover, the ability of these four genes to act on various immune cells suggests that immunotherapy may improve survival rates in patients with poor prognoses. However, further investigation of these results is necessary.

To conclude, the butyrate prognostic model is capable of evaluating butyrate metabolism patterns comprehensively. The risk score can be used to predict chemotherapy sensitivity, as well as the prognosis of CRC patients. An effective clinical follow-up strategy can be achieved through an understanding of the risk score and the clinical stage. The prognostic model of our study will help to promote the development of new diagnostic ideas, and facilitate the search for new therapeutic targets and prognostic molecular markers in the future.

Conclusion

In summary, a novel risk model for butyrate metabolism was developed based on data from the TCGA and GEO databases. The butyrate metabolism-related four target genes were closely related to the clinical stage and prognosis of patients with CRC and showed specificity in immune cell infiltration. It may be possible to use these four genes as biomarkers for individualized treatment of patients with CRC to improve their prognoses.

Data availability

This study relied on publicly available data. The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases contain this information.

References

Sung, H. et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71(3), 209–249 (2021).
Article PubMed Google Scholar
Dollinger, M. M., Behl, S. & Fleig, W. E. Early detection of colorectal cancer: A multi-center pre-clinical case cohort study for validation of a combined DNA stool test. Clin. Lab. 64(10), 1719–1730 (2018).
CAS PubMed Google Scholar
Hollandsworth, H. M. et al. Multiquadrant surgery in the robotic era: A technical description and outcomes for da Vinci Xi robotic subtotal colectomy and total proctocolectomy. Surg. Endosc. 34(11), 5153–5159 (2020).
Article PubMed Google Scholar
Gustavsson, B. et al. A review of the evolution of systemic chemotherapy in the management of colorectal cancer. Clin. Colorectal Cancer 14(1), 1–10 (2015).
Article CAS PubMed Google Scholar
Sasaki, N. & Clevers, H. Studying cellular heterogeneity and drug sensitivity in colorectal cancer using organoid technology. Curr. Opin. Genet. Dev. 52, 117–122 (2018).
Article CAS PubMed Google Scholar
Sagaert, X., Vanstapel, A. & Verbeek, S. Tumor heterogeneity in colorectal cancer: What do we know so far?. Pathobiology 85(1–2), 72–84 (2018).
Article CAS PubMed Google Scholar
Tang, J. et al. LncRNA GLCC1 promotes colorectal carcinogenesis and glucose metabolism by stabilizing c-Myc. Nat. Commun. 10(1), 3499 (2019).
Article ADS PubMed PubMed Central Google Scholar
Arnold, M. et al. Global patterns and trends in colorectal cancer incidence and mortality. Gut 66(4), 683–691 (2017).
Article PubMed Google Scholar
Astbury, S. M. & Corfe, B. M. Uptake and metabolism of the short-chain fatty acid butyrate, a critical review of the literature. Curr. Drug Metab. 13(6), 815–821 (2012).
Article CAS PubMed Google Scholar
Velázquez, O. C., Lederer, H. M. & Rombeau, J. L. Butyrate and the colonocyte. In Dietary Fiber in Health and Disease (eds Kritchevsky, D. & Bonfield, C.) 123–134 (Springer, 1997).
Chapter Google Scholar
Shen, Z. H. et al. Relationship between intestinal microbiota and ulcerative colitis: Mechanisms and clinical application of probiotics and fecal microbiota transplantation. World J. Gastroenterol. 24(1), 5–14 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, Q. et al. Butyrate suppresses the proliferation of colorectal cancer cells via targeting pyruvate kinase M2 and metabolic reprogramming. Mol. Cell. Proteomics 17(8), 1531–1545 (2018).
Article CAS PubMed PubMed Central Google Scholar
Thangaraju, M. et al. GPR109A is a G-protein-coupled receptor for the bacterial fermentation product butyrate and functions as a tumor suppressor in colon. Cancer Res. 69(7), 2826–2832 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tabe, Y., Konopleva, M. & Andreeff, M. Fatty acid metabolism, bone marrow adipocytes, and AML. Front. Oncol. 10, 155 (2020).
Article PubMed PubMed Central Google Scholar
Liberzon, A. et al. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst. 1(6), 417–425 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28(1), 27–30 (2000).
Article CAS PubMed PubMed Central Google Scholar
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44(D1), D457–D462 (2016).
Article CAS PubMed Google Scholar
Li, N. et al. Incidence, mortality, survival, risk factor and screening of colorectal cancer: A comparison among China, Europe, and Northern America. Cancer Lett. 522, 255–268 (2021).
Article CAS PubMed Google Scholar
Signorini, L., Delbue, S., Ferrante, P. & Bregni, M. Review on the immunotherapy strategies against metastatic colorectal carcinoma. Immunotherapy 8(10), 1245–1261 (2016).
Article CAS PubMed Google Scholar
Johdi, N. A. & Sukor, N. F. Colorectal cancer immunotherapy: Options and strategies. Front. Immunol. 11, 1624 (2020).
Article CAS PubMed PubMed Central Google Scholar
Taube, J. M. et al. Implications of the tumor immune microenvironment for staging and therapeutics. Mod. Pathol. 31(2), 214–234 (2018).
Article CAS PubMed Google Scholar
Pages, F. et al. Immune infiltration in human tumors: A prognostic factor that should not be ignored. Oncogene 29(8), 1093–1102 (2010).
Article CAS PubMed Google Scholar
Ferreira, C. et al. Type 1 Treg cells promote the generation of CD8(+) tissue-resident memory T cells. Nat. Immunol. 21(7), 766–776 (2020).
Article CAS PubMed Google Scholar
Fu, J. et al. Large-scale public data reuse to model immunotherapy response and resistance. Genome Med. 12(1), 21 (2020).
Article PubMed PubMed Central Google Scholar
O’Keefe, S. J. Diet, microorganisms and their metabolites, and colon cancer. Nat. Rev. Gastroenterol. Hepatol. 13(12), 691–706 (2016).
Article CAS PubMed PubMed Central Google Scholar
Queiros, O. et al. Butyrate activates the monocarboxylate transporter MCT4 expression in breast cancer cells and enhances the antitumor activity of 3-bromopyruvate. J. Bioenerg. Biomembr. 44(1), 141–153 (2012).
Article CAS PubMed Google Scholar
Wang, F. et al. Sodium butyrate inhibits migration and induces AMPK-mTOR pathway-dependent autophagy and ROS-mediated apoptosis via the miR-139-5p/Bmi-1 axis in human bladder cancer cells. FASEB J. 34(3), 4266–4282 (2020).
Article CAS PubMed Google Scholar
Joachimiak, R., Kaznica, A. & Drewa, T. Influence of sodium butyrate on hepatocellular carcinoma (hepG2) and glioblastoma (C6) cell lines in vitro. Acta Pol. Pharm. 64(6), 561–563 (2007).
CAS PubMed Google Scholar
Hassig, C. A., Tong, J. K. & Schreiber, S. L. Fiber-derived butyrate and the prevention of colon cancer. Chem. Biol. 4(11), 783–789 (1997).
Article CAS PubMed Google Scholar
Velazquez, O. C. & Rombeau, J. L. Butyrate: Potential role in colon cancer prevention and treatment. Adv. Exp. Med. Biol. 427, 169–181 (1997).
Article CAS PubMed Google Scholar
Ooi, C. C. et al. Structure-activity relationship of butyrate analogues on apoptosis, proliferation and histone deacetylase activity in HCT-116 human colorectal cancer cells. Clin. Exp. Pharmacol. Physiol. 37(9), 905–911 (2010).
Article CAS PubMed Google Scholar
Hu, D., Zhou, Z., Wang, J. & Zhu, K. Screening of ferroptosis-related genes with prognostic effect in colorectal cancer by bioinformatic analysis. Front. Mol. Biosci. 9, 979854 (2022).
Article CAS PubMed PubMed Central Google Scholar
Yang, W. et al. Integrated analysis of necroptosis-related genes for evaluating immune infiltration and colon cancer prognosis. Front. Immunol. 13, 1085038 (2022).
Article CAS PubMed PubMed Central Google Scholar
Li, Q., Zhou, X., Fang, Z. & Pan, Z. Effect of STC2 gene silencing on colorectal cancer cells. Mol. Med. Rep. 20(2), 977–984 (2019).
CAS PubMed PubMed Central Google Scholar
Wang, H. et al. RNA-binding protein CELF1 enhances cell migration, invasion, and chemoresistance by targeting ETS2 in colorectal cancer. Clin. Sci. (Lond.) 134(14), 1973–1990 (2020).
Article CAS PubMed Google Scholar
Kuhn, M. et al. IMP1/IGF2BP1 in human colorectal cancer extracellular vesicles. Am. J. Physiol. Gastrointest. Liver Physiol. 323(6), G571–G585 (2022).
Article CAS PubMed Google Scholar
Zhang, Y. et al. High expression of FABP4 in colorectal cancer and its clinical significance. J. Zhejiang Univ. Sci. B 22(2), 136–145 (2021).
Article CAS PubMed PubMed Central Google Scholar
Song, G. et al. TIMP1 is a prognostic marker for the progression and metastasis of colon cancer through FAK-PI3K/AKT and MAPK pathway. J. Exp. Clin. Cancer Res. 35(1), 148 (2016).
Article PubMed PubMed Central Google Scholar
Veglia, F. & Gabrilovich, D. I. Dendritic cells in cancer: The role revisited. Curr. Opin. Immunol. 45, 43–51 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gilardini Montani, M. S. et al. Capsaicin-mediated apoptosis of human bladder cancer cells activates dendritic cells via CD91. Nutrition 31(4), 578–581 (2015).
Article CAS PubMed Google Scholar
Davis, B. P. & Rothenberg, M. E. Eosinophils and cancer. Cancer Immunol. Res. 2(1), 1–8 (2014).
Article PubMed Google Scholar
Di Franco, S., Turdo, A., Todaro, M. & Stassi, G. Role of type I and II interferons in colorectal cancer and melanoma. Front. Immunol. 8, 878 (2017).
Article PubMed PubMed Central Google Scholar
Cai, X. et al. Down-regulation of FN1 inhibits colorectal carcinogenesis by suppressing proliferation, migration, and invasion. J. Cell. Biochem. 119(6), 4717–4728 (2018).
Article CAS PubMed Google Scholar
Mazzoccoli, G. et al. ARNTL2 and SERPINE1: Potential biomarkers for tumor aggressiveness in colorectal cancer. J. Cancer Res. Clin. Oncol. 138(3), 501–511 (2012).
Article CAS PubMed Google Scholar
Qu, H. L., Hasen, G. W., Hou, Y. Y. & Zhang, C. X. THBS2 promotes cell migration and invasion in colorectal cancer via modulating Wnt/beta-catenin signaling pathway. Kaohsiung J. Med. Sci. 38(5), 469–478 (2022).
Article CAS PubMed Google Scholar
Nfonsam, V. N. et al. Cartilage oligomeric matrix protein (COMP) promotes cell proliferation in early-onset colon cancer tumorigenesis. Surg. Endosc. 34(9), 3992–3998 (2020).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank the TCGA and the GEO database for allowing them to upload valuable datasets.

Author information

Authors and Affiliations

Department of Anorectal Surgery, Jiangmen Wuyi Hospital of Traditional Chinese Medicine, Jiangmen, China
Qinghua Luo, Shuangqing Chang & Zhifang Huang
Department of Anorectal Surgery, Jiangxi Hospital of Integrated Traditional Chinese and Western Medicine, Nanchang, China
Ping Zhou
Department of Brain Diseases, Shenzhen Pingle Orthopaedic Hospital, Shenzhen, China
Xuebo Zeng

Authors

Qinghua Luo
View author publications
You can also search for this author in PubMed Google Scholar
Ping Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shuangqing Chang
View author publications
You can also search for this author in PubMed Google Scholar
Zhifang Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xuebo Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.L. and P.Z. designed the implementation of the research, drafted preliminary papers, and participated in investigations. S.C., Z.H. and X.Z. participated in research design and implementation, manuscript revision, and manuscript submission. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Qinghua Luo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Figure 1.

Supplementary Figure 2.

Supplementary Figure 3.

Supplementary Figure 4.

Supplementary Figure 5.

Supplementary Figure 6.

Supplementary Figure 7.

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Luo, Q., Zhou, P., Chang, S. et al. Characterization of butyrate-metabolism in colorectal cancer to guide clinical treatment. Sci Rep 13, 5106 (2023). https://doi.org/10.1038/s41598-023-32457-z

Download citation

Received: 17 November 2022
Accepted: 28 March 2023
Published: 29 March 2023
DOI: https://doi.org/10.1038/s41598-023-32457-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.