Genomic-driven nutritional interventions for radiotherapy-resistant rectal cancer patient

Southern, Joshua; Gonzalez, Guadalupe; Borgas, Pia; Poynter, Liam; Laponogov, Ivan; Zhong, Yoyo; Mirnezami, Reza; Veselkov, Dennis; Bronstein, Michael; Veselkov, Kirill

doi:10.1038/s41598-023-41833-8

Download PDF

Article
Open access
Published: 08 September 2023

Genomic-driven nutritional interventions for radiotherapy-resistant rectal cancer patient

Joshua Southern¹^na1,
Guadalupe Gonzalez^1,2^na1,
Pia Borgas³,
Liam Poynter⁴,
Ivan Laponogov⁴,
Yoyo Zhong⁴,
Reza Mirnezami⁵,
Dennis Veselkov¹,
Michael Bronstein⁶ &
…
Kirill Veselkov^2,7

Scientific Reports volume 13, Article number: 14862 (2023) Cite this article

820 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Radiotherapy response of rectal cancer patients is dependent on a myriad of molecular mechanisms including response to stress, cell death, and cell metabolism. Modulation of lipid metabolism emerges as a unique strategy to improve radiotherapy outcomes due to its accessibility by bioactive molecules within foods. Even though a few radioresponse modulators have been identified using experimental techniques, trying to experimentally identify all potential modulators is intractable. Here we introduce a machine learning (ML) approach to interrogate the space of bioactive molecules within food for potential modulators of radiotherapy response and provide phytochemically-enriched recipes that encapsulate the benefits of discovered radiotherapy modulators. Potential radioresponse modulators were identified using a genomic-driven network ML approach, metric learning and domain knowledge. Then, recipes from the Recipe1M database were optimized to provide ingredient substitutions maximizing the number of predicted modulators whilst preserving the recipe’s culinary attributes. This work provides a pipeline for the design of genomic-driven nutritional interventions to improve outcomes of rectal cancer patients undergoing radiotherapy.

A distinct Fusobacterium nucleatum clade dominates the colorectal cancer niche

Article Open access 20 March 2024

Microbiota in health and diseases

Article Open access 23 April 2022

Two-year effects of semaglutide in adults with overweight or obesity: the STEP 5 trial

Article Open access 10 October 2022

Introduction

Mesorectal excision is the surgical standard of care in rectal cancer (RC)¹. The additive benefit of radiotherapy (RT) in reducing local recurrence in advanced RC has been extensively documented^2,3,4,5. However, there is considerable variability in radioresponse across patients, with patients showing either (1) complete tumor destruction, (2) moderate tumor regression, or (3) negligible tumor shrinkage. For patients in the last category, the delay in proceeding to tumor excision while completing RT may increase the likelihood of distant metastases, therefore, modulation of radioresponse to improve RT outcomes is a critical need.

RT response is governed by various molecular mechanisms including response to stress, cell death, and cell metabolism⁶. Current strategies to improve radioresponse focus on the modulation of cell death and response to stress using chemotherapies, such as fluorouracil (5-FU), capecitabine, gemcitabine, and cisplatin, to enhance tumor sensitivity to RT⁷. However, combined therapy often produces mixed results and can increase toxicity in normal tissues⁷. In contrast, bioactive molecules within foods appear as a promising alternative to modulate radioresponse, through lipid metabolism modulation^8,9. Proteins corresponding to up-regulated genes in RT-resistant RC patients participate in lipid biosynthetic and metabolic pathways with various roles (Fig. 1A). The up-regulation of most of these genes translates into increased lipid availability, which leads to a myriad of downstream tumor-promoting effects. For example, CDS1- and CDS2-encoded proteins regulate growth and maturation of lipid droplets which serve as storage, providing nutrients necessary for cell growth, and can serve as additional nutrients for the uncontrolled growth of cancerous cells¹⁰. Moreover, over-expression of PLA2G5, the ELOVL family of genes, FASN and the PLP family of genes translates into increased lipid availability leading to downstream activation of inflammation and stress pathways. Proteins encoded by these genes increase lipid availability through different mechanisms: PLA2G5-encoded protein through the generation of lysophospholipids and free fatty acids, including arachidonic acid^10,11; encoded proteins by the ELOVL family of genes through the elongation of long chain fatty acids to provide precursors for synthesis of sphingolipids and ceramides^10,12; FASN-encoded protein through the synthesis of long-chain fatty acids¹⁰; and encoded proteins by the PLP family of genes through the hydrolysis and uptake of lipids from extracellular space^10,13. Increased lipid availability in cancer cells can also lead to increased immunosuppressive properties, as is the case with PTDSS1 over-expression, whose encoded protein catalyzes the formation of phosphatidylserine which, exposed on the surface of tumor cells, increases their immunosuppressive properties and facilitates tumor growth and metastasis^10,14. On the other hand, PTEN has documented tumor-suppressing properties¹⁰. Loss of PTEN leads to elevated de novo lipogenesis through induction of SREBP and FASN expression¹⁵. Therefore, over-expression of PTEN in this context might be a compensatory mechanism to inhibit FASN in an attempt to decrease lipogenesis.

Bioactive molecules in food can modulate lipid metabolism, have a promising safety profile in toxicity studies, and have documented chemopreventive and chemotherapeutic effects^16,17,18. This means dietary interventions could be a promising strategy to increase treatment efficacy, prevent resistance acquisition and reduce side effects¹⁹. However, experimental large-scale testing of chemotherapeutic or chemopreventive properties of bioactive molecules within food is not generally feasible due to a large number of food-based bioactive molecules. As a result, a unique wave of research has leveraged network machine learning (ML) and genomic data to carry out a large-scale screening of anticancer molecules within food^20,21,22. Building on these works, we propose a computational genomic-driven approach to mine the space of bioactive molecules within food for potential radioresponse modulators and propose phytochemically-enriched recipes to improve radioresponse of RC patients.

The proposed pipeline, shown in Fig. 1, comprises (1) Identifying over-expressed proteins in RT-resistant RC patients (Fig. 1A) (2) a radioresponse modulators identification module (Fig. 1B) and (3) a recipe generation module (Fig. 1C). In order to identify radioresponse modulators, we map food molecule protein-coding gene targets and RT resistance dysregulated genes onto a heterogeneous network representing proteins and biological functions. Using a network propagation algorithm combined with metric learning, we learn effects of food molecules and the phenotype across the heterogeneous network, and find food molecules with similar effects to those observed in the phenotype. The third stage involves recipe optimization to maximise the number of ingredients with these molecules. Dietary recommendations can then be proposed for RT-resistant RC patients using these recipes and taking into account other user-specific requirements such as taste preferences and allergies.

Results

Random walks and metric learning to predict drug-phenotype associations

We compute propagated profiles of drugs and diseases on the multiscale-interactome using random-walk with restarts and then use metric learning to minimise the distance between a disease and drugs that treat this disease and maximise the distance between a disease and drugs with no known benefit. To show the gain of combining metric learning with the random walk algorithm, we evaluated the improvement on the multiscale-based drug-disease prediction task proposed in²³. We show that the addition of metric learning improves the random walk diffusion profiles resulting in a 20% increase in performance (\(AUROC = 0.714\ vs\ 0.876\)). Additionally, the choice of the restart probability only has a small effect on the results in the initial implementation and no influence when combined with metric learning. These results, shown in Fig. 2, confirm the benefit of fixing the restart probability and instead of optimising the weights of the walker, optimizing an MLP by directly back-propagating information from the prediction task using a triplet loss function.

The model identifies molecules with therapeutic potential to reverse RT resistance

Using propagated profiles, we find the top 100 food molecules closest to the phenotype. These molecules affect similar proteins and biological functions as those responsible for radioresistance, however, diffusion profiles do not provide information whether the modulation is positive or negative. Experimental evidence indicates that the phenotype-associated genes are over-expressed in patients exhibiting RT resistance leading to a positive modulation of lipid metabolism (Fig. 1A). Therefore, we use domain knowledge and literature search to filter out identified molecules with positive regulatory effects on lipid metabolism, leaving 33 modulators to retrieve the list of ingredients (Appendix A). Modulators belong to a myriad of compound classes including flavonoids, isoflavonoids, and bezenoids, in alignment with the current knowledge on chemotherapeutic bioactive molecules within foods¹¹. Overall, predicted modulators are involved in cell signaling, cell growth and lipid metabolism. For example, Mangiferol and Dihydrosphingosine modulate downstream effects linked to fatty acid biosynthetic and elongation pathways, down-regulating stress and inflammation processes (Figure 3). Additionally, we have compiled a list of ingredients with the highest number of modulators (Appendix B).

Highest scoring foods modulating RT response

In order to validate the recipes, we explored the mechanisms by which the substituted ingredients could modulate radiotherapy response. The tables in appendices A and B give a more extensive treatment of the RT response modulators within food and their potential mechanism for modulation. In Fig. 4, we show how a chicken korma recipe is mutated by substituting kale for spinach and new potatoes for beetroot. Whilst it is difficult to evaluate these substitutions from a culinary perspective, the substitutions do increase the number of potential radioresponse modulators. New potatoes contain none of the found potential modulators, whereas beetroot contains both kaempferol and syringic acid. It has been shown that syringic acid-treated cells developed anti-cancer activities by losing MMP, cell viability, and enhancing intracellular ROS and kaempferol has been shown to be a potential chemo-therapeutic agent to be used alone or in combination with 5-FU to overcome colon cancer drug resistance²⁴²⁵. Additionally, spinach also contains kaempferol as well as alpha-lipoic acid. Alpha-lipoic acid can effectively induce apoptosis in human colon cancer cells by a mechanism that is initiated by an increased uptake of oxidizable substrates into mitochondria²⁶. The addition of these molecules in the recipe, which have been found using our drug-disease association model, and have demonstrated chemotherapeutic effects could be beneficial to radiotherapy-resistant rectal cancer patient as an added measure alongside their standard treatment.

Discussion

In 2017, dietary risk factors were attributed to approximately 11 million deaths globally, equivalent to about 1 in 5 deaths²⁷. This stark statistic emphasises the global need for dietary improvements. Furthermore, evidence has mounted on the potential benefits of drug-like molecules in foods against diseases such as cancer^28,29, Covid-19³⁰ and other health conditions³¹. The prospect of dietary recommendations both for the general population and patients with specific diseases becomes increasingly important. We delved into understanding the role of bioactive food molecules as potential modulators of radiotherapy response. This was achieved by expanding a drug-disease prediction model based on RWR with metric learning, pinpointing radioresponse modulators and showcasing enhanced results on a benchmark dataset. The integration of these analytical methodologies is pivotal; it not only facilitates a comprehensive understanding of intricate interactions but also combines the strengths of prediction and metric learning, ensuring a system-wide appraisal of the potential therapeutic influence of bioactive food molecules on radiotherapy efficacy. By utilising propagated profiles from our model, we identified radioresponse modulators in food, subsequently integrating this with experimental evidence from literature reviews to determine the modulation direction - either positive or negative. It is important to acknowledge, however, that while our findings are encouraging, the model’s transfer-ability may necessitate further validations. This arises from discrepancies, albeit reasonable, in data distribution between the dataset for optimisation of propagation weights and the datasets for food molecules and phenotypes (Appendix C). In this study, we adopted an assumption of direct correspondence between effects on genes and proteins, neglecting potential post-translational modifications. These modifications could be profoundly influenced by dietary intake and merit further exploration in subsequent studies. In terms of advancements, future iterations of the recipe recommendation module could contemplate the de novo creation of recipes using text or cooking graph representations, surpassing current NLP-based models. Furthermore, the optimal timing for dietary interventions, aimed at maximising radiotherapy outcomes, was beyond our current scope but warrants attention, potentially encompassing clinical trials assessing the interplay between dietary intervention timing and therapy outcomes. The current work, in general, provides a framework for the discussion of methodological approaches for the task of modulating radioresponse using bioactive molecules within foods. We consider this work as a first milestone approach in the design of genome-guided phytochemically-enriched recipes to improve RT outcomes in RC patients and envision its use as a baseline for future work. Our approach, centred on lipid metabolism modulation, offers a novel avenue to augment radiotherapy outcomes. Nevertheless, individual biological and health variations signify that it might not universally benefit all patients. Aspects like obesity and BMI, which intrinsically modify lipid metabolism and various physiological processes, could dictate the intervention’s effectiveness. In such instances, personalised strategies, ranging from dietary modifications to manage weight to pharmacological measures addressing obesity-related comorbidities, might be indispensable. By employing machine learning, our study enables recipe adjustments in line with identified potential radiotherapy modulators. This presents an opportunity for bespoke recipe alterations aligning with individual patient requirements, considering elements like obesity and BMI. Such a comprehensive, personalised treatment paradigm accentuates the essence of optimising radiotherapy outcomes and overall patient health. The flexibility of our approach also encompasses patient-specific data such as allergies, cost considerations, food preferences, and concurrent treatments, ensuring dietary compatibility and synergy.

Contrasting with gut microbiota modification strategies, our method prioritises direct dietary alterations aimed at cellular mechanisms, including lipid metabolism, rather than reshaping the gut microbiome. Nevertheless, dietary effects on gut microbiota composition and functionality are undeniable and can sway health outcomes, including therapy responses. Since the gut microbiota orchestrates the bioavailability of bioactive food molecules, these two strategies might be synergistically combined for a comprehensive therapeutic approach encompassing both cellular mechanisms and microbial interactions. Our proposed pipeline possesses the adaptability to address any disease given the knowledge of target genes, offering a holistic framework for recipe recommendations that complement prevailing treatment standards. We foresee evaluating these findings via clinical trials, providing participants with enriched recipes and evaluating dietary intervention impacts through outcomes such as progression-free survival (PFS) or disease-free survival (DFS). Moreover, the approach, although primarily centred on radiotherapy for rectal cancer, hints at the broader applicability, extending possibly to other therapeutic modalities or diseases.

Conclusion

We introduce a network machine learning pipeline for predicting radioresponse modulators within foods and generating recipes to enhance RT response in RC patients. For the identification of radioresponse modulators within foods, we adopted a genomic-driven approach, hypothesising that these modulators should exhibit similar effects on protein networks as those observed in RT-resistant RC patients. To model the genomic effects of food molecules and phenotype, we integrated metric learning with biased RWR, mapping the influence of food molecules and the phenotype across a multiscale interactome. This process illuminated the proteins and biological functions most impacted. Overall, this study establishes a foundation for discussing methodological strategies aimed at modulating radioresponse through bioactive molecules in foods. We view this as a pioneering step in creating genome-guided, phytochemically-enriched recipes to enhance RT outcomes in RC patients and see its potential as a reference point for subsequent research.

Methods

Identifying radioresponse modulators

We propose the approach outlined in Fig. 1A for the identification of radioresponse modulators within foods. The core of our model is a graph \(G=(\mathcal {V}, E)\) representing the multiscale interactome described by²³, where nodes are proteins and biological functions, and edges represent protein-protein, protein-biological functions, and biological function-biological function interactions. Protein-protein interactions describe physical interactions between proteins. Protein-biological function interactions connect proteins to the biological functions they affect and biological function-biological function interactions represent the hierarchy of biological functions using the Gene Ontology’s Biological Processes³². For more details on the construction of the multiscale interactome, we refer the reader to²³. Specifically, our graph G has \(|\mathcal {V}| = N+M = 27,458\) nodes of which \(N = 17,660\) are proteins and \(M = 9798\) are biological functions. The phenotype, i.e., the over-expressed proteins in patients exhibiting RT resistance, is modeled as an N-dimensional vector \({\textbf {p}} \in \{0,1 \}^N\) where \(p _i = 1\) if gene i is over-expressed and 0 otherwise. Similarly, protein targets of food molecules are represented as N-dimensional vectors \({\textbf {m}}^j \in \{0,1 \}^N\) where \({m}^j_i = 1\) if protein i is targeted by food molecule j and 0 otherwise. Information of 2100 food molecules and their targets are obtained from FoodDB³³ and STITCH³⁴ datasets. Using the multiscale interactome allows us to explain identified molecules, even when they seem unrelated to the phenotype. It additionally allows us to identify which biological functions are being modulated in cases where a short protein-protein path exists between food molecule targeted proteins and RC-resistant over-expressed genes, adding a level of interpretability.

Network propagation algorithm and metric learning

We combine a network propagation algorithm based on biased random walks with restarts with deep metric learning. The network propagation algorithm starts from initial nodes encoded in binary vectors encoding food molecules and the phenotype. At every step, the walker can restart its walk or jump to an adjacent node. The outputted diffusion profile measures how often each node in the multiscale interactome is visited by the RWR, encoding the effect of food molecules and the phenotype on every protein and biological function. In²³, they optimise the edge weights of the algorithm for a multiscale-based drug-disease prediction task, in which an AUROC = 0.705 was achieved. The task involves predicting whether a drug treats a disease based on known drug-disease pairs taken from the Drug Repurposing Database, the Drug Repurposing Hub and the Drug Indication Database with only FDA-approved treatment relationships. Given that optimising the edge weights of the random walk algorithm has a very small effect on the prediction task (a fixed random walk probability of \(\alpha = 0.64\) and edge-weights all being 1 gives 0.702 AUROC), we propose to fix the edge weights and optimise the weights of a multilayer perceptron (MLP) instead, using deep metric learning in order to minimise the distance between known drug-disease pair embeddings and maximise the distance to unknown drug-disease pairs. We set the propagation value of the RWR to 10 times the mean maximum propagated value over all drugs after propagating with \(\alpha = 0\), giving a value of \(\alpha = 0.64\). For each disease, we randomly sample both a positive drug (a drug which is known to be beneficial against the disease) and a negative drug (a drug which has no known benefit). This triple (disease, positive drug, negative drug) is passed to a MLP in order to get an embedding for the disease and the two drugs. A triplet loss is then used in order to minimise the distance between the disease and positive drug and maximise the distance to the negative drug. We use 5-fold cross-validation to optimize the model in the set of drugs and diseases (N = 1651), and use the trained model to give a ranking of food molecules based on distance to the phenotype embedding. In each split, we train the model for a maximum of 100 epochs using the Adam optimizer. Final propagation profiles reflect protein and biological functions affected. However, the model alone is not sufficient to filter out toxic molecules or metals from the food molecule database. Additionally, it is difficult for the model to learn whether the molecules affect the biological functions disrupted by the phenotype rather than directly targeting disease proteins or their regulators.

Filtering predictions

Using propagated profiles or the entity embeddings, we find the top 100 food molecules closest to the phenotype. These molecules affect similar proteins and biological functions as those responsible for radioresistance. Experimental evidence indicates that the phenotype-associated genes are over-expressed in patients exhibiting RT resistance leading to a positive modulation of lipid metabolism. Therefore, we use domain knowledge and literature search to filter out identified molecules with positive regulatory effects on lipid metabolism, leaving 33 modulators to retrieve the list of ingredients (Appendix A). Modulators belong to a myriad of compound classes including flavonoids, isoflavonoids, and bezenoids, in alignment with the current knowledge on chemotherapeutic bioactive molecules within foods¹¹. Overall, predicted modulators are involved in cell signaling, cell growth and lipid metabolism. For example, Genistein works by inhibiting the Arachidonic Acid pathway, making it a suitable natural agent for cancer prevention and therapy¹¹.

Recipe optimisation module

Having found radioresponse modulators in the previous step, we propose to provide patients with recipes that maximize the number of ingredients with these molecules (Fig. 1C). Associations between foods and the molecules they contain are taken from FoodDB³⁵, and a baseline set of recipes from the Recipe1M dataset³⁶. Ingredients from these two datasets were preprocessed (turned to lowercase, spaces and plurals removed) and matched if they shared the first or last two words, or if they had the same word in the first or in the last position. This meant that ingredients such as king oyster mushroom and dried porcini mushroom were treated as being the same ingredient.

After combining these datasets, an enrichment score is calculated for each recipe based on the number of radioresponse modulators that they contain. Additionally, ingredient context embeddings from the BERT model³⁷ are used to optimize the recipes and provide recommended ingredient substitutions to patients. These substitutions are done to increase the amount of anti-RT-resistance molecules whilst also preserving the recipe’s culinary attributes. Ingredient substitutions for the Recipe1M dataset were then found using the same method outlined in³⁸. Starting with the bert-base-cased model in the Hugging Face library³⁹, the BERT vocabulary was extended to include all the ingredients in the dataset. The BERT model, with a hidden representation of dimension 768, was then trained on the cooking instructions for each recipe in the dataset. Given that BERT gives different embeddings for the same ingredient in different contexts, there ends up being approximately 285,000 embeddings for all ingredients. For all the embeddings of a single ingredient, the 200 nearest neighbors were found using KNN and a substitute score given to other ingredients based on how often it appeared in the 200 nearest neighbors for all the embeddings. Suggested substitutes were then found for an ingredient by finding ingredients which had a score of over 100 and which were greater than 1/10 of the highest score for that ingredient.

To visualize the embedding space, we averaged all the embeddings for the same ingredient in order get a single embedding of dimension 768 for each ingredient. A 2D projection of this space using Principal Component Analysis is shown for a few of the ingredients in Fig. 4A. The suggested ingredient substitutes for a particular ingredient were then filtered to only include ingredients that had a higher number of molecules with potential for RT modulation than the initial ingredient. Some examples of these substitutions are shown in Fig. 4B. The number of beneficial molecules for each ingredient was found using the FoodDB database and is shown in Appendix A. Recipes in the Recipe1M dataset were optimized by looping through the ingredients and randomly selecting a substitute within the filtered list of substitutes. Additionally, it was constrained such that the same substitute can not be made for different ingredients within the recipe and a substitute suggestion which is already in the recipe is not allowed. Some examples of a mutated recipe are shown in Fig. 4C.

Dietary recommendations

When recommending recipes to a patient, it is also important to take into account other factors such as allergies, food preferences and general nutritional guidance. The flexibility of our approach and scoring function makes this possible. We showcase this by further optimising our recipes to take into account allergies and food preferences. Additional input is given to the model in the form of a list of user allergies and a dictionary of user food preferences. The allergy list contains which of the 14 main food allergens the user has and the food preference dictionary has keys corresponding to ingredients and values being a score of 1–5 indicating the patient’s like of the food (1 indicating a strong dislike and 5 a strong like). In order to take into account this information, we create a database containing all the unique ingredients and whether they satisfy each of the 14 allergies. Given an allergy list input, we loop through all recipes and make an ingredient substitution for all ingredients where the patient is allergic. If there doesn’t exist a substitution or all substituted ingredients also cause allergies then the recipe is removed. We then optimise these new recipes as before to take into account both the number of radioresponse modulators and also the patient’s food preferences. This is done by making ingredient substitutions in a recipe if either the patient prefers the new ingredient or if there is an increase in the number of radioresponse modulators whilst also enforcing that there is not a reduction in the other.

Data availability

All data used in the paper is publicly available. Genome data can be collected from STRING⁴⁰ (https://string-db.org), UniProt⁴¹ (https://www.uniprot.org), COSMIC⁴² (https://cancer.sanger.ac.uk/cosmic), and NCBI Gene⁴³ (https://www.ncbi.nlm.nih.gov/gene/). Drug data can be extracted from DrugBank⁴⁴ (https://www.drugbank.ca), DrugCentral⁴⁵ (http://drugcentral.org), and STITCH⁴⁶ (http://stitch.embl.de). Food data can be extracted from FooDB⁴⁷ (https://foodb.ca) and STITCH⁴⁶ (http://stitch.embl.de). The recipes can be obtained from Recipe1M³⁶ (http://pic2recipe.csail.mit.edu/) and the Multiscale Interactome data and analysis from (github.com/snap-stanford/multiscale-interactome)²³.

References

Heald, R., Husband, E. & Ryall, R. The mesorectum in rectal cancer surgery-the clue to pelvic recurrence?. Br. J. Surg. 69, 613–616. https://doi.org/10.1002/BJS.1800691019 (1982).
Article CAS PubMed Google Scholar
Kreis, M. E. et al. Use of preoperative magnetic resonance imaging to select patients with rectal cancer for neoadjuvant chemoradiation-interim analysis of the German OCUM Trial (NCT01325649). J. Gastrointest. Surg. 20, 25–33. https://doi.org/10.1007/S11605-015-3011-0 (2015).
Article PubMed Google Scholar
Sebag-Montefiore, D. et al. Preoperative radiotherapy versus selective postoperative chemoradiotherapy in patients with rectal cancer (MRC CR07 and NCIC-CTG C016): A multicentre, randomised trial. The Lancet 373, 811–820. https://doi.org/10.1016/S0140-6736(09)60484-0 (2009).
Article Google Scholar
Erlandsson, J. et al. Optimal fractionation of preoperative radiotherapy and timing to surgery for rectal cancer (Stockholm III): A multicentre, randomised, non-blinded, phase 3, non-inferiority trial. Lancet Oncol. 18, 336–346. https://doi.org/10.1016/S1470-2045(17)30086-4 (2017).
Article PubMed Google Scholar
Gijn, W. V. et al. Preoperative radiotherapy combined with total mesorectal excision for resectable rectal cancer: 12-year follow-up of the multicentre, randomised controlled TME trial. Lancet Oncol. 12, 575–582. https://doi.org/10.1016/S1470-2045(11)70097-3 (2011).
Article PubMed Google Scholar
Poynter, L. et al. Network mapping of molecular biomarkers influencing radiation response in rectal cancer. Clin. Colorectal Cancer 18, e210–e222. https://doi.org/10.1016/J.CLCC.2019.01.004 (2019).
Article PubMed Google Scholar
Buckley, A. M., Lynam-Lennon, N., O’Neill, H. & O’Sullivan, J. Targeting hallmarks of cancer to enhance radiosensitivity in gastrointestinal cancers. Nat. Rev. Gastroenterol. Hepatol. 17, 298–313. https://doi.org/10.1038/s41575-019-0247-2 (2020).
Article CAS PubMed Google Scholar
Gavrilas, L. I. et al. Plant-derived bioactive compounds in colorectal cancer: Insights from combined regimens with conventional chemotherapy to overcome drug-resistance. Biomedicines 10, 85 (2022).
Article Google Scholar
Mahmod, A. I., Haif, S. K., Kamal, A., Al-Ataby, I. A. & Talib, W. H. Chemoprevention effect of the Mediterranean diet on colorectal cancer: Current studies and future prospects. Front. Nutr. 9, 924192 (2022).
Article PubMed PubMed Central Google Scholar
Stelzer, G. et al. The GeneCards suite: From gene data mining to disease genome sequence analyses. Curr. Protoc. Bioinform. 54, 1–1. https://doi.org/10.1002/CPBI.5 (2016).
Article Google Scholar
Yarla, N. S. et al. Targeting arachidonic acid pathway by natural products for cancer prevention and therapy. Semin. Cancer Biol. 40–41, 48–81. https://doi.org/10.1016/J.SEMCANCER.2016.02.001 (2016).
Article PubMed Google Scholar
Hama, K. et al. Very long-chain fatty acids are accumulated in triacylglycerol and nonesterified forms in colorectal cancer tissues. Sci. Rep. 11, 1–10. https://doi.org/10.1038/s41598-021-85603-w (2021).
Article CAS Google Scholar
Tang, X. & Brindley, D. N. Lipid phosphate phosphatases and cancer. Biomolecules 10, 1–24. https://doi.org/10.3390/BIOM10091263 (2020).
Article Google Scholar
Chang, W., Fa, H., Xiao, D. & Wang, J. Targeting phosphatidylserine for cancer therapy: Prospects and challenges. Theranostics 10, 9214. https://doi.org/10.7150/THNO.45125 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, C.-Y., Chen, J., He, L. & Stiles, B. L. PTEN: Tumor suppressor and metabolic regulator. Front. Endocrinol. 0, 338. https://doi.org/10.3389/FENDO.2018.00338 (2018).
Article Google Scholar
Kim, Y. S., Young, M. R., Bobe, G., Colburn, N. H. & Milner, J. A. Bioactive food components, inflammatory targets, and cancer prevention. Cancer Prev. Res. 2, 200–208. https://doi.org/10.1158/1940-6207.CAPR-08-0141 (2009).
Article CAS Google Scholar
Pan, M.-H., Lai, C.-S., Dushenkov, S. & Ho, C.-T. Modulation of inflammatory genes by natural dietary bioactive compounds. J. Agric. Food Chem. 57, 4467–4477. https://doi.org/10.1021/JF900612N (2009).
Article CAS PubMed Google Scholar
Samadi, A. K. et al. A multi-targeted approach to suppress tumor-promoting inflammation. Semin. Cancer Biol. 35, S151–S184. https://doi.org/10.1016/J.SEMCANCER.2015.03.006 (2015).
Article PubMed Google Scholar
Nencioni, A., Caffa, I., Cortellino, S. & Longo, V. D. Fasting and cancer: Molecular mechanisms and clinical application. Nat. Rev. Cancer 18, 707–719 (2018).
Article CAS PubMed PubMed Central Google Scholar
Veselkov, K. et al. HyperFoods: Machine intelligent mapping of cancer-beating molecules in foods. Sci. Rep. 9, 9237. https://doi.org/10.1038/s41598-019-45349-y (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Gonzalez, G., Gong, S., Laponogov, I., Bronstein, M. & Veselkov, K. Predicting anticancer hyperfoods with graph convolutional networks. Hum. Genom. 15, 33. https://doi.org/10.1186/s40246-021-00333-4 (2021).
Article Google Scholar
Laponogov, I. et al. Network machine learning maps phytochemically rich Hyperfoods to fight COVID-19. Hum. Genom. 15, 1. https://doi.org/10.1186/s40246-020-00297-x (2021).
Article CAS Google Scholar
Ruiz, C., Zitnik, M. & Leskovec, J. Identification of disease treatment mechanisms through the multiscale interactome. Nat. Commun. 12, 1–15. https://doi.org/10.1038/s41467-021-21770-8 (2021).
Article CAS Google Scholar
Pei, J., Velu, P., Zareian, M., Feng, Z. & Vijayalakshmi, A. Effects of syringic acid on apoptosis, inflammation, and akt/mtor signaling pathway in gastric cancer cells. Front. Nutr. 8, 1109. https://doi.org/10.3389/fnut.2021.788929 (2021).
Article CAS Google Scholar
Riahi-Chebbi, I. et al. The Phenolic compound Kaempferol overcomes 5-fluorouracil resistance in human resistant LS174 colon cancer cells. Sci. Rep. 9, 195. https://doi.org/10.1038/s41598-018-36808-z (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Wenzel, U., Nickel, A. & Daniel, H. alpha-lipoic acid induces apoptosis in human colon cancer cells by increasing mitochondrial respiration with a concomitant o2-*-generation. Apoptosis Int. J. Program. Cell Death 10, 359–68. https://doi.org/10.1007/s10495-005-0810-x (2005).
Article CAS Google Scholar
Afshin, A. et al. Health effects of dietary risks in 195 countries, 1990–2017: A systematic analysis for the Global Burden of Disease Study 2017. The Lancet 393, 1958–1972. https://doi.org/10.1016/S0140-6736(19)30041-8 (2019).
Article Google Scholar
Gonzalez, G., Gong, S., Laponogov, I., Bronstein, M. & Veselkov, K. Predicting anticancer hyperfoods with graph convolutional networks. Hum. Genom. 15, 741. https://doi.org/10.1186/s40246-021-00333-4 (2021).
Article Google Scholar
Mittelman, S. D. The role of diet in cancer prevention and chemotherapy efficacy. Annu. Rev. Nutr. 40, 273–297 (2020).
Article CAS PubMed PubMed Central Google Scholar
Laponogov, I. et al. Network machine learning maps phytochemically rich hyperfoods to fight COVID-19. Hum. Genom. 15, 741. https://doi.org/10.1186/s40246-020-00297-x (2021).
Article CAS Google Scholar
Cory, H., Passarelli, S., Szeto, J., Tamez, M. & Mattei, J. The role of polyphenols in human health and food systems: A mini-review. Front. Nutr. 5, 753. https://doi.org/10.3389/fnut.2018.00087 (2018).
Article CAS Google Scholar
The Gene Ontology Consortium. The gene Ontology resource: 20 years and still GOing strong. Nucleic Acids Res.47, D330–D338. https://doi.org/10.1093/NAR/GKY1055 (2019).
Wishart, D. S. et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res.46, D1074–D1082. https://doi.org/10.1093/nar/gkx1037 (2018).
Kuhn, M., von Mering, C., Campillos, M., Jensen, L. J. & Bork, P. STITCH: Interaction networks of chemicals and proteins. Nucleic Acids Res. 36, 684–8. https://doi.org/10.1093/nar/gkm795 (2008).
Article CAS Google Scholar
Harrington, R. A., Adhikari, V., Rayner, M. & Scarborough, P. Nutrient composition databases in the age of big data: FoodDB, a comprehensive, real-time database infrastructure. BMJ Open 9, 1–10. https://doi.org/10.1136/bmjopen-2018-026652 (2019).
Article CAS Google Scholar
Marin, J. et al. Recipe1M+: A dataset for learning cross-modal embeddings for cooking recipes and food images. IEEE Trans. Pattern Anal. Mach. Intell. 43, 187–203. https://doi.org/10.1109/TPAMI.2019.2927476 (2021).
Article Google Scholar
Devlin, J., Chang, M. W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL HLT 2019– 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies—Proceedings of the Conference, vol. 1 4171–4186. https://doi.org/10.18653/v1/N19-1423 (2019).
Pellegrini, C., Özsoy, E., Wintergerst, M. & Groh, G. Exploiting food embeddings for ingredient substitution. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies. https://doi.org/10.5220/0010202000670077 (Science and Technology Publications, 2021).
Wolf, T. et al. HuggingFace’s Transformers: State-of-the-art Natural Language Processing. CoRRabs/1910.0, https://doi.org/10.18653/v1/2020.emnlp-demos.6 (2019).
Szklarczyk, D. et al. The STRING database in 2021: Customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49, D605–D612. https://doi.org/10.1093/nar/gkab835 (2021).
Article CAS PubMed Google Scholar
The UniProt Consortium. UniProt: The universal protein knowledgebase. Nucleic Acids Res.45, D158–D169. https://doi.org/10.1093/nar/gkw1099 (2016).
Tate, J. G. et al. COSMIC: The catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941–D947. https://doi.org/10.1093/nar/gky1015 (2019).
Article CAS PubMed Google Scholar
Brown, G. R. et al. Gene: A gene-centered information resource at NCBI. Nucleic Acids Res. 43, D36–D42. https://doi.org/10.1093/nar/gku1055 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Wishart, D. S. et al. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082. https://doi.org/10.1093/nar/gkx1037 (2017).
Article CAS PubMed Central Google Scholar
Ursu, O. et al. DrugCentral: Online drug compendium. Nucleic Acids Res. 45, D932–D939. https://doi.org/10.1093/nar/gkw993 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kuhn, M., von Mering, C., Campillos, M., Jensen, L. J. & Bork, P. STITCH: Interaction networks of chemicals and proteins. Nucleic Acids Res. 36, D684–D688. https://doi.org/10.1093/nar/gkm795 (2007).
Article CAS PubMed PubMed Central Google Scholar
Wishart Research Group. FooDB. http://foodb.ca (2022).

Download references

Acknowledgements

J.S. is supported by the UKRI CDT in AI for Healthcare http://ai4health.io (Grant No. P/S023283/1) and by the Vodafone Foundation through the DreamLab/DRUGS and CORONA-AI projects. K.V, D.V, I.L., and G.G were supported by the Vodafone Foundation as part of the ongoing DreamLab/DRUGS and CORONA-AI projects and the ERC Proof of Concept Grant No. 899932 (Hyperfoods). M.B, D.V., and G.G were supported by the ERC-Consolidator Grant No. 724228 (LEMAN). The work was additionally funded by the UK Research and Innovation (Grant No. 10058099), and the European Union (Grant No. 101095359), as part of the AIDA project.

Author information

These authors contributed equally: Joshua Southern and Guadalupe Gonzalez.

Authors and Affiliations

Department of Computing, Imperial College London, London, SW7 2BX, UK
Joshua Southern, Guadalupe Gonzalez & Dennis Veselkov
Prescient Design, Genentech, Basel, 4052, Switzerland
Guadalupe Gonzalez & Kirill Veselkov
North Middlesex University Hospital, London, N18 1QX, UK
Pia Borgas
Department of Surgery and Cancer, Imperial College London, London, SW7 2BX, UK
Liam Poynter, Ivan Laponogov & Yoyo Zhong
Royal Free Hospital, London, NW3 2QG, UK
Reza Mirnezami
Department of Computer Science, University of Oxford, Oxford, OX1 3QD, UK
Michael Bronstein
Department of Environmental Health Sciences, Yale University, New Haven, CT, 06510, USA
Kirill Veselkov

Authors

Joshua Southern
View author publications
You can also search for this author in PubMed Google Scholar
Guadalupe Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Pia Borgas
View author publications
You can also search for this author in PubMed Google Scholar
Liam Poynter
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Laponogov
View author publications
You can also search for this author in PubMed Google Scholar
Yoyo Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Reza Mirnezami
View author publications
You can also search for this author in PubMed Google Scholar
Dennis Veselkov
View author publications
You can also search for this author in PubMed Google Scholar
Michael Bronstein
View author publications
You can also search for this author in PubMed Google Scholar
Kirill Veselkov
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.V. and M.B. designed the concept and supervised the study. J.S and G.G. developed the methodology, implemented the computational workflow. J.S, G.G., R.M., P.B., L.P., I.L, Y.Z. aggregated the data sets. L.P. did the gene expression studies and provided the list of DE genes; K.V., M.B., I.L., R.M., D.V. designed research and helped with idea creation, Y.Z. benchmarked the models, P.B. helped with filtering radioresponse modulators. All authors contributed to writing the manuscript and results interpretation. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Kirill Veselkov.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Southern, J., Gonzalez, G., Borgas, P. et al. Genomic-driven nutritional interventions for radiotherapy-resistant rectal cancer patient. Sci Rep 13, 14862 (2023). https://doi.org/10.1038/s41598-023-41833-8

Download citation

Received: 07 December 2022
Accepted: 31 August 2023
Published: 08 September 2023
DOI: https://doi.org/10.1038/s41598-023-41833-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.