Abstract
Biological validation of preliminary findings is a key prerequisite in biomarker discovery. In recent years, the development of advanced large-scale sequencing technologies combined with high-throughput computational analysis methods led to the extraction of considerable amount of data from healthy and diseased tissues. Stored in large open-access repositories, these data can be accessed and interrogated by researchers aiming at understanding the biological rationale behind their results. These so called in silico analyses, in opposite to in vitro analyses, have gained increasing importance in recent years, becoming a major component of research projects and publications. However, making sense of the large amount of data available can be challenging and may lead to a misinterpretation of the data. To reduce the dimensionality of this data, recent years have seen the development of statistical m\ethods and advanced graph analytics which help researchers summarize the available data and draw appropriate conclusions. In this chapter we will describe three in silico methods to investigate the biological underpinnings of a panel of seven blood-based biomarkers of Alzheimer’s disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Califf RM (2018) Biomarker definitions and their applications. Exp Biol Med (Maywood) 243:213–221. https://doi.org/10.1177/1535370217750088
Ioannidis JPA, Bossuyt PMM (2017) Waste, leaks, and failures in the biomarker pipeline. Clin Chem 63:963–972. https://doi.org/10.1373/clinchem.2016.254649
GTEx Consortium (2013) The genotype-tissue expression (GTEx) project. Nat Genet 45:580–585. https://doi.org/10.1038/ng.2653
Weinstein JN, Collisson EA, Mills GB et al (2013) The cancer genome atlas pan-cancer analysis project. Nat Genet 45:1113–1120. https://doi.org/10.1038/ng.2764
Barrett T, Troup DB, Wilhite SE et al (2011) NCBI GEO: archive for functional genomics data sets – 10 years on. Nucleic Acids Res 39:D1005–D1010. https://doi.org/10.1093/nar/gkq1184
Harris MA, Clark J, Ireland A et al (2004) The gene ontology (GO) database and informatics resource. Nucleic Acids Res 32:D258–D261. https://doi.org/10.1093/nar/gkh036
Jassal B, Matthews L, Viteri G et al (2020) The reactome pathway knowledgebase. Nucleic Acids Res 48:D498–D503. https://doi.org/10.1093/nar/gkz1031
Kanehisa M, Goto S (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28:27–30. https://doi.org/10.1093/nar/28.1.27
Wijesooriya K, Jadaan SA, Perera KL et al (2022) Urgent need for consistent standards in functional enrichment analysis. PLoS Comput Biol 18:e1009935. https://doi.org/10.1371/journal.pcbi.1009935
Liao Y, Wang J, Jaehnig EJ et al (2019) WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res 47:W199–W205. https://doi.org/10.1093/nar/gkz401
Raudvere U, Kolberg L, Kuzmin I et al (2019) G:profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res 47:W191–W198. https://doi.org/10.1093/nar/gkz369
Sherman BT, Hao M, Qiu J et al (2022) DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res 50:W216–W221. https://doi.org/10.1093/nar/gkac194
Chen J, Bardes EE, Aronow BJ, Jegga AG (2009) ToppGene suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 37:W305–W311. https://doi.org/10.1093/nar/gkp427
Eden E, Navon R, Steinfeld I et al (2009) GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics 10:48. https://doi.org/10.1186/1471-2105-10-48
Mi H, Ebert D, Muruganujan A et al (2021) PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucleic Acids Res 49:D394–D403. https://doi.org/10.1093/nar/gkaa1106
Chen EY, Tan CM, Kou Y et al (2013) Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics 14:128. https://doi.org/10.1186/1471-2105-14-128
Merico D, Isserlin R, Stueker O et al (2010) Enrichment map: a network-based method for gene-set enrichment visualization and interpretation. PLoS One 5:e13984. https://doi.org/10.1371/journal.pone.0013984
Maere S, Heymans K, Kuiper M (2005) BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21:3448–3449. https://doi.org/10.1093/bioinformatics/bti551
Fröhlich H, Speer N, Poustka A, Beißbarth T (2007) GOSim – an R-package for computation of information theoretic GO similarities between terms and gene products. BMC Bioinformatics 8:166. https://doi.org/10.1186/1471-2105-8-166
Shannon P, Markiel A, Ozier O et al (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13:2498–2504. https://doi.org/10.1101/gr.1239303
Hardy-Sosa A, León-Arcia K, Llibre-Guerra JJ et al (2022) Diagnostic accuracy of blood-based biomarker panels: a systematic review. Front Aging Neurosci 14
Stelzer G, Rosen N, Plaschkes I et al (2016) The GeneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics 54:1.30.1–1.30.33. https://doi.org/10.1002/cpbi.5
Thul PJ, Lindskog C (2018) The human protein atlas: a spatial map of the human proteome. Protein Sci 27:233–244. https://doi.org/10.1002/pro.3307
Mowinckel AM, Vidal-Piñeiro D (2020) Visualization of brain statistics with R packages ggseg and ggseg3d. Adv Methods Pract Psychol Sci 3:466–483. https://doi.org/10.1177/2515245920928009
Kolberg L, Raudvere U, Kuzmin I et al (2020) gprofiler2 – an R package for gene list functional enrichment analysis and namespace conversion toolset g:profiler. F1000Research 9:709
Slenter DN, Kutmon M, Hanspers K et al (2018) WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research. Nucleic Acids Res 46:D661–D667. https://doi.org/10.1093/nar/gkx1064
Kucera M, Isserlin R, Arkhangorodsky A, Bader GD (2016) AutoAnnotate: a Cytoscape app for summarizing networks with semantic annotations. F1000Res 5:1717. https://doi.org/10.12688/f1000research.9090.1
Subramanian A, Tamayo P, Mootha VK et al (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102:15545–15550. https://doi.org/10.1073/pnas.0506580102
Gable AL, Szklarczyk D, Lyon D et al (2022) Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments. Brief Bioinform 23:bbac355. https://doi.org/10.1093/bib/bbac355
Mooney MA, Wilmot B (2015) Gene set analysis: a step-by-step guide. Am J Med Genet B Neuropsychiatr Genet 168:517–527. https://doi.org/10.1002/ajmg.b.32328
Da Wei Huang, Brad T. Sherman, Richard A. Lempicki (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37(1):1–13, https://doi.org/10.1093/nar/gkn923
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Sadlon, A. (2024). In Silico Models to Validate Novel Blood-Based Biomarkers. In: Perneczky, R. (eds) Biomarkers for Alzheimer’s Disease Drug Development. Methods in Molecular Biology, vol 2785. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-3774-6_20
Download citation
DOI: https://doi.org/10.1007/978-1-0716-3774-6_20
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-3773-9
Online ISBN: 978-1-0716-3774-6
eBook Packages: Springer Protocols