Abstract
Complex traits show clear patterns of tissue-specific expression influenced by single nucleotide polymorphisms (SNPs), yet current strategies aggregate SNP effects to genes by employing simple physical proximity-based windows. Here, we examined whether incorporating SNPs with effects on tissue-specific cis-expression would improve our ability to detect trait-relevant tissues across 31 complex traits using stratified linkage disequilibrium score regression (S-LDSC). We found that a physical proximity annotation produced more significant tissue enrichments and larger S-LDSC regression coefficients, as compared to an expression-based annotation. Furthermore, we showed that our expression-based annotation did not outperform an annotation strategy in which an equal number of randomly chosen SNPs were annotated to genes within the same genomic window, suggesting extensive redundancy among SNP effect estimates due to linkage disequilibrium. That said, current sample sizes limit estimation of cis-genetic SNP effects; therefore, we recommend reexamination of the expression-based annotation when larger tissue-specific expression datasets become available. To examine the influence of sample size, we used a large whole blood eQTL reference panel (N = 31,684) applying a similar expression-based annotation strategy. We found that significant cis-expression QTLs in whole blood did not outperform the physical proximity annotation when estimating tissue-specific SNP heritability enrichment for either high- or low-density lipoprotein phenotypes but performed similarly for inflammatory bowel disease. Finally, we report new and updated tissue enrichment estimates across 31 complex traits, such as significant heritability enrichment of the frontal cortex for cognitive performance, educational attainment, and intelligence, providing further evidence of this structure’s importance in higher cognitive function.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Data availability
GWAS summary statistics (Supplementary Table 2), FUSION expression weights (http://gusevlab.org/projects/fusion/), and S-LDSC specifically expressed genes (https://alkesgroup.broadinstitute.org/LDSCORE/) are available online. All data generated herein are reported in the Supplementary tables. ICD10 United Kingdom Biobank GWAS summary statistics are available upon request.
Code availability
All code is available upon request.
References
Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48:245–52.
Finucane HK, Reshef YA, Anttila V, Slowikowski K, Gusev A, Byrnes A, et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat Genet. 2018;50:621–9. https://doi.org/10.1038/s41588-018-0081-4.
Ongen H, Brown AA, Delaneau O, Panousis NI, Nica AC, Dermitzakis ET. Estimating the causal tissues for complex traits and diseases. Nat Genet. 2017;49:1676–83. https://doi.org/10.1038/ng.3981.
Gusev A, Mancuso N, Won H, Kousi M, Finucane HK, Reshef Y, et al. Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nat Genet. 2018;50:538–48. https://doi.org/10.1038/s41588-018-0092-1.
Heinzen EL, Ge D, Cronin KD, Maia JM, Shianna KV, Gabriel WN, et al. Tissue-specific genetic control of splicing: implications for the study of complex traits. PLoS Biol. 2008;6:e1000001. https://doi.org/10.1371/journal.pbio.1000001.
Guo Y, Jamison DC. The distribution of SNPs in human gene regulatory regions. BMC Genomics. 2005;6:140. https://doi.org/10.1186/1471-2164-6-140.
Shen LX, Basilion JP, Stanton VP. Single-nucleotide polymorphisms can cause different structural folds of mRNA. Proc Natl Acad Sci USA. 1999;96:7871–6.
Beaudoin JD, Perreault JP. 5′-UTR G-quadruplex structures acting as translational repressors. Nucleic Acids Res. 2010;38:7022–36. https://doi.org/10.1093/nar/gkq557.
Xu H, Wang P, You J, Zheng Y, Fu Y, Tang Q, et al. Screening of Kozak-motif-located SNPs and analysis of their association with human diseases. Biochem Biophys Res Commun. 2010;392:89–94. https://doi.org/10.1016/j.bbrc.2010.01.002.
Robert F, Pelletier J. Exploring the impact of single-nucleotide polymorphisms on translation. Front Genet. 2018;9:507. https://doi.org/10.3389/fgene.2018.00507.
Calvo SE, Pagliarini DJ, Mootha VK. Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc Natl Acad Sci USA. 2009;106:7507–12. https://doi.org/10.1073/pnas.0810916106.
Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8:1826. https://doi.org/10.1038/s41467-017-01261-5.
Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9:1825. https://doi.org/10.1038/s41467-018-03621-1.
Yao DW, O’Connor LJ, Price AL, Gusev A. Quantifying genetic effects on disease mediated by assayed gene expression levels. Nat Genet. 2020;52:626–33. https://doi.org/10.1038/s41588-020-0625-2.
Liu X, Li YI, Pritchard JK. Trans effects on gene expression can drive omnigenic inheritance. Cell. 2019;177:1022–34.e6. https://doi.org/10.1016/j.cell.2019.04.014.
Tashman KC, Cui R, O’Connor LJ, Neale BM, Finucane HK. Significance testing for small annotations in stratified LD-score regression. 2021. https://doi.org/10.1101/2021.03.13.21249938.
Watanabe K, Stringer S, Frei O, Mirkov MU, de Leeuw C, Polderman TJC, et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat Genet. 2019;51:1339–48. https://doi.org/10.1038/s41588-019-0481-0.
Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, et al. The genotype-tissue expression (GTEx) project. Nat Genet. 2013;45:580–5.
Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47:1228–35.
Akbarian S, Liu C, Knowles JA, Vaccarino FM, Farnham PJ, Crawford GE, et al. The PsychENCODE project. Nat Neurosci. 2015;18:1707–12. https://doi.org/10.1038/nn.4156.
Brodie A, Azaria JR, Ofran Y. How far from the SNP may the causative genes be? Nucleic Acids Res. 2016;44:6046–54. https://doi.org/10.1093/nar/gkw500.
Võsa U, Claringbould A, Westra HJ, Bonder MJ, Deelen P, Zeng B, et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat Genet. 2021;53:1300–10. https://doi.org/10.1038/s41588-021-00913-z.
Liu M, Jiang Y, Wedow R, Li Y, Brazel DM, Chen F, et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat Genet. 2019;51:237–44.
Walters RK, Polimanti R, Johnson EC, McClintick JN, Adams MJ, Adkins AE, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat Neurosci. 2018;21:1656–69. https://doi.org/10.1038/s41593-018-0275-1.
Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat Genet. 2019;51:404–13. https://doi.org/10.1038/s41588-018-0311-9.
Watson HJ, Yilmaz Z, Thornton LM, Hubel C, Coleman JRI, Gaspar HA, et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet. 2019;51:1207–14. https://doi.org/10.1038/s41588-019-0439-2.
Otowa T, Hek K, Lee M, Byrne EM, Nivard MG, Bigdeli T, et al. Meta-analysis of genome-wide association studies of anxiety disorders. Mol Psychiatry. 2016;21:1391–9. https://doi.org/10.1038/mp.2015.197.
Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet. 2021;53:817–29. https://doi.org/10.1038/s41588-021-00857-4.
Pulit SL, Stoneman C, Morris AP, Wood AR, Glastonbury CA, Tyrrell J, et al. Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry. Hum Mol Genet. 2019;28:166–74. https://doi.org/10.1093/hmg/ddy327.
Jansen PR, Nagel M, Watanabe K, Wei Y, Savage JE, de Leeuw CA, et al. Genome-wide meta-analysis of brain volume identifies genomic loci and genes shared with intelligence. Nat Commun. 2020;11:5606. https://doi.org/10.1038/s41467-020-19378-5.
Johnson EC, Demontis D, Thorgeirsson TE, Walters RK, Polimanti R, Hatoum AS, et al. A large-scale genome-wide association study meta-analysis of cannabis use disorder. Lancet Psychiatry. 2020;7:1032–45. https://doi.org/10.1016/S2215-0366(20)30339-4.
Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50:1112–21. https://doi.org/10.1038/s41588-018-0147-3.
Jansen PR, Watanabe K, Stringer S, Skene N, Bryois J, Hammerschlag AR, et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat Genet. 2019;51:394–403. https://doi.org/10.1038/s41588-018-0333-3.
Okbay A, Baselmans BML, De Neve JE, Turley P, Nivard MG, Fontana MA, et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet. 2016;48:624–33. https://doi.org/10.1038/ng.3552.
Yengo L, Sidorenko J, Kemper KE, Zheng Z, Wood AR, Weedon MN, et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum Mol Genet. 2018;27:3641–9. https://doi.org/10.1093/hmg/ddy271.
Savage JE, Jansen PR, Stringer S, Watanabe K, Bryois J, de Leeuw CA, et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat Genet. 2018;50:912–9. https://doi.org/10.1038/s41588-018-0152-6.
Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A, et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet. 2018;50:668–81. https://doi.org/10.1038/s41588-018-0090-3.
International Obsessive Compulsive Disorder Foundation Genetics Collaborative (IOCDF-GC) and OCD Collaborative Genetics Association Studies (OCGAS). Revealing the complex genetic architecture of obsessive-compulsive disorder using meta-analysis. Mol Psychiatry. 2018;23:1181–8. https://doi.org/10.1038/mp.2017.154.
Nievergelt CM, Maihofer AX, Klengel T, Atkinson EG, Chen C, Choi KW, et al. International meta-analysis of PTSD genome-wide association studies identifies sex- and ancestry-specific genetic risk loci. Nat Commun. 2019;10:4558. https://doi.org/10.1038/s41467-019-12576-w.
Yu D, Sul JH, Tsetsos F, Nawaz MS, Huang AY, Zelaya I, et al. Interrogating the genetic determinants of Tourette syndrome and other tic disorders through genome-wide association studies. Am J Psychiatry. 2019;176:217–27. https://doi.org/10.1176/appi.ajp.2018.18070857.
Ripke S, Neale BM, Corvin A, Walters JTR, Farh K, Holmans PA, et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–7. https://doi.org/10.1038/nature13595.
Autism Spectrum Disorders Working Group of The Psychiatric Genomics Consortium. Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24.32 and a significant overlap with schizophrenia. Mol Autism. 2017;8:21. https://doi.org/10.1186/s13229-017-0137-9.
Neale Lab. UK Biobank—Neale Lab. 2018. http://www.nealelab.is/uk-biobank/.
Jostins L, Ripke S, Weersma RK, Duerr RH, McGovern DP, Hui KY, et al. Host–microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature. 2012;491:119–24. https://doi.org/10.1038/nature11582.
The International HapMap 3 Consortium. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–58. https://doi.org/10.1038/nature09298.
Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82. https://doi.org/10.1016/j.ajhg.2010.11.011.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75. https://doi.org/10.1086/519795.
Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. https://doi.org/10.1371/journal.pmed.1001779.
Pardiñas AF, Holmans P, Pocklington AJ, Escott-Price V, Ripke S, Carrera N, et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet. 2018;50:381–9. https://doi.org/10.1038/s41588-018-0059-2.
Badre D, Nee DE. Frontal cortex and the hierarchical control of behavior. Trends Cogn Sci. 2018;22:170–88. https://doi.org/10.1016/j.tics.2017.11.005.
Acknowledgements
We thank the participants and researchers of the original studies whose summary statistics and expression weights were utilized in this work. We also thank the authors of FUSION for generating the expression weights, as well as the Finucane lab for the specifically expressed gene sets. This work utilized the Summit supercomputer, which is supported by the National Science Foundation (awards ACI-1532235 and ACI-1532236), the University of Colorado Boulder, and Colorado State University. The Summit supercomputer is a joint effort of the University of Colorado Boulder and Colorado State University.
Funding
This work was supported by the National Institutes of Health (grant numbers T32 DA017637 to John K. Hewitt, R01 AG046938-06 to Chandra A. Reynolds, R01 DA044283-01 to Scott I. Vrieze, and R01 MH100141-06 to Matthew C. Keller) and the University of Colorado Boulder Institute for Behavioral Genetics.
Author information
Authors and Affiliations
Contributions
TJM and LME contributed to study conception, design, manuscript preparation, and approval of the final version of the manuscript. TJM conducted analyses and curated datasets.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
This research was approved by the CU IRB under protocol 18-0091.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mize, T.J., Evans, L.M. Examination of a novel expression-based gene-SNP annotation strategy to identify tissue-specific contributions to heritability in multiple traits. Eur J Hum Genet 32, 263–269 (2024). https://doi.org/10.1038/s41431-022-01244-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41431-022-01244-1
This article is cited by
-
Solving medical mysteries with genomics
European Journal of Human Genetics (2024)