Next Article in Journal
Single-Setting 3D MRI/US-Guided Frozen Sectioning and Cryoablation of the Index Lesion: Mid-Term Oncologic and Functional Outcomes from a Pilot Study
Previous Article in Journal
Shock Waves and Therapeutic Exercise in Greater Trochanteric Pain Syndrome: A Prospective Randomized Clinical Trial with Cross-Over
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development and Replication of a Genome-Wide Polygenic Risk Score for Chronic Back Pain

by
Yakov A. Tsepilov
1,2,*,
Elizaveta E. Elgaeva
1,3,
Arina V. Nostaeva
3,
Roger Compte
4,
Ivan A. Kuznetsov
5,
Lennart C. Karssen
6,
Maxim B. Freidin
7,
Pradeep Suri
8,9,10,11,
Frances M. K. Williams
4 and
Yurii S. Aulchenko
1,6
1
Laboratory of Recombination and Segregation Analysis, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
2
Kurchatov Genomics Center, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
3
Department of Natural Sciences, Novosibirsk State University, Novosibirsk 630090, Russia
4
Department of Twin Research and Genetic Epidemiology, School of Life Course Sciences, King’s College London, London SE1 7EH, UK
5
Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
6
PolyOmica, 5237 PA s-Hertogenbosch, The Netherlands
7
Department of Biology, School of Biological and Behavioural Sciences, Queen Mary University of London, London E1 4NS, UK
8
Division of Rehabilitation Care Services, VA Puget Sound Health Care System, Seattle, WA 98208, USA
9
Seattle Epidemiologic Research and Information Center, VA Puget Sound Health Care System, Seattle, WA 98208, USA
10
Department of Rehabilitation Medicine, University of Washington, Seattle, WA 98208, USA
11
Clinical Learning, Evidence, and Research (CLEAR) Center, University of Washington, Seattle, WA 98208, USA
*
Author to whom correspondence should be addressed.
J. Pers. Med. 2023, 13(6), 977; https://doi.org/10.3390/jpm13060977
Submission received: 18 April 2023 / Revised: 25 May 2023 / Accepted: 5 June 2023 / Published: 10 June 2023

Abstract

:
Chronic back pain (CBP) is a complex heritable trait and a major cause of disability worldwide. We developed and validated a genome-wide polygenic risk score (PRS) for CBP using a large-scale GWAS based on UK Biobank participants of European ancestry (N = 265,000). The PRS showed poor overall predictive ability (AUC = 0.56 and OR = 1.24 per SD, 95% CI: 1.22–1.26), but individuals from the 99th percentile of PRS distribution had a nearly two-fold increased risk of CBP (OR = 1.82, 95% CI: 1.60–2.06). We validated the PRS on an independent TwinsUK sample, obtaining a similar magnitude of effect. The PRS was significantly associated with various ICD-10 and OPCS-4 diagnostic codes, including chronic ischemic heart disease (OR = 1.1, p-value = 4.8 × 10−15), obesity, metabolism-related traits, spine disorders, disc degeneration, and arthritis-related disorders. PRS and environment interaction analysis with twelve known CBP risk factors revealed no significant results, suggesting that the magnitude of G × E interactions with studied factors is small. The limited predictive ability of the PRS that we developed is likely explained by the complexity, heterogeneity, and polygenicity of CBP, for which sample sizes of a few hundred thousand are insufficient to estimate small genetic effects robustly.

1. Introduction

Back pain is a major cause of disability worldwide. According to the Global Burden of Disease Study 2015, low back pain together with neck pain were the leading causes of years lived with disability in 1990–2015 [1]. The most debilitating form of back pain is chronic back pain (CBP; back pain lasting for more than three months). The prevalence of CBP is as high as 30% depending on the population and CBP definition used [2]. It is known that CBP is a complex heritable trait with narrow-sense heritability in the range of 40–68% [3]. Genome-wide association studies (GWASs) using data from national-scale biobanks and electronic medical records have suggested a number of genes to be associated with CBP and similar conditions [4,5,6,7,8,9], with estimated SNP-based heritability up to 13% on the liability scale.
A polygenic risk score (PRS) is an estimate of genetic liability for a trait or disease, calculated according to an individual’s genotype profile and relevant GWASs. A PRS is a powerful modern tool for studying the genetic epidemiology of a disease with potential clinical utility, for example, for developing prognostic biomarkers or subphenotyping patients [10]. There are a few PRS models for general chronic pain [11,12]. These models have been shown to be statistically significantly associated with chronic pain and other traits, but they provide limited ability to predict back pain. Moreover, these models were constructed based on a small number of SNPs or using GWASs with modest sample sizes, and their epidemiological properties have not been sufficiently described.
In this work, we have developed a genome-wide PRS for CBP based on a recent large-scale CBP GWAS. We assessed the ability of the resulting PRS to predict CBP in an independent sample; examined associations between the PRS and diagnostic codes from the International Statistical Classification of Diseases and Related Health Problems, tenth revision (ICD-10), and Classification of Interventions and Procedures, the fourth edition (OPCS-4), in UK biobank in a hypothesis-free manner; and evaluated the interactions between the PRS and major epidemiologic and clinical risk factors for CBP.

2. Materials and Methods

2.1. Developing the PRS Using the UK Biobank

The general scheme of the model creation is presented in Figure 1.
To create and evaluate the CBP PRS model, we used the UK Biobank dataset [13]. Sociodemographic, physical, lifestyle, and health-related characteristics of this cohort have been reported elsewhere [14]. In brief, individuals enrolled in UK Biobank were aged 40–69 years; were less likely to be obese, to smoke, or to drink alcohol; and had fewer self-reported health conditions as compared to the general population. All study participants provided written informed consent, and the study was approved by the North West Multi-Centre for Research Ethics Committee (11/NW/0382).
Cases and controls of CBP were defined based on questionnaire responses. First, participants responded to “Pain type(s) experienced in the last months” followed by questions inquiring as to whether the specific pain had been present for more than three months. Those who reported back pain lasting more than three months were considered to be CBP cases. Participants who reported no pain or pain lasting for less than three months were categorized as controls, including those who reported pain lasting between 1 month and 3 months. Individuals who preferred not to answer were excluded from the study. We excluded from the analysis individuals who reported more than three months of pain all over the body, because this phenotype was negatively correlated with other pain phenotypes.
For training the model, we used a CBP GWAS from our previous study based on 265,000 randomly selected individuals of European ancestry (defined using SNP-based principal component analysis) from UK Biobank [4]. A GWAS was carried out using BOLT-LMM v.2.3.2 software [15]. Linear mixed-effect models were fitted to test for additive effects of the SNPs (genotype dosage) on pain phenotypes, adjusting for age, sex, genotyping platform batch, and the first ten principal components of the kinship matrix. Using this GWAS, we developed a set of six draft PRS models using the SBayesR method [16]. The optimal PRS model was chosen based on the maximum area under the receiver–operator curve (AUC) in the UK Biobank validation dataset, which included 30,000 individuals of European ancestry, not overlapping with the 265,000 individuals from the training dataset. Using the optimal PRS model, we calculated PRS values for the rest of the individuals from UK Biobank data using PLINK v 2.0 software [17].
For testing the model, we used a set of unrelated individuals (as defined by UK Biobank) of European ancestry (N = 120,217, with 21,543 cases and 98,674 controls) who were not included in the training and validation sets. Prior to testing, PRS values were standardized (transformed to have variance equal to 1 and zero mean).

2.2. Assessment of the CBP PRS Using TwinsUK Data

We then validated the CBP PRS in an independent sample from TwinsUK. TwinsUK is an adult twin registry, predominantly female, which includes over 14,000 participants. It is the most deeply phenotyped and genotyped twin cohort in the world [18]. We used a low back pain (LBP) phenotype as a proxy of CBP. The phenotype was derived from validated self-reported questionnaires [19]. From a subgroup of TwinsUK with available genetic data, we randomly sampled one individual from each twin pair, resulting in 2654 participants in the analysis. Random selection bias was accounted for by running ten permutations of the selection. We used a generalized linear model (GLM) to fit logistic regression using the R stats package (version 4.2.1). Stratified five-fold cross-validation was used to assess the performance of the model. Prior to testing, the PRS values were standardized.

2.3. Association of the CBP PRS with Other Health Conditions

We used the PRS of CBP to examine associations with other health conditions, defined by the ICD-10 diagnostic codes (International Statistical Classification of Diseases and Related Health Problems, tenth revision) and OPCS-4 (Classification of Interventions and Procedures, fourth edition) procedure codes, in the set of unrelated individuals of European ancestry from UK Biobank (N = 120,200). We used ICD-10 and OPCS-4 codes combined at the second level (see https://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=41202, accessed on 30 August 2022 and https://biobank.ndph.ox.ac.uk/ukb/field.cgi?id=41272, accessed on 30 August 2022 for more details on code levels). Codes from the ICD-10 chapters XVIII–XXII and from the OPCS-4 chapters X–Z were excluded. Next, we filtered codes with low prevalence (<0.5% and >99.5%). The resulting list of medical codes contained 165 ICD-10 and 132 OPCS-4 designations. Additionally, we analyzed the association of the PRS with six chronic pain phenotypes (pain in the back, neck/shoulders, hip, knee, stomach, or head, lasting for at least three months) defined by questionnaires, as described above. All traits were analyzed utilizing GLMs and linear regression included in the standard R function glm(). In general, models considered the trait (diagnostic code, procedure code, or chronic pain phenotype) as a dependent variable and the standardized PRS as a predictor with information on sex, age, genotyping batch, and the first 10 principal components (PCs) of the kinship matrix included as the covariates (Trait ~ Age + Sex + batch + PC1 + … + PC10 + PRS). We set the Bonferroni-corrected statistical significance threshold for the analysis at p-value < 1.65× 10−4 = 0.05/(165 + 132 + 6), where 165 corresponds to the number of ICD-10 codes analyzed, 132 reflects the number of OPCS-4 codes, and 6 represents the number of chronic pain phenotypes.

2.4. PRS according to Environment Interactions

We selected twelve known risk factors for CBP (Supplementary Table S1). Eleven risk factors were selected based on previous studies: age, sex, BMI, height, physical activity, income (as a proxy of lifestyle), education, alcohol consumption, smoking, depression, and sleep duration [7,20,21,22]. Sitting to standing height ratio was also analyzed, as we have shown that it is associated with CBP [23]. These risk factors were individually tested as covariates interacting with the PRS in a logistic regression model (glm() function). Sex, age, genotyping batch, and the first 10 PCs were included as covariates in all of the models. The significance threshold was set to p-value < 0.004 = 0.05/12.

3. Results

3.1. Developing the PRS Using the UK Biobank

The optimal PRS model for CBP included 1,090,574 HapMap3 well-imputed SNPs. In the out-of-sample test dataset from the UK Biobank, the mean PRS difference between cases and controls was 0.214 of the PRS standard deviation (two-sided t-test p-value < 10−16). Figure 2A displays the histograms of the PRS distributions for cases and controls. The estimated AUC for CBP was 0.56 (95% CI: 0.557–0.565, Figure 2B). The prevalence of CBP in the test dataset was 18%. The Pearson correlation between PRS and CBP status was 0.082. Using logistic regression, the effect of the PRS on CBP was estimated as 0.22 log odds per PRS SD (OR = 1.24, 95% CI: 1.22–1.26). We estimated the OR of CBP if the individual had PRS from the 80th, 90th, and 99th percentiles of PRS distribution. The ORs were 1.41 (95% CI: 1.36–1.46), 1.49 (95% CI: 1.43–1.56), and 1.82 (95% CI: 1.60–2.06), respectively.

3.2. Assessment of the CBP PRS Using TwinsUK Data

We evaluated the performance of the PRS in an independent TwinsUK sample. The prevalence of LBP in TwinsUK was 28%. The AUC of PRS was 0.55 (95% CI: 0.551–0.559). When we added sex and age into the model using the linear coefficients estimated in UK Biobank (0.216 × PRS + 0.06 × Sex + 0.005 × Age), the AUC increased to 0.58 (95% CI: 0.58–0.59). The effect of the standardized PRS on LBP was comparable with the effect of PRS on CBP in UK Biobank: OR = 1.21, 95% CI: 1.11–1.32.

3.3. Association of the CBP PRS with Other Health Conditions and PRS According to Environment Interactions

We evaluated the association of the CBP PRS with ICD-10 and OPCS-4 diagnostic codes (Supplementary Table S2). The PRS was significantly associated with 117 codes, with OR point estimates ranging from 1.05 (for S06 “Other excision of lesion of skin”) to 1.35 (for A52 “Therapeutic epidural injection”). Among the strongest associations based on p-values (Table 1), we observed chronic pain at five sites (neck/shoulder, hip, knee, stomach, and headache), M54 “Dorsalgia,” M47 “Spondylosis,” I10 “Essential (primary) hypertension,” and OPCS-4 G45 “ Diagnostic fibreoptic endoscopic examination of upper gastrointestinal tract”.
We evaluated the magnitude of PRS-by-environment interactions with twelve risk factors for CBP. The results are presented in Supplementary Table S3. No significant PRS-by-environment interactions were observed after accounting for multiple comparisons, but alcohol consumption and physical activity had nominally significant interaction terms with PRS (p-value < 0.05).

4. Discussion

Here, we have developed and replicated the first genome-wide polygenic risk score for CBP. The overall predictive ability of the PRS was modest (AUC = 0.56); however, it was highly statistically significant. Moreover, people from the 99th PRS percentile had almost double the risk of CBP (OR = 1.82, 95% CI: 1.60–2.06). Given the high prevalence of CBP, this translates to an absolute risk of 28%. The PRS had almost the same performance and effect estimates in an independent sample from the TwinsUK cohort, and a model also including age and sex (0.216 × PRS + 0.06 × Sex + 0.005 × Age) produced a slightly higher AUC in TwinsUK (0.58). The current work shows that this model is replicable, and suggests that it can be used further on other samples.
An analysis of the associations of the CBP PRS with ICD-10 and OPCS-4 codes showed that the PRS is associated with a wide variety of different diseases and conditions. As expected in line with Cheverud’s conjecture [24,25], the pattern of PRS associations resembles the pattern of genetic correlation of CBP with other diseases [4,9]. Moreover, the PRS for CBP was significantly associated with five other chronic pain syndromes with almost the same magnitude of the effect, in line with previous work, demonstrating that the genetic determination of chronic pain at these locations is shared to a large extent [4,26]. We can determine several groups of traits associated with PRS: coronary artery disease and its risk factors, obesity and metabolism-related traits, traits related to spine disorders and disc degeneration, traits related to diseases of the digestive system, neurological traits (e.g., anxiety disorders), and arthritis-related disorders. These traits are known as potential risk factors for CBP and could be further examined for causal relationships, for example, using Mendelian randomization (MR). Interestingly, we have recently demonstrated a causal relationship between systolic and diastolic blood pressure and CBP [IN PRESS] using MR, which is consistent with the observed association with hypertension seen in the current study. It should be noted that the prevalence of most of the ICD-10 and OPCS-4 codes in UK Biobank is lower than expected in the general population [14]. For example, M54 “Dorsalgia” has only 3% prevalence; however, the same code has 14% prevalence in another population-based biobank—FinnGen (https://r7.risteys.finngen.fi/phenocode/M13_DORSALGIA, accessed on 30 August 2022). Thus, the obtained OR could be underestimated.
The analysis of PRS-by-environment interaction revealed no statistically significant results. This analysis was designed to approximate the analysis of genotype according to environment interactions [10,27]. Nonetheless, we did observe a nominally significant interaction between the PRS and two risk factors (alcohol consumption and physical activity), although the absolute values of the interaction effect were notably small. These results align with our earlier observation that individual SNP × environment interactions are unlikely to have a substantial impact on CBP, and the expected magnitude of the SNP × environment interaction is expected to be negligible for the commonly studied risk factors [28].
The study has several limitations. Firstly, the model was developed using populations of European British descent and should not be generalized to non-European samples. Secondly, the observed squared Pearson correlation of the PRS and CBP was much smaller than the estimated SNP-based heritability on the observed scale for the same trait (0.007 and 0.04, respectively [4]). This suggests that the sample size of the GWAS used for the development of the model (N = 265,000) was not enough to obtain sufficient power. This means that there may be substantial potential for model improvement, and GWASs with a bigger sample size are needed for the development of a more powerful model. With increasing sample size, we could expect that the predictive ability of the model, as well as its potential for clinical utility, will increase.
To conclude, we developed and validated a genome-wide PRS for CBP based on a recent large-scale GWAS, revealing associations with various related disorders and traits, highlighting the challenges posed by the complex nature and polygenicity of CBP.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jpm13060977/s1, Supplementary Table S1. Description of twelve risk factors used for PRS by environment interaction analysis. Complete covariate names are those used by the UK Biobank; Supplementary Table S2. The results of PRS associations with ICD-10 and OPCS-4 codes; Supplementary Table S3. Results of PRS-by-environment interactions.

Author Contributions

Conceptualization, Y.A.T., Y.S.A., P.S. and F.M.K.W.; methodology, Y.A.T., L.C.K., Y.S.A. and A.V.N.; software, L.C.K. and A.V.N.; validation, R.C., M.B.F., E.E.E. and I.A.K.; formal analysis, Y.A.T., E.E.E., A.V.N., R.C. and I.A.K.; investigation, Y.A.T., E.E.E., R.C., M.B.F. and I.A.K.; resources, Y.A.T. and Y.S.A.; data curation, Y.A.T. and R.C.; writing—original draft preparation, Y.A.T.; writing—review and editing, Y.A.T., M.B.F., F.M.K.W., P.S. and Y.S.A.; visualization, Y.A.T.; supervision, Y.A.T. and Y.S.A.; funding acquisition, Y.A.T. and Y.S.A. All authors have read and agreed to the published version of the manuscript.

Funding

The work of R.C. has been supported by funding from Versus Arthritis (Grant Number 22467) and Marie Skłodowska Curie International Training Network (ITN) “disc4all” (H2020-MSCA-ITN-ETN-2020 GA: 9557). Tsepilov and Elgaeva were supported by the Russian Science Foundation (RSF) grant No. 22-15-20037 and Government of the Novosibirsk region. Suri is an employee of the VA Puget Sound Health Care System and is Director of the University of Washington Clinical Learning, Ev-idence and Research (CLEAR) Center Resource Core, which is funded by NIAMS/NIH P30AR072572. The contents of this work do not represent the views of the US Department of Veterans Affairs, the National Institutes of Health, or the US Government. The data analysis was performed using computational resources of the “Bioinformatics” Joint Computational Center supported by the budget project No. FWNR-2022-0020.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the North West Multi-Centre for Research Ethics Committee (11/NW/0382).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The PRS model (PLINK score file) is available on https://mga.icgbio.ru/PRS/Back_3000_genome.csv.gz, accessed on 30 August 2022.

Acknowledgments

The study was conducted using the UK Biobank Resource under approval #18219.

Conflicts of Interest

YSA and LCK are cofounders and co-owners of PolyOmica and PolyKnomics, private organizations providing services, research, and development in the field of computational and statistical genomics. The other authors declare that they have no competing interests.

References

  1. Vos, T.; Abajobir, A.A.; Abate, K.H.; Abbafati, C.; Abbas, K.M.; Abd-Allah, F.; Abdulkader, R.S.; Abdulle, A.M.; Abebo, T.A.; Abera, S.F.; et al. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. Lancet 2017, 390, 1211–1259. [Google Scholar] [CrossRef] [Green Version]
  2. Manchikanti, L.; Singh, V.; Falco, F.J.E.; Benyamin, R.M.; Hirsch, J.A. Epidemiology of Low Back Pain in Adults. Neuromodulation Technol. Neural Interface 2014, 17, 3–10. [Google Scholar] [CrossRef]
  3. Hartvigsen, J.; Nielsen, J.; Kyvik, K.O.; Fejer, R.; Vach, W.; Iachine, I.; Leboeuf-Yde, C. Heritability of spinal pain and consequences of spinal pain: A comprehensive genetic epidemiologic analysis using a population-based sample of 15,328 twins ages 20-71 years. Arthritis Rheum. 2009, 61, 1343–1351. [Google Scholar] [CrossRef]
  4. Tsepilov, Y.A.; Freidin, M.B.; Shadrina, A.S.; Sharapov, S.Z.; Elgaeva, E.E.; van Zundert, J.; Karssen, L.; Suri, P.; Williams, F.M.K.; Aulchenko, Y.S. Analysis of genetically independent phenotypes identifies shared genetic factors associated with chronic musculoskeletal pain conditions. Commun. Biol. 2020, 3, 329. [Google Scholar] [CrossRef]
  5. Rahman, S.; Winsvold, B.S.; Chavez, S.O.C.; Børte, S.; Tsepilov, Y.A.; Sharapov, S.Z.; Aulchenko, Y.S.; Hagen, K.; Fors, E.A.; Hveem, K.; et al. Genome-wide association study identifies RNF123 locus as associated with chronic widespread musculoskeletal pain. Ann. Rheum. Dis. 2021, 80, 1227–1235. [Google Scholar] [CrossRef] [PubMed]
  6. Suri, P.; Stanaway, I.B.; Zhang, Y.; Freidin, M.B.; Tsepilov, Y.A.; Carrell, D.S.; Williams, F.M.; Aulchenko, Y.S.; Hakonarson, H.; Namjou, B.; et al. Genome-wide association studies of low back pain and lumbar spinal disorders using electronic health record data identify a locus associated with lumbar spinal stenosis. Pain 2021, 162, 2263–2272. [Google Scholar] [CrossRef]
  7. Suri, P.; Palmer, M.R.; Tsepilov, Y.A.; Freidin, M.B.; Boer, C.G.; Yau, M.S.; Evans, D.S.; Gelemanovic, A.; Bartz, T.M.; Nethander, M.; et al. Genome-wide meta-analysis of 158,000 individuals of European ancestry identifies three loci associated with chronic back pain. PLoS Genet. 2018, 14, e1007601. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Bjornsdottir, G.; Stefansdottir, L.; Thorleifsson, G.; Sulem, P.; Norland, K.; Ferkingstad, E.; Oddsson, A.; Zink, F.; Lund, S.H.; Nawaz, M.S.; et al. Rare SLC13A1 variants associate with intervertebral disc disorder highlighting role of sulfate in disc pathology. Nat. Commun. 2022, 13, 634. [Google Scholar] [CrossRef] [PubMed]
  9. Freidin, M.B.; Tsepilov, Y.A.; Palmer, M.; Karssen, L.C.; Suri, P.; Aulchenko, Y.S.; Williams, F.M.; CHARGE Musculoskeletal Working Group. Insight into the genetic architecture of back pain and its risk factors from a study of 509,000 individuals. Pain 2019, 160, 1361–1373. [Google Scholar] [CrossRef]
  10. Lewis, C.M.; Vassos, E. Polygenic risk scores: From research tools to clinical instruments. Genome Med. 2020, 12, 44. [Google Scholar] [CrossRef]
  11. McIntosh, A.M.; Hall, L.S.; Zeng, Y.; Adams, M.J.; Gibson, J.; Wigmore, E.; Hagenaars, S.P.; Davies, G.; Fernandez-Pujals, A.M.; Campbell, A.I.; et al. Genetic and Environmental Risk for Chronic Pain and the Contribution of Risk Variants for Major Depressive Disorder: A Family-Based Mixed-Model Analysis. PLoS Med. 2016, 13, e1002090. [Google Scholar] [CrossRef] [Green Version]
  12. van Reij, R.R.I.; Voncken, J.W.; Joosten, E.A.J.; Hoogen, N.J.V.D. Polygenic risk scores indicates genetic overlap between peripheral pain syndromes and chronic postsurgical pain. Neurogenetics 2020, 21, 205–215. [Google Scholar] [CrossRef]
  13. Sudlow, C.; Gallacher, J.; Allen, N.; Beral, V.; Burton, P.; Danesh, J.; Downey, P.; Elliott, P.; Green, J.; Landray, M.; et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLoS Med. 2015, 12, e1001779. [Google Scholar] [CrossRef] [Green Version]
  14. Fry, A.; Littlejohns, T.J.; Sudlow, C.; Doherty, N.; Adamska, L.; Sprosen, T.; Collins, R.; E Allen, N. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants with Those of the General Population. Am. J. Epidemiol. 2017, 186, 1026–1034. [Google Scholar] [CrossRef] [Green Version]
  15. Loh, P.-R.; Kichaev, G.; Gazal, S.; Schoech, A.P.; Price, A.L. Mixed-model association for biobank-scale datasets. Nat. Genet. 2018, 50, 906–908. [Google Scholar] [CrossRef]
  16. Lloyd-Jones, L.R.; Zeng, J.; Sidorenko, J.; Yengo, L.; Moser, G.; Kemper, K.E.; Wang, H.; Zheng, Z.; Magi, R.; Esko, T.; et al. Improved polygenic prediction by Bayesian multiple regression on summary statistics. Nat. Commun. 2019, 10, 5086. [Google Scholar] [CrossRef] [Green Version]
  17. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 2015, 4, 7. [Google Scholar] [CrossRef]
  18. Verdi, S.; Abbasian, G.; Bowyer, R.C.E.; Lachance, G.; Yarand, D.; Christofidou, P.; Mangino, M.; Menni, C.; Bell, J.T.; Falchi, M.; et al. TwinsUK: The UK Adult Twin Registry Update. Twin Res. Hum. Genet. 2019, 22, 523–529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. MacGregor, A.J.; Andrew, T.; Sambrook, P.N.; Spector, T.D. Structural, psychological, and genetic influences on low back and neck pain: A study of adult female twins. Arthritis Rheum. 2004, 51, 160–167. [Google Scholar] [CrossRef] [PubMed]
  20. Freidin, M.B.; Tsepilov, Y.A.; Stanaway, I.B.; Meng, W.; Hayward, C.; Smith, B.H.; Khoury, S.; Parisien, M.; Bortsov, A.; Diatchenko, L.; et al. Sex- and age-specific genetic analysis of chronic back pain. Pain 2021, 162, 1176–1187. [Google Scholar] [CrossRef] [PubMed]
  21. Elgaeva, E.E.; Tsepilov, Y.; Freidin, M.B.; Williams, F.M.K.; Aulchenko, Y.; Suri, P. ISSLS Prize in Clinical Science 2020. Examining causal effects of body mass index on back pain: A Mendelian randomization study. Eur. Spine J. 2019, 29, 686–691. [Google Scholar] [CrossRef]
  22. Williams, F.M.K.; Elgaeva, E.E.; Freidin, M.B.; Zaytseva, O.O.; Aulchenko, Y.S.; Tsepilov, Y.A.; Suri, P. Causal effects of psychosocial factors on chronic back pain: A bidirectional Mendelian randomisation study. Eur. Spine J. 2022, 31, 1906–1915. [Google Scholar] [CrossRef]
  23. Freidin, M.B.; Tsepilov, Y.A.; Aulchenko, Y.S.; Suri, P.; Williams, F.M. Sitting-to-standing height ratio is a sex-specific risk factor for chronic back pain. Abstracts from the 54th European Society of Human Genetics (ESHG) Conference: E-Posters. Eur. J. Hum. Genet. 2022, 30, 88–608. [Google Scholar] [CrossRef]
  24. Sodini, S.M.; Kemper, K.E.; Wray, N.R.; Trzaskowski, M. Comparison of Genotypic and Phenotypic Correlations: Cheverud’s Conjecture in Humans. Genetics 2018, 209, 941–948. [Google Scholar] [CrossRef] [Green Version]
  25. Lynch, M.; Walsh, B. Genetics and Analysis of Quantitative Traits; Sinauer: Sunderland, MA, USA, 1998. [Google Scholar]
  26. Livshits, G.; Popham, M.; Malkin, I.; Sambrook, P.N.; MacGregor, A.; Spector, T.; Williams, F.M.K. Lumbar disc degeneration and genetic factors are the main risk factors for low back pain in women: The UK Twin Spine Study. Ann. Rheum. Dis. 2011, 70, 1740–1745. [Google Scholar] [CrossRef] [PubMed]
  27. Tang, Y.; You, D.; Yi, H.; Yang, S.; Zhao, Y. IPRS: Leveraging Gene-Environment Interaction to Reconstruct Polygenic Risk Score. Front. Genet. 2022, 13, 801397. [Google Scholar] [CrossRef] [PubMed]
  28. Kuznetsov, I.A.; Tsepilov, Y.A.; Freidin, M.B.; Williams, F.M.; Suri, P.; Aulchenko, Y.S. Genotype-by-environment interactions in chronic back pain. Spine J. 2023; in press. [Google Scholar] [CrossRef]
Figure 1. The scheme of the PRS model development and validation in UK Biobank.
Figure 1. The scheme of the PRS model development and validation in UK Biobank.
Jpm 13 00977 g001
Figure 2. Results of CBP PRS prediction performance. (A) Histograms showing the distribution of PRS among CBP cases and controls. (B) ROC curve of the CBP~PRS prediction model.
Figure 2. Results of CBP PRS prediction performance. (A) Histograms showing the distribution of PRS among CBP cases and controls. (B) ROC curve of the CBP~PRS prediction model.
Jpm 13 00977 g002
Table 1. Top ten associations of CBP PRS with different traits and diseases, ranked by p-value.
Table 1. Top ten associations of CBP PRS with different traits and diseases, ranked by p-value.
Prevalence of the Code/Trait, %p-ValueOdds RatioCode/Trait Description
17.923.56 × 10−1761.24Back pain for 3+ months
16.171.85 × 10−941.18Neck pain for 3+ months
171.49 × 10−741.15Knee pain for 3+ months
9.036.65 × 10−531.17Hip pain for 3+ months
18.43.86 × 10−511.12ICD-10 I10 Essential (primary) hypertension
16.762.60 × 10−491.12OPCS-4 G45 Diagnostic fibreoptic endoscopic examination of upper gastrointestinal tract
3.262.96 × 10−461.26ICD-10 M54 Dorsalgia
4.855.60 × 10−431.2Stomach pain for 3+ months
1.724.55 × 10−321.3ICD-10 M47 Spondylosis
9.211.42 × 10−311.13Head pain for 3+ months
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tsepilov, Y.A.; Elgaeva, E.E.; Nostaeva, A.V.; Compte, R.; Kuznetsov, I.A.; Karssen, L.C.; Freidin, M.B.; Suri, P.; Williams, F.M.K.; Aulchenko, Y.S. Development and Replication of a Genome-Wide Polygenic Risk Score for Chronic Back Pain. J. Pers. Med. 2023, 13, 977. https://doi.org/10.3390/jpm13060977

AMA Style

Tsepilov YA, Elgaeva EE, Nostaeva AV, Compte R, Kuznetsov IA, Karssen LC, Freidin MB, Suri P, Williams FMK, Aulchenko YS. Development and Replication of a Genome-Wide Polygenic Risk Score for Chronic Back Pain. Journal of Personalized Medicine. 2023; 13(6):977. https://doi.org/10.3390/jpm13060977

Chicago/Turabian Style

Tsepilov, Yakov A., Elizaveta E. Elgaeva, Arina V. Nostaeva, Roger Compte, Ivan A. Kuznetsov, Lennart C. Karssen, Maxim B. Freidin, Pradeep Suri, Frances M. K. Williams, and Yurii S. Aulchenko. 2023. "Development and Replication of a Genome-Wide Polygenic Risk Score for Chronic Back Pain" Journal of Personalized Medicine 13, no. 6: 977. https://doi.org/10.3390/jpm13060977

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop