The effect of heteroscedasticity on the prediction efficiency of genome-wide polygenic score for body mass index

Baek, Eun Ju; Jung, Hae-Un; Chung, Ju Yeon; Jung, Hye In; Kwon, Shin Young; Lim, Ji Eun; Kim, Han Kyul; Kang, Ji-One; Oh, Bermseok

doi:10.3389/fgene.2022.1025568

ORIGINAL RESEARCH article

Front. Genet., 07 November 2022

Sec. Statistical Genetics and Methodology

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.1025568

The effect of heteroscedasticity on the prediction efficiency of genome-wide polygenic score for body mass index

Eun Ju Baek¹^†

Hae-Un Jung¹^†

Ji-One Kang²*^‡

Bermseok Oh^1,2*^‡

¹Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, South Korea
²Department of Biochemistry and Molecular Biology, School of Medicine, Kyung Hee University, Seoul, South Korea

Globally, more than 1.9 billion adults are overweight. Thus, obesity is a serious public health issue. Moreover, obesity is a major risk factor for diabetes mellitus, coronary heart disease, and cardiovascular disease. Recently, GWAS examining obesity and body mass index (BMI) have increasingly unveiled many aspects of the genetic architecture of obesity and BMI. Information on genome-wide genetic variants has been used to estimate the genome-wide polygenic score (GPS) for a personalized prediction of obesity. However, the prediction power of GPS is affected by various factors, including the unequal variance in the distribution of a phenotype, known as heteroscedasticity. Here, we calculated a GPS for BMI using LDpred2, which was based on the BMI GWAS summary statistics from a European meta-analysis. Then, we tested the GPS in 354,761 European samples from the UK Biobank and found an effective prediction power of the GPS on BMI. To study a change in the variance of BMI, we investigated the heteroscedasticity of BMI across the GPS via graphical and statistical methods. We also studied the homoscedastic samples for BMI compared to the heteroscedastic sample, randomly selecting samples with various standard deviations of BMI residuals. Further, we examined the effect of the genetic interaction of GPS with environment (GPS×E) on the heteroscedasticity of BMI. We observed the changing variance (i.e., heteroscedasticity) of BMI along the GPS. The heteroscedasticity of BMI was confirmed by both the Breusch-Pagan test and the Score test. Compared to the heteroscedastic sample, the homoscedastic samples from small standard deviation of BMI residuals showed a decreased heteroscedasticity and an improved prediction accuracy, suggesting a quantitatively negative correlation between the phenotypic heteroscedasticity and the prediction accuracy of GPS. To further test the effects of the GPS×E on heteroscedasticity, first we tested the genetic interactions of the GPS with 21 environments and found 8 significant GPS×E interactions on BMI. However, the heteroscedasticity of BMI was not ameliorated after adjusting for the GPS×E interactions. Taken together, our findings suggest that the heteroscedasticity of BMI exists along the GPS and is not affected by the GPS×E interaction.

Introduction

Obesity is a serious public health issue, as it is a major risk factor for diabetes mellitus, coronary heart disease, and cardiovascular disease (Visscher and Seidell, 2001). In 2016, more than 1.9 billion adults were overweight globally, and among them, over 650 million were obese (Mirzaei et al., 2014; World Health Organization, 2020). Moreover, the estimated health care cost driven by the risk factor of obesity was a staggering 480 billion dollars in the United States in 2018, and it continues to rise (Waters and Graf, 2020).

Obesity is typical of a polygenic phenotype and hence influenced by various genetic variants (Silventoinen et al., 2016; O'Connor, 2021). Recent GWAS on obesity and body mass index (BMI) have increasingly unveiled many aspects of the genetic architecture of obesity and BMI (Speliotes et al., 2010; Visscher et al., 2012; Locke et al., 2015; Yengo et al., 2018). As a result, the genome-wide SNP-heritability of BMI was estimated to be from 29% to 44% in the European samples (Hou et al., 2019).

For many common diseases, polygenic inheritance, consisting of multiple genetic variants, plays a greater role than rare monogenic mutations (Khera et al., 2018). Given that, genome-wide polygenic scores (GPSs) for various diseases have been estimated to identify individuals at a clinically increased risk and are expected to contribute to the personalized prediction and prevention of polygenic diseases (Chatterjee et al., 2016; Khera et al., 2018). LDpred is a popular method for deriving GPS based on GWAS summary statistics and a linkage disequilibrium (LD) matrix (Vilhjalmsson et al., 2015). Recently, LDpred2, a newer version of LDpred, was developed, and its computational efficiency is markedly improved and its predictive power is higher than LDpred (Prive et al., 2020). Using LDpred, Khera et al. (2019) created a GPS for BMI and observed a great gradient in weight and the risk of severe obesity across GPS deciles among middle-aged adults (Locke et al., 2015; Khera et al., 2019; Prive et al., 2020).

In general, GPS has been tested using linear regression models between GPS and a phenotype (Khera et al., 2019; Sulc et al., 2020). For linear regression, the two critical assumptions for the data distribution are normality and homoscedasticity (i.e., the equal variance of a phenotype) (Yang et al., 2019). The importance of normality has been enough emphasized when fitting linear regression models, but the importance of homoscedasticity has been often overlooked (Yang et al., 2019). When homoscedasticity is violated, heteroscedasticity occurs in the distribution of a phenotype when the phenotype is fitted in the linear regression model, such that the variance of an individual’s phenotype changes depending on the genotype. The increased degree of heteroscedasticity causes type I errors, yielding a higher rate of false positives (Yang et al., 2019). In particular, when GPS is fitted in a linear regression model, the effect of heteroscedasticity on the prediction efficiency of GPS is not clearly understood yet.

For the unequal variance of a phenotype according to genotypes, there is evidence across different species, such that the variance of phenotypes is genotype dependent (Wolc et al., 2009; Jimenez-Gomez et al., 2011). For example, the FTO variant rs7202116 is associated with a 7% difference in variance in BMI between individuals that are homozygous for different alleles (Yang et al., 2012). This may reflect that individuals carrying only one of the alleles may more easily lose or gain weight, whereas individuals with the other allele may have a more stable BMI. The difference in the variance observed for the FTO variant has been suggested to be due to different responses to activity levels and other lifestyle factors (Yang et al., 2012; Rask-Andersen et al., 2017). The variance quantitative trait locus (vQTL) analysis has provided empirical evidence for the genetic control of phenotypic variance (Tyrrell et al., 2017). Using the vQTL, Wang et al. (2019) suggested that the gene-environment interaction (G×E) effect of a genetic variant on a quantitative trait could lead to the phenotypic heteroscedasticity of a trait among individuals carrying different genetic alleles (Wang et al., 2019). For BMI, a genetic risk score (GRS) of 376 variants explains 5.2% of the trait variance, and the interaction between GRS and environment (GRS×E) contributes an additional 1.9% (Rask-Andersen et al., 2017; Sulc et al., 2020). On the other hand, it is suggested that the heteroscedastic nature of BMI is most likely due to a transformation and not driven by the GRS×E interaction (Sulc et al., 2020).

In this study, we constructed a GPS via LDpred2 using BMI GWAS summary statistics (1,342,646 SNPs) from a European meta-analysis (Locke et al., 2015). When the BMI GPS was validated on 275,809 European samples from the UK Biobank (UKB), we demonstrated the existence of heteroscedasticity of BMI across GPS percentiles. Using the Breusch-Pagan (BP) test and the Score test, we confirmed the heteroscedasticity of BMI across GPS. Further, we showed that using a homoscedastic (or less heteroscedastic) sample, the decreased heteroscedasticity led to an improved prediction efficiency of GPS, suggesting a quantitatively negative relationship between the heteroscedasticity and the prediction accuracy. Further, we investigated the effect of the GPS×E interaction on the heteroscedasticity of BMI. Among 21 environmental factors, we found 8 significant GPS×E interactions on BMI using 216,103 European samples from the UKB. Then we studied whether the heteroscedasticity changed after adjustments for each of 8 GPS×E interactions using BP and Score testes.

Materials and methods

Study population and design

The UKB is a population-based cohort that recruited over 487,409 individuals aged 40–69 years in the United Kingdom during 2006–2010 (Collins, 2012). For quality control, we used the following filter parameters from the Neale lab (https://github.com/Nealelab/UK_Biobank_GWAS): a principal component analysis (PCA) to filter for the selection of unrelated samples; a sex chromosome filter to remove aneuploidy; a principal components (PCs) filter for European sample selection to determine British ancestry; and filters for the selection of self-reported ‘white British’, ‘Irish’, and ‘White’. We then selected 354,761 unrelated European samples from the UKB participants, of which 10,000 samples were used for a linkage disequilibrium (LD) reference, 68,952 samples were used to calculate the candidate GPSs (test set), and 275,809 samples were used to validate a GPS with selected parameters (validation set) (Table 1). Access to the UKB data was granted under application no. 48422 “Gene-environment interaction study on obesity, body mass index, and waist circumference”.

TABLE 1

TABLE 1. Characteristics of the UK Biobank samples.

Ethics approval and consent to participate

The UKB was granted ethical approval to collect data from participants by the North West Multicentre Research Ethics Committee, the National Information Governance Board for Health & Social Care, and the Community Health Index Advisory Group.

Genotype data

At baseline, imputation data for 93,095,623 SNPs were available for 487,409 participants using the UKB Axiom Array and the UK BiLEVE Axiom Array from Affymetrix (Sudlow et al., 2015). Genotyping imputation was performed using the UK10K Project and 1,000 Genome Project Phase 3 reference panels (UK10K Consortium et al., 2015). Quality control was performed based on the following exclusion criteria using PLINK v.1.90: SNPs with missing genotype call rates >0.05; minor allele frequency (MAF) < 0.01; and p-value for Hardy-Weinberg equilibrium test <1.00 × 10⁻⁶ (Purcell et al., 2007). A total of 5,664,578 SNPs were retained for further analysis.

Phenotype data and environmental variables

The BMI value was determined using the height and weight measured during the initial UKB assessment center visit. Individuals with missing BMI data were discarded from the analysis (Table 1).

Based on a previous report (Young et al., 2016), we included 21 environmental variables for genetic interaction studies, discarding individuals with missing environmental variables. “Prefer not to answer” and “I do not know” were set as “missing” in our analyses (Supplementary Table S1). The 21 environmental variables included 12 dietary variables (beef intake, bread intake, cheese intake, cooked vegetable intake, lamb/mutton intake, non-oily fish intake, oily fish intake, port intake, poultry intake, processed meat intake, salt added to food, and tea intake), 6 lifestyle variables (alcohol intake frequency, sleep duration, sleep duration residual squared, smoking status, townsend deprivation index (TDI), and time spent watching television (TV)), and 3 physical activity variables (number of days/week walked 10+ minutes, number of days/week moderate physical activity 10+ minutes, and number of days/week vigorous physical activity 10+ minutes). For ‘sleep duration residual squared’, for each individual, we calculated, the squared deviations from the mean sleep duration.

Creation of genome-wide polygenic score

Candidate GPSs were derived using the LDPred2 computational algorithm that is based on a Bayesian approach using an LD matrix and GWAS summary statistics, implemented in the R package bigsnpr (Prive et al., 2020). LDpred2 provides more hyper-parameters (102 grids) than LDpred1 (7 grids). Among the models, the infinitesimal model assumes that all the genetic variants are causal. The grid model tunes the hyper-parameters of SNP heritability (h²), the proportion of causal variants (p), and the optional sparsity to reweight the variant effect to the phenotype. The SNP heritability (h²) was estimated using LD Score regression between summary statistics and LD score from the European-ancestry samples in the 1000 Genomes Project (Bulik-Sullivan et al., 2015). The grid sparsity is an option to reduce SNPs by fitting some effects to zero, providing a sparse vector of the effects. Hence, the sparsity may reduce the computing time without losing the predictive accuracy (Prive et al., 2020).

To calculate a GPS, we obtained the GWAS summary statistics on the BMI from the European meta-analysis (Locke et al., 2015). For the LD reference, using 10,000 European samples, we computed the LD correlation matrix among 1,342,646 SNPs consisting of a common set of both HapMap3 variants and the BMI GWAS meta-summary statistics (International HapMap 3 Consortium et al., 2010; Yengo et al., 2018; Prive et al., 2020). Using 344,761 European samples, we estimated 3 models (infinitesimal, grid, and grid sparse model) to construct the GPS. For the infinitesimal model, where all the markers are causal, we estimated the infinitesimal model using a total of 344,761 participants. For the grid models, we tested a total of 102 grid models of hyper-parameters, consisting of p (proportion of causal variants, 17 values from 0.0001 to 1), h²LDSC (heritability, 3 values at 0.1027, 0.1467, and 0.2054), and the existence of sparsity. Using 68,952 European samples (test set) for the grid models, the best model was determined by the adjusted R² (the explained phenotypic variance) via a linear regression model between the GPS and a phenotype, which was adjusted for age, sex, array, and 10 PCs (BMI ∼ GPS + age + sex + array +10 PCs) (Table 1, Supplementary Table S2, and Supplementary Figure S1). Once the optimal hyper-parameters were set, 275,809 European participants were used as a validation set (Table 1, Supplementary Table S2, and Supplementary Table S3).

To estimate the prediction accuracy of the GPSs, we used the adjusted R² and mean squared error (MSE) in a linear regression model between GPS and BMI and adjusted for age, sex, array, and 10 PCs.

Statistical analysis

We performed an association analysis via linear regression, and a variance test, such as the Fligner Killeen test (FK-test), scatter plotting, and density plotting using the R stats package (version 4.0.2; www.r-project.org). To draw the scatter plot and density plot, the “ggplot2” package was used. Linear regression modeling was used to estimate the prediction accuracy of GPS (BMI ∼ GPS + age + sex + array +10 PCs) and the effect of gene-environment interaction (GPS×E) on BMI (BMI ∼ GPS + E + GPS×E + age + sex + array +10 PCs). When the heteroscedasticity existed, as a sensitivity analysis, we performed the robust regression analysis by calculating the robust (or “White”) standard error of the GPS×E interaction effects, using the vce (robust) option in STATA (Cribari-Neto, 2004; Tyrrell et al., 2017).

The Breusch-Pagan (BP) and Score tests were used to assess the heteroscedasticity of BMI across the GPS. The FK-test was used to compare the variances between the GPS quintile (Q) subgroups. To compare the interaction effect size between Q1 and Q5, we computed p-values (P_difference) by testing for difference between Q1 and Q5 beta-estimates with their corresponding standard errors SE1 and SE5 using the t statistic (Randall et al., 2013).

For quantifying the relationship between the heteroscedasticity of BMI and the prediction accuracy of GPS, linear regression analyses were performed using randomly selected samples with various SD cutoffs from our UKB validation set (275,809 participants). First, 10,000 participants were randomly selected from each group with a SD cutoff (from 0.25 to 2 by 0.25 increment) of the BMI residual mean and this random selection was repeated in each group until 100 times. Then, the linear regression model was adjusted for age, sex, array, and 10 PCs. In each model, R² and MSE were estimated, and BP and Score tests were performed. Then, the correlation analysis between the prediction accuracy (R² or MSE) and heteroscedasticity statistics (χ² of BP or Score test) was performed using the statistics from all 800 models.

Results

Construction of GPS for BMI

To estimate the GPS, we used the GWAS summary statistics on BMI from the European meta-analysis (Locke et al., 2015) and the LD correlation matrix. We then tested 3 models (infinitesimal, grid, and grid sparse model) to construct the GPS using 68,952 unrelated Europeans from the UKB participants (the infinitesimal model used both the test and validation sets). The best model for GPS was the grid model (for grid: p = 0.18, h² = 0.2054, and R² = 0.3192), which was based on the R² when fitting to a linear regression model (Supplementary Table S2).

To visualize the relationship between BMI and GPS, we depicted the BMI scatter plots for the 3 GPS models (infinitesimal, grid, and grid sparse) using the validation set (the infinitesimal model used both the test and validation sets). We observed that the BMI variance tended to increase across the GPS in all 3 models (Figures 1A,C,E). To estimate how much the variance changed, we binned GPS into 5 quintiles (Q) subgroups and plotted the mean value and the BMI variance per quintile group (Figures 1B,D,F; Table 2). These plots showed that the BMI variance increased according to the GPS quintiles.

FIGURE 1

FIGURE 1. Scatterplot and box plot of BMI across GPS. (A,C,E) BMI was plotted against GPS for each GPS model, using the validation set. (B,D,F) The mean and variance of BMI were depicted by box plot per GPS quintile subgroup. (A,B) Infinitesimal model using both test and validation sets, (C,D) grid sparse model using only the validation set, and (E,F) grid model using only the validation set.

TABLE 2

TABLE 2. The mean, SD, and variance of BMI for each GPS quintile group.

To see the distribution of the obese versus normal subjects, we depicted the population density plot for each GPS model. The mean GPS shifted to the right for the obese subjects (BMI ≥30 kg/m²) versus the normal subjects (BMI <30 kg/m²), suggesting that all 3 GPS models distinguished between the obese and normal groups (Supplementary Figure S2). All 3 GPS models showed significant associations with BMI based on the linear regression model that was adjusted for age, sex, genotyping array, and 10 PCs (p-value < 2.00 × 10⁻¹⁶) (Supplementary Table S3). Consistently, when using the test and validation sets, the grid model for GPS showed the best R² and the explained phenotypic variance compared to the null model (R² Grid = 0.3216, R²Grid sparse = 0.3185, and R²Infinitesimal = 0.3022). Hence, this Grid model was used in the following studies (Supplementary Tables S2, S3).

Heteroscedasticity of BMI across GPS

To better assess the changes in the BMI variance, we estimated the residuals of the individuals (the difference between a set of observed and predicted values) using linear regression for BMI and adjusted for age, sex, genotyping array, and 10 PCs and then plotted the averaged BMI residuals according to the GPS percentiles (Figure 2). The averaged residuals plot showed a great increase in BMI residuals as the GPS increased, suggesting a violation of homoscedasticity, i.e., heteroscedasticity (Figure 2).

FIGURE 2

FIGURE 2. Plot of BMI residuals by GPS percentile. Samples were binned into 100 equally sized groups by GPS. In each group, the mean of residuals of BMI was calculated and plotted against the mean value of GPS in the GPS percentile group.

For a statistical assessment of heteroscedasticity, we performed the BP and Score tests on the GPS total and quintile subgroups. Both tests statistically confirmed a heteroscedastic inequality in the residuals according to the increasing GPS, with the highest heteroscedasticity in Q5 and the smallest in Q1, among the 5 quintiles (Table 3) (Rosopa et al., 2013). We then also assessed whether there is variation in the residuals between GPS quintile groups via the FK test. The results showed statistically significant discrepancies in the residuals between the GPS Q1 and the other 4 quintile groups (Q2-5) (Supplementary Table S4). We then assessed the prediction accuracy of GPS via the MSE of the linear regression model. The results showed an increasing MSE (i.e., a decreasing accuracy) as the GPS quintiles increased (Table 3). The residuals plot and heteroscedasticity analyses suggested that BMI followed a heteroscedastic distribution across the GPS subgroups and the increased heteroscedasticity might lead to GPS imprecision.

TABLE 3

TABLE 3. The heteroscedasticity of BMI according to GPS quintile.

In general, when fitting linear regression models, the normality of a phenotype is a critical factor for achieving better power performance and less inflated type I errors (Chien, 2020). In addition, the normality of a phenotype seems to affect the heteroscedasticity (Sulc et al., 2020). To further test whether the skewed (or non-normal) distribution of the phenotype drives the heteroscedasticity, we performed heteroscedasticity analyses (i.e., BP and Score tests) following normal transformation of BMI using 3 different methods [log-, Box-Cox, and rank-based inverse normal transformation (INT)] (Supplementary Figures S3, S4) (Buzkova, 2013; Koenker, 2022; Chien, 2020). The association analyses showed that all 3 normalizations of BMI improved the prediction accuracy of GPS based on R² and MSE, compared to non-normalized BMI, except that the MSE for logBMI worsened (Supplementary Table S5). The heteroscedasticity greatly decreased when using normalized BMIs compared to non-normalized BMI, of which the INT transformation almost removed the heteroscedasticity, the Box-Cox method greatly decreased the heteroscedasticity, and the log-transformation weakly decreased it (Supplementary Table S5). Our findings suggest that for BMI, all 3 normalizations are effective to correct the heteroscedasticity and accordingly to improve the prediction accuracy of BMI GPS (Nwakuya and Nwabueze, 2018).

Analysis of homoscedastic samples for BMI across GPS

To better understand the heteroscedasticity, we examined the homoscedastic (or less heteroscedastic) samples in relation to the prediction efficiency. Two different homoscedastic (or less heteroscedastic) samples were extracted from the samples with 1 standard deviation (SD) and 2SD of the residual mean of the linear regression for BMI (Supplementary Figurers S5A,D). The selected 1SD and 2SD samples were binned into 5 quintile subgroups. The plots of the mean and variance for BMI showed a smaller variation for the 1SD and 2SD samples for BMI according to the GPS quintiles (Supplementary Figurers S5B,E). Consistently, the plots of the BMI residuals showed a reduced increase for the 2SD sample and were almost unchanged for the 1SD across the GPS percentiles (Supplementary Figurers S5C,F). Both the BP and Score tests showed consistent results, such that the degree of heteroscedasticity decreased in the 2SD sample and dramatically dropped in the 1SD sample, suggesting the 2SD sample was a less heteroscedastic sample and 1SD was a homoscedastic sample (Supplementary Table S5).

We then assessed the prediction accuracy of GPS via the adjusted R² or MSE of the linear regression model. The R² was 0.6095 for the 1SD samples (N = 201,568) and 0.3746 for the 2SD samples (N = 263,310), showing an increase greater than 0.3216 for the total samples (N = 275,809) (Supplementary Table S5). Consistent with the R², the MSE decreased in the 2SD sample compared to the total samples and more greatly decreased for the 1SD (Supplementary Table S5). These results implied that the increased degree of heteroscedasticity might adversely affect the prediction efficiency of GPS.

To further quantify the relationship between the phenotypic heteroscedasticity and the prediction accuracy of GPS, we performed linear regression analyses on GPS using randomly selected samples with various SD cutoffs (from 0.25 to 2 by 0.25 increment) from our UKB validation set (275,809 participants). The results showed that as the SD cutoffs increased, the averaged prediction accuracy (R² and MSE) of GPS decreased and the averaged heteroscedasticity (χ² of BP and Score test) increased (Supplementary Table S6). Next, the correlation analysis between the prediction accuracy (R² or MSE) and heteroscedasticity statistics (χ² of BP and Score test) suggested a strong negative correlation between them (R² vs χ²BP = -0.86, p = 4.05E-232; R² vs χ²score = -0.90, p = 8.99E-296; MSE vs χ²BP = 0.86, p = 3.49E-232; MSE vs χ²score = 0.90, p = 4.07E-293) (Supplementary Table S6). Moreover, the χ² values of heteroscedasticity were plotted against the GPS accuracy. The plots showed an exponential distribution of heteroscedasticity against the accuracy of GPS, suggesting that the prediction accuracy may greatly increase when the degree of heteroscedasticity is low (χ² < 150 for BP; < 200 for Score test) while the accuracy may slightly vary or be unchanged when the heteroscedasticity is too high (χ² > 150 for BP; > 200 for Score test) (Figure 3).

FIGURE 3

FIGURE 3. Plot of the heteroscedasticity versus the prediction accuracy of GPS in randomly selected samples. Linear regression analyses on GPS were performed using randomly selected samples with various SD cutoffs (from 0.25 to 2 by 0.25 increment) in the UKB validation set. The random selection of 10,000 samples were repeated 100 times and each sample was used for a linear regression analysis. (A,B) The heteroscedasticity (χ² of BP in the upper panel and Score test in the lower panel) in each sample was plotted against the prediction accuracy (R² in A and MSE in B) of GPS.

Among the GPS quantile groups from the total heteroscedastic samples, 64.92% of Q1 were normal weight, 31.6% were overweight, and 3.48% were obese, while 7.95% of Q5 were normal, 36.09% were overweight and 55.96% were obese (Supplementary Figure S6). In the Q1 of the 2SD samples, 65.77% were normal weight, 31.99% were overweight, and 2.24% were obese, while in Q5, 6.83% were normal weight, 40.12% were overweight, and 53.06% were obese. In the Q1 of the 1SD samples, 70.61% were normal weight, 29.39% were overweight, and nobody was obese, while in the Q5, 0.02% were normal weight, 41.95% were overweight, and 58.04% were obese (Supplementary Figure S6). Consistent with the high R², the GPS in the 1SD samples targeted a narrower range of obesity than in the total samples, indicating a greater prediction power in the homoscedastic (or less heteroscedastic) sample than in the heteroscedastic sample.

Heteroscedasticity of BMI attributed to the GPS×E interaction

Since there have been controversies surrounding the role of the G×E interaction in phenotypic heteroscedasticity (Wang et al., 2019; Sulc et al., 2020), we aimed to determine the effect of the GPS×E interaction on the heteroscedasticity of BMI. To select the environmental variables for the GPS×E, we tested associations between BMI and 21 environmental variables that had been previously reported to be associated with BMI (Supplementary Table S7) (Young et al., 2016). Among the 21 variables, 20 showed significant associations between BMI and environmental variables after Bonferroni correction (p-value < 0.05/21 = 0.002) (Supplementary Table S7). We then performed 20 GPS×E interaction analyses using a linear regression model that was adjusted for age, sex, genotyping array, and 10 PCs. Of the 20 environmental variables, 15 GPS×E interactions satisfied the statistical significance threshold (p-value < 0.05/20 = 0.0025) (Supplementary Table S8). Because heteroscedasticity was present, as a sensitivity analysis, we performed the robust regression analysis, the “White test,” for the interaction studies (Tyrrell et al., 2017). The results showed that 14 GPS×E interactions were still significant via the White test, excluding the overriding of the false positives (Supplementary Table S8). As another sensitivity assay, we tested whether the GPS×E interaction effects differed between the GPS Q1 and Q5 groups. The linear regression analyses for the GPS×E interaction were performed for the GPS Q1 and Q5 groups, followed by the stratification analysis between the Q1 and Q5 groups using the GPS×E interaction results for each group. Among the 14 environmental variables, 8 GPS×E interactions met the significance threshold after Bonferroni correction (p-value < 0.05/20 = 0.0025; pork intake, processed meat intake, tea intake, alcohol intake frequency, TDI, number of days/week walked 10+ minutes, number of days/week of moderate physical activity 10+ minutes, and number of days/week of vigorous physical activity 10+ minutes) (Supplementary Table S8). Taken together, our results implicated at least 8 significant GPS×E interactions on BMI.

We further studied whether GPS×E interactions are specific to the measured environmental factor or represent a more general pattern of moderation of the total variance in BMI by the environmental factor. We applied the heteroscedastic GPS×E regression model for our BMI GPS on BMI including the GPS×E term. The results showed that 6 GPS×E interactions (processed meat intake, alcohol intake, TDI, number of days/week walked 10+ minutes, and number of days/week of moderate/vigorous physical activity 10+ minutes) were significantly specific to the environmental factors, while the other 2 interactions (pork and tea intake) might present an environmental heteroscedasticity (Supplementary Table S9). The results of the heteroscedastic GPS×E regression modelling suggest that 2 of 8 GPS×E interactions may not be true positive interactions.

To study a change in the heteroscedastic variance of BMI attributable to the GPS×E interaction, we performed the BP and Score tests via linear regression of BMI and GPS after adjusting for each of the 8 GPS×E interactions. The BP and Score tests showed that the heteroscedasticity increased after the adjustment for each of the 8 GPS×E interactions compared to the sample without the adjustment (Supplementary Table S10). These results suggested that the GPS×E interactions might not always contribute to the heteroscedasticity of BMI. Further, we studied whether the adjustment for GPS×E interaction could alter the prediction accuracy of GPS. We found that based on R² and MSE, the prediction accuracy of GPS slightly increased after adjusting for GPS×E interactions (Supplementary Table S10).

Discussion

We estimated a GPS composed of 1.3 million SNPs from the BMI GWAS meta-summary statistics using LDpred2 (Locke et al., 2015; Prive et al., 2020). We observed that the standard deviations of BMI varied across the GPS percentiles and statistically confirmed the heteroscedasticity. Furthermore, our findings suggested an improved prediction efficiency of GPS in the homoscedastic sample compared to the heteroscedastic sample.

Previous studies have acknowledged the presence of heteroscedasticity in studies using large sample sizes for GWAS (Tyrrell et al., 2017; Sulc et al., 2020). However, those studies mainly focused on the role of heteroscedasticity resulting in false positives for the GRS×E interactions, not on its role in the efficient prediction of the GPS. Therefore, our study is the first to manifest the heteroscedasticity of BMI across GPS and further show the possible modulation of heteroscedasticity on the prediction power of the GPS. We then tested the effect of the GPS×E interaction on heteroscedasticity, because previous studies suggest that G×E interactions may contribute to the heteroscedasticity of phenotypes across different genotypes (Yang et al., 2012; Rask-Andersen et al., 2017; Tyrrell et al., 2017; Wang et al., 2019; Marderstein et al., 2021). However, we found that the heteroscedasticity remained significant after adjusting for the GPS×E interaction. Regarding the controversies over the origin of the phenotypic variation by genotypes, our findings support a previous report that the heteroscedasticity of BMI is not driven by the G×E interaction (Sulc et al., 2020). Despite the increased heteroscedasticity, the prediction accuracy of GPS slightly increased after adjustment for GPS×E interactions. Our previous results using randomly selected samples with various SD cutoffs suggested a negative relationship between the heteroscedasticity and the prediction accuracy of GPS. However, the results after adjusting for GPS×E interactions are inconsistent with the negative correlation between heteroscedasticity and accuracy. We assume that the prediction accuracy of GPS may slightly vary or be unchanged when the heteroscedasticity is too high since the accuracy exponentially decreases as the heteroscedasticity increases. Our sample for studying GPS×E interactions presents extremely high heteroscedasticity. Hence, we presume that our sample may not be an appropriate sample that can differentiate the effect of GPS×E interaction on the prediction accuracy of GPS.

We found at least 8 significant GPS×E interactions for BMI. Previous studies using GRSs based on genome-widely significant BMI-associated variants suggest that the genetic effects on BMI are modulated by various lifestyle factors, such as diet (red/processed meat intake), alcohol intake frequency, usual walking pace, TDI, and moderate/vigorous physical activity (Rask-Andersen et al., 2017; Tyrrell et al., 2017; Ding et al., 2018). Our findings using 8 GPS×E interactions were consistent with previous reports, supporting the modulation of genetic factors on BMI by 4 environmental factors (i.e., diet, alcohol, TDI, and physical activity).

Although we constructed a GPS for BMI with high prediction power and demonstrated the presence of heteroscedasticity, we acknowledge several limitations in our study. First, we studied the heteroscedasticity of BMI across GPS but were not able to identify the possible causes for heteroscedasticity. However, we most likely ruled out the role of the G×E interaction in causing heteroscedasticity. For BMI, rare genetic mutations, such as the loss of functions or rare variants (MAF <0.01), have been reported to contribute considerably to its heritability (Akbari et al., 2021; Wainschtein et al., 2022). The contribution of rare genetic variants to heteroscedasticity and the prediction accuracy is not clearly understood yet. Hence, our hope for identifying the original cause for heteroscedasticity and hence, improving the prediction accuracy of GPS is largely unexplored yet. Second, our study focused on BMI, and thus, caution is noted in applying our results to other traits. Despite an important assumption, heteroscedasticity is often overlooked in linear regression models (Yang et al., 2019). Thus far, the association of phenotypic heteroscedasticity and GPS has not been studied in GPS models for other traits, regarding the prediction power of GPS. We believe that our study provides a good basis for the effect of phenotypic heteroscedasticity on GPS.

GPS is a forefront study that warrants more careful characterization for its prediction power in different backgrounds. We believe that our findings brush away a flow in the crystal and hence, provide a good basis for using GPS for predicting individuals at genetically increased risk.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics statement

The studies involving human participants were reviewed and approved by North West Multicentre Research Ethics Committee. The patients/participants provided their written informed consent to participate in this study.

Author contributions

EB, HJ, J-OK, and BO designed the study. EB and HJ analyzed the data and wrote the first draft of the manuscript. J-OK revised the manuscript. JC, HJ, SK, JL, and HK provided technical support. All authors contributed to the interpretation of the results and critical revision of the manuscript for important intellectual content and approved the final version of the manuscript.

Funding

This work was supported by a grant from the National Research Foundation of Korea (NRF), funded by the Korean government (MSIT), and the Bio and Medical Technology Development Program of the NRF (grant numbers: 2019M3E5D3073365, NRF-2019R1A2C1083980).

Acknowledgments

This study was conducted using the UK Biobank (Application 48422).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.1025568/full#supplementary-material

References

Akbari, A., Padidar, K., Salehi, N., Mashayekhi, M., Almadani, N., Sadighi Gilani, M. A., et al. (2021). Rare missense variant in MSH4 associated with primary gonadal failure in both 46, XX and 46, XY individuals. Hum. Reprod. 36 (4), 1134–1145. doi:10.1093/humrep/deaa362

PubMed Abstract | CrossRef Full Text | Google Scholar

Bulik-Sullivan, B. K., Loh, P. R., Finucane, H. K., Ripke, S., Yang, J., Patterson, N., et al. (2015). LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47 (3), 291–295. doi:10.1038/ng.3211

PubMed Abstract | CrossRef Full Text | Google Scholar

Buzkova, P. (2013). Linear regression in genetic association studies. PLoS One 8 (2), e56976. doi:10.1371/journal.pone.0056976

PubMed Abstract | CrossRef Full Text | Google Scholar

Chatterjee, N., Shi, J., and Garcia-Closas, M. (2016). Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat. Rev. Genet. 17 (7), 392–406. doi:10.1038/nrg.2016.27

PubMed Abstract | CrossRef Full Text | Google Scholar

Chien, L. C. (2020). A rank-based normalization method with the fully adjusted full-stage procedure in genetic association studies. PLoS One 15 (6), e0233847. doi:10.1371/journal.pone.0233847

PubMed Abstract | CrossRef Full Text | Google Scholar

Collins, R. (2012). What makes UK Biobank special? Lancet 379 (9822), 1173–1174. doi:10.1016/S0140-6736(12)60404-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Cribari-Neto, F. (2004). Asymptotic inference under heteroskedasticity of unknown form. Comput. Stat. Data Anal. 45 (2), 215–233. doi:10.1016/S0167-9473(02)00366-3

CrossRef Full Text | Google Scholar

Ding, M., Ellervik, C., Huang, T., Jensen, M. K., Curhan, G. C., Pasquale, L. R., et al. (2018). Diet quality and genetic association with body mass index: Results from 3 observational studies. Am. J. Clin. Nutr. 108 (6), 1291–1300. doi:10.1093/ajcn/nqy203

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou, K., Burch, K. S., Majumdar, A., Shi, H., Mancuso, N., Wu, Y., et al. (2019). Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture. Nat. Genet. 51 (8), 1244–1251. doi:10.1038/s41588-019-0465-0

PubMed Abstract | CrossRef Full Text | Google Scholar

International HapMap 3 Consortium Altshuler, D. M., Gibbs, R. A., Peltonen, L., Altshuler, D. M., Gibbs, R. A., et al. (2010). Integrating common and rare genetic variation in diverse human populations. Nature 467 (7311), 52–58. doi:10.1038/nature09298

PubMed Abstract | CrossRef Full Text | Google Scholar

Jimenez-Gomez, J. M., Corwin, J. A., Joseph, B., Maloof, J. N., and Kliebenstein, D. J. (2011). Genomic analysis of QTLs and genes altering natural variation in stochastic noise. PLoS Genet. 7 (9), e1002295. doi:10.1371/journal.pgen.1002295

PubMed Abstract | CrossRef Full Text | Google Scholar

Khera, A. V., Chaffin, M., Aragam, K. G., Haas, M. E., Roselli, C., Choi, S. H., et al. (2018). Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50 (9), 1219–1224. doi:10.1038/s41588-018-0183-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Khera, A. V., Chaffin, M., Wade, K. H., Zahid, S., Brancale, J., Xia, R., et al. (2019). Polygenic prediction of weight and obesity trajectories from birth to adulthood. Cell 177 (3), 587–596. doi:10.1016/j.cell.2019.03.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Koenker, R. (2022). Quantile regression. Available at: https://cran.r-project.org/web/packages/quantreg/quantreg.pdf.

Google Scholar

Locke, A. E., Kahali, B., Berndt, S. I., Justice, A. E., Pers, T. H., Day, F. R., et al. (2015). Genetic studies of body mass index yield new insights for obesity biology. Nature 518 (7538), 197–206. doi:10.1038/nature14177

PubMed Abstract | CrossRef Full Text | Google Scholar

Marderstein, A. R., Davenport, E. R., Kulm, S., Van Hout, C. V., Elemento, O., and Clark, A. G. (2021). Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am. J. Hum. Genet. 108 (1), 49–67. doi:10.1016/j.ajhg.2020.11.016

PubMed Abstract | CrossRef Full Text | Google Scholar

Mirzaei, K., Xu, M., Qi, Q., de Jonge, L., Bray, G. A., Sacks, F., et al. (2014). Variants in glucose- and circadian rhythm-related genes affect the response of energy expenditure to weight-loss diets: the POUNDS LOST trial. Am. J. Clin. Nutr. 99 (2), 392–399. doi:10.3945/ajcn.113.072066

PubMed Abstract | CrossRef Full Text | Google Scholar

Nwakuya, M. T., and Nwabueze, J. C. (2018). Application of box-cox transformation as a corrective measure to heteroscedasticity using an economic data. Am. J. Math. Stat. 8(1), 8–12. doi:10.5923/j.ajms.20180801.02

CrossRef Full Text | Google Scholar

O'Connor, L. J. (2021). The distribution of common-variant effect sizes. Nat. Genet. 53 (8), 1243–1249. doi:10.1038/s41588-021-00901-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Prive, F., Arbel, J., and Vilhjalmsson, B. J. (2020). LDpred2: Better, faster, stronger. Bioinformatics 36, 5424–5431. doi:10.1093/bioinformatics/btaa1029

CrossRef Full Text | Google Scholar

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81 (3), 559–575. doi:10.1086/519795

PubMed Abstract | CrossRef Full Text | Google Scholar

Randall, J. C., Winkler, T. W., Kutalik, Z., Berndt, S. I., Jackson, A. U., Monda, K. L., et al. (2013). Sex-stratified genome-wide association studies including 270, 000 individuals show sexual dimorphism in genetic loci for anthropometric traits. PLoS Genet. 9 (6), e1003500. doi:10.1371/journal.pgen.1003500

PubMed Abstract | CrossRef Full Text | Google Scholar

Rask-Andersen, M., Karlsson, T., Ek, W. E., and Johansson, A. (2017). Gene-environment interaction study for BMI reveals interactions between genetic factors and physical activity, alcohol consumption and socioeconomic status. PLoS Genet. 13 (9), e1006977. doi:10.1371/journal.pgen.1006977

PubMed Abstract | CrossRef Full Text | Google Scholar

Rosopa, P. J., Schaffer, M. M., and Schroeder, A. N. (2013). Managing heteroscedasticity in general linear models. Psychol. Methods 18 (3), 335–351. doi:10.1037/a0032553

PubMed Abstract | CrossRef Full Text | Google Scholar

Silventoinen, K., Jelenkovic, A., Sund, R., Hur, Y. M., Yokoyama, Y., Honda, C., et al. (2016). Genetic and environmental effects on body mass index from infancy to the onset of adulthood: an individual-based pooled analysis of 45 twin cohorts participating in the COllaborative project of development of anthropometrical measures in twins (CODATwins) study. Am. J. Clin. Nutr. 104 (2), 371–379. doi:10.3945/ajcn.116.130252

PubMed Abstract | CrossRef Full Text | Google Scholar

Speliotes, E. K., Willer, C. J., Berndt, S. I., Monda, K. L., Thorleifsson, G., Jackson, A. U., et al. (2010). Association analyses of 249, 796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42 (11), 937–948. doi:10.1038/ng.686

PubMed Abstract | CrossRef Full Text | Google Scholar

Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., et al. (2015). UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12 (3), e1001779. doi:10.1371/journal.pmed.1001779

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulc, J., Mounier, N., Gunther, F., Winkler, T., Wood, A. R., Frayling, T. M., et al. (2020). Quantification of the overall contribution of gene-environment interaction for obesity-related traits. Nat. Commun. 11 (1), 1385. doi:10.1038/s41467-020-15107-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Tyrrell, J., Wood, A. R., Ames, R. M., Yaghootkar, H., Beaumont, R. N., Jones, S. E., et al. (2017). Gene-obesogenic environment interactions in the UK Biobank study. Int. J. Epidemiol. 46 (2), 559–575. doi:10.1093/ije/dyw337

PubMed Abstract | CrossRef Full Text | Google Scholar

UK10K Consortium Walter, K., Min, J. L., Huang, J., Crooks, L., Memari, Y., et al. (2015). The UK10K project identifies rare variants in health and disease. Nature 526 (7571), 82–90. doi:10.1038/nature14962

PubMed Abstract | CrossRef Full Text | Google Scholar

Vilhjalmsson, B. J., Yang, J., Finucane, H. K., Gusev, A., Lindstrom, S., Ripke, S., et al. (2015). Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97 (4), 576–592. doi:10.1016/j.ajhg.2015.09.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Visscher, T. L., and Seidell, J. C. (2001). The public health impact of obesity. Annu Rev Public Health 22, 355–375. doi:10.1146/annurev.publhealth.22.1.355

PubMed Abstract | CrossRef Full Text | Google Scholar

Visscher, P. M., Brown, M. A., McCarthy, M. I., and Yang, J. (2012). Five years of GWAS discovery. Am. J. Hum. Genet. 90 (1), 7–24. doi:10.1016/j.ajhg.2011.11.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Wainschtein, P., Jain, D., Zheng, Z., Cupples, L. A., et al. TOPMed Anthropometry Working GroupNHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium (2022). Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data. Nat. Genet. 54 (3), 263–273. doi:10.1038/s41588-021-00997-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, H., Zhang, F., Zeng, J., Wu, Y., Kemper, K. E., Xue, A., et al. (2019). Genotype-by-environment interactions inferred from genetic effects on phenotypic variability in the UK Biobank. Sci. Adv. 5 (8), eaaw3538. doi:10.1126/sciadv.aaw3538

PubMed Abstract | CrossRef Full Text | Google Scholar

Waters, H., and Graf, M. (2020). America’s obesity crisis: the health and economic costs of excess weight. Miliken Institute.

Google Scholar

Wolc, A., White, I. M., Avendano, S., and Hill, W. G. (2009). Genetic variability in residual variation of body weight and conformation scores in broiler chickens. Poult. Sci. 88 (6), 1156–1161. doi:10.3382/ps.2008-00547

PubMed Abstract | CrossRef Full Text | Google Scholar

World Health Organization (2020). WHO guideline on use of ferritin concentrations to assess iron status in populations. World Health Organization.

Google Scholar

Yang, J., Loos, R. J., Powell, J. E., Medland, S. E., Speliotes, E. K., Chasman, D. I., et al. (2012). FTO genotype is associated with phenotypic variability of body mass index. Nature 490 (7419), 267–272. doi:10.1038/nature11401

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, K., Tu, J., and Chen, T. (2019). Homoscedasticity: an overlooked critical assumption for linear regression. Gen. Psychiatr. 32 (5), e100148. doi:10.1136/gpsych-2019-100148

PubMed Abstract | CrossRef Full Text | Google Scholar

Yengo, L., Sidorenko, J., Kemper, K. E., Zheng, Z., Wood, A. R., Weedon, M. N., et al. (2018). Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum. Mol. Genet. 27 (20), 3641–3649. doi:10.1093/hmg/ddy271

PubMed Abstract | CrossRef Full Text | Google Scholar

Young, A. I., Wauthier, F., and Donnelly, P. (2016). Multiple novel gene-by-environment interactions modify the effect of FTO variants on body mass index. Nat. Commun. 7, 12724. doi:10.1038/ncomms12724

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: heteroscedasticity, body mass index, prediction efficiency, gene-environment interaction, genome-wide polygenic risk score

Citation: Baek EJ, Jung H-U, Chung JY, Jung HI, Kwon SY, Lim JE, Kim HK, Kang J-O and Oh B (2022) The effect of heteroscedasticity on the prediction efficiency of genome-wide polygenic score for body mass index. Front. Genet. 13:1025568. doi: 10.3389/fgene.2022.1025568

Received: 24 August 2022; Accepted: 25 October 2022;
Published: 07 November 2022.

Edited by:

Ching-Ti Liu, Boston University, United States

Reviewed by:

Yuxuan Wang, Boston University, United States
Xiaowei Hu, University of Virginia, United States

Copyright © 2022 Baek, Jung, Chung, Jung, Kwon, Lim, Kim, Kang and Oh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ji-One Kang, jionekang@yahoo.com; Bermseok Oh, ohbs@khu.ac.kr

^†These authors share first authorship

^‡These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.