Introduction

Over the past four years, Coronavirus 2019 (COVID-19), caused by Severe Acute Respiratory Syndrome Coronavirus 2019 (SARS-CoV-19), has spread around the world and is causing great concern globally [38]. Reportedly, COVID-19 may be able to co-exist with humans for a long time. The clinical presentation of patients with COVID-19 is diverse, and a variety of complications are frequently observed in patients with COVID-19, including interstitial pneumonia, pancytopenia, arthralgia and autoimmune diseases [9, 15]. Conversely, autoimmune diseases and inflammation lead COVID-19 patients into a vicious cycle of infection, causing increased morbidity and mortality [27, 35]. Autoimmune diseases such as multiple myeloma (MM), systemic lupus erythematosus and multiple sclerosis are extremely similar to COVID-19 in terms of symptomatology and pathological response, suggesting that there may be a close association between the diseases. A causal relationship between systemic lupus erythematosus, multiple sclerosis and COVID-19 has been reported [34]. However, there is still a gap in the causal relationship between MM and COVID-19. High doses of corticosteroids are used in the treatment of MM, and this usually leads to immunosuppression and a high inflammatory response, increasing the severity of COVID-19 [14]. A number of studies have reported, People with pre-existing autoimmune diseases are at higher risk of SARS-CoV-19 infection and disease severity [12]. Many other studies have observed that MM patients with COVID-19 have rates of infection and severe disease that are within the range of the general population [4]. Causal inferences from observational studies are unreliable or even contradictory due to the effects of unmeasured or unknown confounding factors [7]. On the other hand, due to ethics, it was impossible to conduct randomized controlled trial (RCTs) to investigate the causal relationship between COVID-19 and MM. Thus, the causal relationship between COVID-19 and MM remains ambiguous.

Clarifying the relationship between different traits of COVID-19 and MM helps in the diagnosis, treatment and rehabilitation of MM patients with COVID-19, assess the feasibility and risks of vaccination for patients with MM and help to develop strategies for management of patients with MM and for COVID-19 prophylaxis. Therefore, it is necessary to investigate the causal relationship between COVID-19 and MM [39].

Mendelian randomization (MR) is based on Mendel’s laws of inheritance, and use single nucleotide polymorphisms (SNPs) as an instrumental variable to assess the causal relationship between exposure factors and related outcomes [19]. Because genetic variation has occurred before the outcome, MR reduces the confounding effect of environmental factors and demonstrates genetic causality between exposure and outcome [19].

To fill this gap, we performed a bidirectional two-sample MR analysis to assess the causal relationship between different traits of COVID-19 and MM. LDSC calculates heritability, MTAG identifies novel possible causal SNPs, and combined analysis of co-localization and over-representation enrichment analysis reveal possible biological pathways for COVID-19 to cause MM. This facilitates a deeper understanding of the complex molecular mechanisms behind COVID-19 and MM.

Methods

Study design

Fig. 1
figure 1

Flowchart of overall study design. GWAS, genome-wide association study; IVs, instrumental variables; SNPs, single nucleotide polymorphisms

Fig. 2
figure 2

Bidirectional Mendelian randomization analysis using inverse-variance weighting (IVW) method. a Coronavirus Disease (COVID-19) related traits to multiple myeloma (MM); b multiple myeloma (MM) to Coronavirus Disease (COVID-19) related traits

Fig. 3
figure 3

Circular Manhattan plot of significant SNP correlation with multiple myeloma (MM). a The significant SNP correlation with MM before MTAG analysis; b the significant SNP correlation with MM after MTAG analysis

Fig. 4
figure 4

Over-representation enrichment analysis of shared genes. a The shared genes of SARS-CoV-2 infection and MM. b The shared genes of COVID-19 hospitalization and MM

The overall study design is shown in Fig. 1. Our study is based on multiple GWAS studies. We firstly calculate heritability of COVID-19 related traits (SARS-CoV-2 infection, COVID-19 hospitalization and severe COVID-19) and MM to show that they are all hereditable. Then selecting instrumental variables (IVs), we used bidirectional MR analysis [19] to infer the causal relationships between COVID-19 related traits and MM. Based on the results of bidirectional MR analysis, we expect to identify more novel SNPs with strong signals for MM using multi-trait analysis of GWAS (MTAG) [30] incorporating information from GWAS of COVID-19. In addition, we performed fine-mapping and co-localization analysis [16] to identify shared loci between COVID-19 and MM. We undertook over-representation enrichment analysis [31] of the shared loci to explore the biological processes involved.

Data sources

Genetic associations of COVID-19 were obtained from the latest freeze 7 data (release date: April 8, 2022) based on European ancestry—populations of the COVID-19 host genetics Initiative (COVID-19 HGI) (https://www.covid19hg.org/results/covidhgi-freeze-7-readme.txt) [21]. The data contain three relevant features of COVID-19: (1) Individuals with very severe respiratory symptoms confirmed by COVID-19 or who died from the disease are called severe COVID-19 (13,769 cases and 1,072,442 controls). (2) Individuals hospitalized for symptoms of associated infections are called COVID-19 hospitalization (32,519 cases and 2,062,805 controls). (3) All individuals reporting positive SARS-CoV2 infection, regardless of symptoms and hospitalization, are called SARS-CoV2 infection (122,616 cases and 2,475,240 controls). The genome-wide association studies (GWAS) data for MM patients from the FinnGen GWAS database [10], comprising 598 patients and 180,756 controls of European ancestry. The data in the two databases are not raw genotyping data, but summary statistics data after strict and standardized quality control. Therefore, there is no need for quality control again.

Linkage disequilibrium score regression (LDSC)

Referring to LD scores of European ancestry reference data from the 1000 Genomes Project, we used GWAS summary statistics to calculate heritability for single traits. The LDSC provided an estimate of heritability by regressing the LD score on the running test statistic [8]. LDSC can provide accurate estimates, even if test statistics are inflated due to polygenicity [33]. The heritability estimation was completed by the “ldsc” R software package.

MR analysis

We screened for SNPs that reached the genomic significance level (p < 5 \(\times\) 10 \(^{-6}\)). Secondly, to exclude the effect of linkage disequilibrium (LD) on the MR results, we used the clumping process (r\(^{2}\) < 0.001, clumping distance = 10,000 kb) to ensure that there is no LD between SNPs. Finally, selected SNPs were matched with the GWAS of outcome, and the missing SNPs were replaced by SNPS with high LD (r\(^{2}\) > 0.8). Finally, after removing palindromic SNPs, the other SNPs were used as IVs.

“TwoSampleMR” R package (version 0.5.6, Stephen Burgess, Chicago, IL, USA) was used for two-sample MR Analysis between exposure and outcome. The bidirectional MR analysis is an MR analysis that tries to differentiate whether factor A is a cause or a consequence of factor B or whether there is a true bidirectional causal effect between A and B. Inverse variance weighting (IVW, random effects) was used as the main method of analysis. Cochran’s Q statistic was adopted for IVW analysis to detect the heterogeneity, and p > 0.05 indicates no heterogeneity [28]. MR-Egger regression was used to identify potential pleiotropy and to assess the effect of pleiotropy on the risk assessment of the intercept test, with p > 0.05 indicating no pleiotropy [36].

MTAG

The key assumption of MTAG is that all SNPs share the same variance-covariance matrix of effect sizes across traits. MTAG re-estimated the effect size for each trait by combining the weighted sum of the GWAS estimates for different traits to identify novel SNPs with strong signals [30]. Compared with genome-wide significant SNPs identified in single-trait GWAS, more phenotype-related SNPs can be found through joint analysis of multiple traits related GWAS summary results. We used trait-specific effective sample sizes and transformed Z-scores for MTAG. All SNPs present in the GWAS data were included in the MTAG calculation.

Fine-mapping and co-localization analysis

The “COLOC” R package was used to assess the probability that two traits share the same causal variant. We selected each genetic variant within 100 kb of the major SNPs for co-localization analysis [13]. By calculating the posterior probability that two traits share the same causal variant (p(H4)), we explore possible causal pathways by which COVID-19 causes MM. If any SNP has a p (H4) > 0.85, we label it as a co-localized genetic variant [16].

Over-representation enrichment analysis

We mapped shared loci to gene names through the dpSNP database. Enrichment in Gene Ontology (GO) biological process was analyzed by the WEB-based Gene Set Analysis Toolkit using genes with p(H4) > 0.8 [31]. We performed over-representation enrichment analysis for shared genes of SARS-CoV-2 infection and MM, COVID-19 hospitalization and MM in order to explore the biological processes involved in the causal relationship between COVID-19 and MM.

Results

Bidirectional MR analysis

Heritability (h2) estimated that all three traits of COVID-19 (severe COVID-19, COVID-19 hospitalization, SARS-CoV-2 infection) and MM were heritable (p < 0.05, as shown in Table 1). We next performed MR analysis of COVID-19 to MM (Fig. 2a). The results of IVW suggested SARS-CoV-2 infection had a positive effect on MM (p = 0.021), and COVID-19 hospitalization also increased the risk of MM (p = 0.017). Heterogeneity test and MR-Egger intercept test did not demonstrate evidence of heterogeneity or horizontal pleiotropy (Table 2), which supported the validity of the findings. The causal effect of severe COVID-19 to MM was not observed (p = 0.066).

Table 1 SNP based heritability estimated by LDSC
Table 2 The corresponding pleiotropy and heterogeneity test of each exposure-outcome pair
Table 3 Significant SNPs of MM after MTAG analysis
Table 4 Co-localization analysis of COVID-19 and MM

The results of MR Analysis of MM to COVID-19 are shown in Fig. 2b. We found null causal relationship of genetically predicted MM to severe COVID-19 (p = 0.296), COVID-19 hospitalization (p = 0.879) and SARS-CoV-2 infection (p = 0.626). IVs used in MR analysis for individual traits are shown in supplemental Tables 14.

MTAG

Fourteen genomic significant and independent genetic loci for MM were extracted using GWAS of European ancestry (Fig. 3a). We performed a joint GWAS using MTAG for two COVID-19 traits (SARS-CoV-2 infection, COVID-19 hospitalization) with MM. Finally, a total of 17 significant loci associated with MM were identified (Table 3). We observed that there were three more relevant loci than those identified by single-trait GWAS. Interestingly, all three loci are located on chromosome 11 and may regulate the expression of B3GNT6 (Fig. 3b). This gene encodes B3GNT6 protein, the absence of which predisposes to the development of chronic inflammation and ultimately to the development of cancer. In addition, we observed that three SNPs strongly associated with MM were located on the X chromosome, which may be one of the reasons why there are more males than females in patients with MM. We did not identify other genome-wide significant loci for the COVID-19 trait using MTAG combining information from GWAS associated with MM.

Analysis of shared genetic loci

Although we have found evidence of a positive causal effect of COVID-19 on MM, the causal mechanisms by which COVID-19 promotes MM remain to be explored. We defined each individual SNP and genetic variation within 100kb associated with COVID-19 or MM as a test region. Finally, after merging the overlapping regions, a total of 53 unique regions of SARS-CoV-2 infection and MM were included, and a total of 86 unique regions of COVID-19 hospitalization and MM were included.

Among these unique regions tested, five regions in chromosomes 3, 6, and 12 suggested causal SNPs of SARS-CoV-2 infection on MM (co-localization probability p(H4) > 0.85; Table 4). There are three regions on chromosome 3 and one region on each of chromosomes 6 and 12. The co-localized genetic loci were mapped to SLC6A20, LZTFL1, MUC4, OAS1, HLA-C genes for MM with SARS-CoV-2 infection (Table 4). In addition to these chromosomes, the causal SNPS of COVID-19 hospitalization on MM are also located in many regions of chromosome 19, involving genes such as PSG5, SLC22A31, FDX2, MAPT and others. ABO and FUT2 all contribute to both autoimmune diseases and COVID-19. Chromosome 6 has a region of the human leukocyte antigen (HLA) complex that is strongly associated with immunity and encodes a variety of cell surface proteins responsible for regulating the immune system, including HLA-C. Autoimmune diseases have also been repeatedly reported to be associated with genes on chromosomes 3 and 19 [18]. Combining MR results showing causal effects of SARS-CoV-2 infection and COVID-19 hospitalization to MM and no evidence of horizontal pleiotropy, these observed genes may exert causal effects on MM through changes in SARS-CoV-2 infection and COVID-19 hospitalization.

Over-representation enrichment analysis

In over-representation enrichment analysis, these shared genes were enriched in several immune response-related pathways (Fig. 4), such as “response to type I interferon”, “type I interferon signaling pathway”, “interferon-gamma-mediated signaling pathway”, “glycosylation of proteins and macromolecules”, “innate immune response” and “defense response to virus”. These biological processes are closely related and important for COVID-19 and MM.

Discussion

The malignant plasma cell neoplasm is a hematological disorders caused by clonal proliferation of plasma cells and the resulting overproduction of monoclonal immunoglobulins (M proteins). MM is an important category. MM patients exhibit various immune deficiencies caused by the disease or its treatment [3]. There are also complex interactions between SARS-CoV-2 and the immune system [22]. It is important to determine whether there is a potential relationship between COVID-19 and MM and to understand the complex molecular mechanisms behind them. This contributes to a deeper understanding of the relationship between COVID-19 and MM and provides insights into future viral crisis events similar to SARS or SARS-CoV-2. To our knowledge, this report is the first to provide evidence for the presumed causal nature of the association between COVID-19 and MM.

Autoantibody production is a key feature of autoimmune diseases, and autoantibodies known to occur in many autoimmune diseases have been detected in patients with COVID-19 [17]. SARS-CoV-2 can disrupt self-tolerance and trigger an autoimmune response through cross-reactivity with host cells [2]. Some patients have been reported to develop autoimmune diseases, such as Guillain-Barre syndrome or systemic lupus erythematosus, after contracting COVID-19 [6]. Our MR study suggests that SARS-CoV-2 infection and COVID-19 hospitalization are associated with an increased risk of MM. By using a bidirectional analysis strategy, we can distinguish between upstream and downstream of the disease chain without reverse causation. We show for the first time that SARS-CoV-2 infection and COVID-19 hospitalization are associated with a higher risk of MM genetically. Our results support the conclusion that COVID-19 may trigger autoimmunity and induce autoimmune diseases [24].

When the GWAS of a phenotype is correlated with the results of other phenotypes, MTAG can use the information of the correlated phenotypes to conduct joint multi-trait analyses to identify novel SNPs of significance. We observed three novel SNPs, which coincidentally clustered around B3GNT6. B3GNT6, which encodes an important precursor in the biosynthesis of mucin-type glycoproteins, is associated with autoimmune diseases and DNA repair, such as selective lgA deficiency [23].

We applied an integrated approach combining MR with COLOC to investigate the genetic causal pathway of MM mediated by SARS-CoV-2 infection and COVID-19 hospitalization. MR + COLOC analysis showed evidence of causal effects of five genes mediated by SARS-CoV-2 infection and 19 genes mediated by COVID-19 hospitalization on MM. Some of these genes have been reported to be associated with COVID-19 and may be responsible for increased susceptibility to autoimmune diseases. MUC4 controls ciliary motility and contributes in defensing against allergens, viruses and extracellular molecules [25]. What is more, MUC4 is highly associated with SARS-CoV-2 infection, and its subsequent dysregulated immune-inflammatory response plays an important role in the pathogenic mechanisms of autoimmune diseases such as rheumatoid arthritis and systemic lupus erythematosus [29]. The HLA region contains genes involved in antigen processing, presentation, and immune regulation, and has a role in both COVID-19 and autoimmune diseases. HLA-C in the HLA I region is an allele closely associated with SARS-CoV-2 infection and also mediates a variety of autoimmune diseases [32]. There are also SLC6A20 [1] and LZTFL1 [11]. Variants within the ATP11A gene influence the development of illness requiring hospitalization after infection with SARS-CoV-2 [37]. Hu et al. [20] reported that low-expressed/highly methylated ATP11A is a prognostic marker for acute myeloid leukemia. These genes we identified are involved in three main biological processes: response to interferon, regulation of immune processes and glycosylation of proteins. It has been shown that genetic variation of functional genes in IFN-I pathway is associated with disease risk. Inhibition of IFN-I response can aggravate virus replication and lead to excessive immunity and induce a variety of immune diseases such as rheumatoid arthritis and systemic lupus erythematosus [26]. COVID-19-associated changes in levels and N-glycosylation of specific plasma proteins highlight the role and complexity of glycosylation in COVID-19 [5]. It has also been widely reported that glycosylation of immunoglobulins induced pathological changes and ultimately lead to the development of autoimmune diseases [40]. The glycosylation pathway may play a role in the causal relationship between COVID-19 and MM. Thus, these genes we identified may play important roles in the underlying mechanisms of MM, of which COVID-19 is the mediator.

Our study is the first to explore the relationship between COVID-19 related traits and MM. However, we acknowledge some limitations of our study. Firstly, we used GWAS data of European ancestor to avoid population stratification, so the applicability of our results to other populations (such as Asians) should be carefully considered. Secondly, we indirectly highlighted causal genes and pathways, and these findings require further confirmatory analyses. And finally, we did not find a causal relationship between severe COVID-19 and MM for reasons, so further observations and biological experiments may be needed.

Conclusion

In summary, we report a causal relationship between COVID-19 and MM, identify possible causal pathways using an MR + COLOC analysis method, and discover the biological processes at work. These findings may provide new insights for further studies on the potential pathogenesis and therapeutic targets of MM.