Genome-wide association study of self-reported food reactions in Japanese identifies shrimp and peach specific loci in the HLA-DR/DQ gene region

Khor, Seik-Soon; Morino, Ryoko; Nakazono, Kazuyuki; Kamitsuji, Shigeo; Akita, Masanori; Kawajiri, Maiko; Yamasaki, Tatsuya; Kami, Azusa; Hoshi, Yuria; Tada, Asami; Ishikawa, Kenichi; Hine, Maaya; Kobayashi, Miki; Kurume, Nami; Kamatani, Naoyuki; Tokunaga, Katsushi; Johnson, Todd A.

doi:10.1038/s41598-017-18241-w

Download PDF

Article
Open access
Published: 18 January 2018

Genome-wide association study of self-reported food reactions in Japanese identifies shrimp and peach specific loci in the HLA-DR/DQ gene region

Seik-Soon Khor¹,
Ryoko Morino²,
Kazuyuki Nakazono³,
Shigeo Kamitsuji³,
Masanori Akita²,
Maiko Kawajiri²,
Tatsuya Yamasaki⁴,
Azusa Kami²,
Yuria Hoshi⁴,
Asami Tada²,
Kenichi Ishikawa²,
Maaya Hine⁵,
Miki Kobayashi⁵,
Nami Kurume⁵,
Naoyuki Kamatani³,
Katsushi Tokunaga¹ &
…
Todd A. Johnson ORCID: orcid.org/0000-0003-3377-6692³

Scientific Reports volume 8, Article number: 1069 (2018) Cite this article

4773 Accesses
27 Citations
11 Altmetric
Metrics details

Subjects

Abstract

Food allergy is an increasingly important health problem in the world. Several genome-wide association studies (GWAS) focused on European ancestry samples have identified food allergy-specific loci in the HLA class II region. We conducted GWAS of self-reported reactivity with common foods using the data from 11011 Japanese women and identified shrimp and peach allergy-specific loci in the HLA-DR/DQ gene region tagged by rs74995702 (P = 6.30 × 10⁻¹⁷, OR = 1.91) and rs28359884 (P = 2.3 × 10⁻¹², OR = 1.80), respectively. After HLA imputation using a Japanese population-specific reference, the most strongly associated haplotype was HLA-DRB1*04:05-HLA-DQB1*04:01 for shrimp allergy (P = 3.92 × 10⁻¹⁹, OR = 1.99) and HLA-DRB1*09:01-HLA-DQB1*03:03 for peach allergy (P = 1.15 × 10⁻⁷, OR = 1.68). Additionally, both allergies’ associated variants were eQTLs for several HLA genes, with HLA-DQA2 the single eQTL gene shared between the two traits. Our study suggests that allergy to certain foods may be related to genetic differences that tag both HLA alleles having particular epitope binding specificities as well as variants modulating expression of particular HLA genes. Investigating this further could increase our understanding of food allergy aetiology and potentially lead to better therapeutic strategies for allergen immunotherapies.

European and multi-ancestry genome-wide association meta-analysis of atopic dermatitis highlights importance of systemic immune regulation

Article Open access 04 October 2023

Ashley Budu-Aggrey, Anna Kilanowski, … Lavinia Paternoster

A genome-wide association study in Japanese identified one variant associated with a preference for a Japanese dietary pattern

Article 06 December 2020

Harumitsu Suzuki, Yasuyuki Nakamura, … Kenji Wakai

Genome-wide association study of dietary intake in the UK biobank study and its associations with schizophrenia and other traits

Article Open access 03 February 2020

Maria Niarchou, Enda M. Byrne, … Naomi R. Wray

Introduction

Asthma has been traditionally known to be the first wave of the allergy epidemic affecting the lives of millions and is the third leading cause for hospitalization in the United States among people less than 18 years of age¹. Food allergy appears to be increasing in its prevalence, with 5% of adults and 8% of children estimated to be affected², and it is now commonly referred to as the second wave of the allergy epidemic³. In general, individuals during childhood are at higher risk for food allergies to wheat, soy, milk and egg, but food allergies to tree nuts, peanuts, fish and shellfish normally tend to persist throughout adulthood². In addition, recent studies have shown that sensitization to food can occur in adulthood⁴, although serum IgE reactivity with specific foods appears to differ by geographic region, presumably due to differential consumption of particular foods or exposure to environmental allergens^4,5,6,7.

Food allergy related disorders include acute conditions with the potential to be fatal, along with a number of chronic diseases that mainly occur in the epidermis and gastrointestinal tract⁸. Such disorders can roughly be divided into classical IgE-mediated food allergies or those mediated by other antibody isotypes such as IgA and IgG⁹. In addition, food allergy diagnosis can be complicated because detection of food-specific IgE does not always correlate with clinically defined allergy¹⁰, and there are a number of disorders for which symptoms may appear to be similar to food allergy. For example, dark-meat fish can contain histamine-like chemicals that cause a toxic reaction known as scombroid poisoning¹¹, and consumption of spicy foods may result in rhinorrhea, a situation known as gustatory rhinitis, which mimics aspects of food allergy¹². In addition, individuals with food intolerance may misascribe their symptoms as “allergies”. For example, one study using self-reported food reactions found that of subjects reporting reactivity to milk and wheat, 85% and 94% could be ascribed to lactose and gluten intolerance, respectively⁵.

In a nationwide survey of Japanese physicians for immediate-type food allergies across all age groups, the percentage of 2,954 patients that reacted with causative foods were: hen’s eggs (39.0%), cow’s milk (21.8%), wheat (11.7%) peanuts (5.1%), fruit (4%), fish roe (3.7%), crustaceans (3.4%), nuts (2.3%), buckwheat (2.2%), fish (2.1%), and other foods (4.6%)¹³. In contrast, foods that made up greater than 5% of either new-onset food allergies (NOFA) or patient visits due to accidental ingestion (AIFA) in adults (≥20 years old) were limited to wheat (NOFA = 38.0%; AIFA = 34.0%), crustaceans (NOFA = 10.0%; AIFA = 22.0%), buckwheat (NOFA = NA; AIFA = 10.0%), fish (NOFA = 13.0%; AIFA = 8.0%), and fruit (NOFA = 7.0%; AIFA = 8.0%). The levels of wheat-related NOFA and AIFA in adults was around three times greater than that seen across the five lower age ranges that were tabulated and may be related to recent reports of wheat allergy and wheat-dependent exercise-induced urticaria/anaphylaxis attributed to use of soap and other cosmetic products containing hydrolysed wheat protein¹⁴. Based on such surveys, the food labelling system for allergenic food ingredients in Japan started on April 1, 2002 designated egg, milk, wheat, buckwheat and peanuts as mandatory, and the labelling of shrimp and crab was required from 2008¹⁵. Labelling of an additional twenty foods is recommended by the Consumer Affairs Agency¹³.

The human major histocompatibility complex (MHC) region encodes the human leukocyte antigen (HLA) genes, among which HLA class II molecules (HLA-DR, -DQ and -DP) have especially been reported to be involved in food allergy responses. HLA involvement in the IgE response was first described when Levine et al. reported that HLA class I haplotypes triggered IgE response to allergen derived from ragweed (Ambrosia artemisiaefolia)¹⁶. Subsequently, other HLA associations with allergens were reported, including for rye grass (Lol p I, Lol p II, Lol p III)^17,18, American feverfew, and birch pollen (Bet v I) with HLA-DR3¹⁹. With the advancement of methods for genotyping, genome-wide association studies (GWASs) have been used to identify susceptibility genes for allergic diseases such as asthma^20,21,22 and atopic dermatitis^23,24,25. Besides that, association of HLA-DR and HLA-DQ were confirmed for several food allergies including peanut^26,27,28 and asparaginase (a kind of enzyme used in food manufacturing)²⁹.

Foods generally contain multiple proteins to which individuals may react differently due to exposure of the food to heat (cooking), acid, and proteases (gastrointestinal tract), and most of the identified relevant food allergens are water-soluble glycoproteins^30,31,32,33. In addition, antibodies may bind to either linear or conformational epitopes within a protein’s structure, with the former representing short runs of amino acids within the primary protein structure, and the latter representing epitopes within the three-dimensional structure of the protein. Previous studies have shown that the repertoire of antibodies binding to multiple epitopes within food allergens may correlate with disease status³⁴, and IgE that recognizes the tertiary allergen structure is often associated with transient childhood allergy while linear epitope recognition often results in persistence allergy into adulthood^34,35.

In this study, we performed a comprehensive analysis of self-reported reactivity with 27 different foods from fruits, vegetables, grains, meats, seafood, and dairy product food groups in a Japanese population sample.

Results

GWAS of self-reported food reactions in the Japanese population

Samples for this study were collected as part of an on-going effort of the Japanese company MTI and its subsidiary EverGene (EG) to better understand and improve women’s health through research use of its Luna Luna family of women’s healthcare related web-site and smartphone app. DNA from 11379 female subjects was collected in two study stages (LL01 = 5751, LL02 = 5628), and subjects were queried via questionnaire for putative allergic sensitivity to twenty-seven different foods from fruits, legumes/grains, nuts, vegetables/mushrooms, eggs/dairy, meats, fish, and shellfish product food groups (Supplementary Table S1). DNA samples were genotyped using a custom Affymetrix Axiom genotyping array based on the company’s CHB-1 product (SNP ct. = 607857; see Methods), and using 536506 QC + variants, we analysed LL01/LL02 samples for population structure and for the presence of duplicated samples via principal component analysis (PCA; Supplementary Fig. S1) and identity-by-descent (IBD) analysis (see Methods); 11011 QC + subjects also answered the food allergy questions and were used in our analysis.

Preliminary GWAS scans were performed across the 27 foods, and then, due to noise and a lack of signals for foods with low case sample counts, final GWAS analyses were run across seven foods for which the number of cases were deemed sufficient (>100 cases in both the LL01 and LL02 sample sets; kiwi, peach, Chinese yam, eggs, mackerel, crab, and shrimp in Supplementary Table S1). Evaluation of genome-wide inflation of test statistics showed that λ_GC values for genotyped variants ranged from 0.9889 to 1.0073 across the seven foods, which suggested that there was negligible inflation of the test statistics after adjustment for the top two PCs (Supplementary Fig. S2).

Since neither LL01 or LL02 was originally considered as a traditional discovery sampleset with the other being a “replication” set, we employed a bi-directional “discovery-evaluation” GWAS strategy that was recently described (see Methods; Flow-chart: Supplementary Fig. S3)³⁶. We then combined the two samplesets in a meta-analysis, and based on a previous estimate of the effective independent SNP count (M_E) for a similarly sized array platform and the JPT population³⁷, we called SNPs that achieved a single GWAS P-value cut-off of P_meta < 1.2 × 10⁻⁷ (0.05/411,521 SNPs) as nominal associations and those achieving a multiple testing adjusted cut-off of P_meta < 4.4 × 10⁻⁹ (1.2 × 10⁻⁷/27 foods) as strong associations. Two foods were identified with SNPs achieving the latter threshold (peach and shrimp; Table 1). To check for potential signals that may have escaped detection by using only genotyped data for identifying association signals, we also performed a genome-wide summary statistics based imputation using DISTMIX (see Methods) and plotted the results as Manhattan plots in Fig. 1. No additional signals achieving the significance cutoff were identified in any of the seven foods as compared to the original analysis that used genotyped data.

Table 1 Summary of food-allergy association signals.

Full size table

Associations with peach and shrimp allergy in the HLA region

Within both sets of peach and shrimp allergy associated variants, we identified a single cluster of neighbouring SNPs in the HLA-DR/HLA-DQ region of chromosome six (Fig. 1). To more accurately examine the underlying signals, we performed regional genotype imputation using 1000 Genomes Phase 3 reference haplotypes and identified rs28359884 and rs74995702 as the top SNP in the fine-mapped peach (Fig. 2a) and shrimp (Fig. 3a) allergy association signals, respectively; we found no residual significant signal after conditioning the regression analyses for each top imputed variant (Table 1; Fig. 2b and Fig. 3b).

We annotated variants that were in linkage disequilibrium (LD) to the top SNP in each signal using HaploReg³⁸, the NHGRI/EBI GWAS Catalog (downloaded October 18, 2017)^39,40, and UCSC known gene browser tables to identify overlap or proximity to neighbouring genes. For each associated SNP, we identified overlap with HaploReg components such as dbSNP functional annotation (5′-UTR, 3′-UTR, intergenic, nonsynonymous, synonymous), RoadmapEpigenomics epigenetic elements (promoter, enhancer, and DNase marks), and eQTL functional annotations across moderate-high LD variants (per-signal summary: Supplementary Worksheet S1; per-SNP summaries: Supplementary Worksheets S2 and S3).

Top variants associated with peach allergy lay within upstream regions of three HLA genes (HLA-DQA1, HLA-DRB5, HLA-DRB1), with 94% of the top peach SNPs (43/46 SNPs r² > 0.8) overlapping epigenomic annotations (Supplementary Worksheet S2: 7 promoter marks, 32 enhancers, 23 TFBS), and 31 SNPs associated with gene expression in HaploReg (min. P~6.3 × 10⁻⁴⁷; associated genes: HLA-DRB6, HLA-DQA2, HLA-DRB1, HLA-DQB1, SKIV2L, SEC. 14L3). Based on that observation, we examined SNPs that had at least moderate LD with the top associated peach allergy SNP (rs28359884) for overlap with eQTL variants identified in both European ancestry tissue samples (GTExPortal version 6p)⁴¹ and Japanese ancestry peripheral blood mononuclear cell (PBMC) samples (HGVD: Human Genetic Variation Database)^42,43. That analysis identified significant GTExPortal + HGVD eQTL SNPs for two HLA genes (Supplementary Worksheets S4 and S5), HLA-DQA2 (min.P_{HLA-DQA2/GTEx} = 2.3 × 10⁻⁵⁶, min.P_{HLA-DQA2/HGVD} = 1.3 × 10⁻¹⁸) and HLA-DRB5(min.P_{HLA-DRB5/GTEx} = 2.6 × 10⁻²⁹, min.P_{HLA-DRB5/HGVD} = 1.8 × 10⁻⁶²).

For shrimp allergy, top variants resided within a similar genomic HLA region encompassing six HLA class II genes (HLA-DQA1, HLA-DRB5, HLA-DRB1, HLA-DQB1, HLA-DQA2, HLA-DRA), with 60% of high LD SNPs (18/30 SNPs r² > 0.8) overlapping potential regulatory elements (Supplementary Worksheets S3: 5 promoter marks, 16 enhancers, 2 DHS, 7 TFBS). Although only a single high LD variant (rs2760995) was an eQTL listed in HaploReg, analysis of moderate LD variants identified 18 SNPs associated with expression of one or more of four HLA genes (HLA-DQA1, HLA-DQA2, HLA-DQB2, and/or HLA-DRA) in both GTExPortal and HGVD (Supplementary Worksheets S4 and S6). Within 413 moderate LD SNPs, 162 SNPs were significantly associated with those genes’ expression in GTExPortal data, with HLA-DQA2 associated SNPs displaying the largest number of SNPs and highest significance levels across shrimp-allergy, GTExPortal, and HGVD eQTL phenotypic analyses (Supplementary Worksheet S4; min.P_{HLA-DQA2/GTEx} = 1.3 × 10⁻⁵¹, min.P_{HLA-DQA2/HGVD} = 5.3 × 10⁻²⁸).

Two-field HLA allele imputation analysis

Food-allergy is also considered to be related to an ability of particular HLA alleles to present exogenous food acquired peptides. Therefore, to better understand the underlying nature of the peach and shrimp allergy associations, we performed imputation of HLA alleles using the SNP genotype data (see Methods). To confirm the quality of the HLA imputation results, 126 samples with various representative HLA haplotypes were HLA typed by PCR-SSOP method. There was >99% concordance across both HLA-DRB1 and HLA-DQB1 genes between imputed and HLA typing results. To analyse the HLA data, we performed a Relative Predispositional Effects (RPE) analysis⁴⁴, which performs a stepwise removal of top effect HLA alleles from cases and controls and then re-evaluates the significance of remaining alleles (see Methods)⁴⁵. In that fashion, RPE attempts to account for the impact of strong effect alleles that may shift the distribution of other allele’s frequencies in cases away from those observed in the controls.

RPE analysis of the imputed HLA alleles for peach allergy identified four HLA alleles in HLA-DRB1 (09:01, 14:05, 15:01, and 15:02) and HLA-DQB1 (03:03, 06:02, 05:03 and 06:01) and two alleles in HLA-DPB1 (14:01 and 09:01) (RPE summary in Table 2; RPE allele results in Table 3) as predisposing (PRE) alleles. While the preliminary stage of peach RPE analysis also identified alleles with protective effects (PRO), their significance decreased at each RPE step, suggesting that their presence was related to the strong PRE alleles (see DRB1*04:05 and 01:01, DQB1*04:01 and 05:01 in Table 3 RPE step 0). Among the PRE alleles, HLA-DRB1*09:01 and HLA-DQB1*03:03 are shown to be most commonly expressed in Asian populations, among which Japanese have one of the highest frequencies (Supplementary Fig. S4a and S4b). RPE analysis of two locus HLA-DRB1~HLA-DQB1 haplotypes showed significant associations for HLA-DRB1*09:01-HLA-DQB1*03:03, HLA-DRB1*14:05-HLA-DQB1*05:03, HLA-DRB1*15:01-HLA-DQB1*06:02, HLA-DRB1*15:02-HLA-DQB1*06:01 (Supplementary Table S2), while three locus HLA-DRB1~HLA-DQB1~HLA-DPB1 haplotype analysis identified six haplotypes containing the significant two locus haplotypes in combination with three HLA-DPB1 alleles (Supplementary Table S2). Among the associated two-locus HLA haplotypes, HLA-DRB1*09:01-HLA-DQB1*03:03 is shown to be exclusively expressed in Asian populations (Supplementary Fig. 4c).

Table 2 Relative Predispositional Effect (RPE) analysis locus summary.

Full size table

Table 3 Peach allergy: RPE analysis HLA allele summary.

Full size table

RPE analysis of HLA alleles associated with shrimp allergy revealed two PRE alleles for HLA-DRB1 (04:05 and 15:01), HLA-DQB1 (04:01 and 06:02) and HLA-DPB1 (06:01 and 14:01) (RPE summary in Table 2; RPE allele results in Table 4). As seen for the peach allergy, initially significant PRO alleles in RPE step 0 (HLA-DRB1*09:01, HLA-DQB1*03:03) were no longer significant after RPE adjustment for the top PRE alleles. Two locus HLA haplotype analysis (HLA-DRB1~DQB1) showed HLA-DRB1*04:05-HLA-DQB1*04:01 to be the single strongest HLA haplotype (Supplementary Table S3), accounting for most of the signal observed at the top meta-analysis SNP. Three locus HLA haplotype analysis (HLA-DRB1-DQB1-DPB1) did not identify a more significant association, as the signal observed at the major two locus haplotype was split into HLA-DRB1*04:05-HLA-DQB1*04:01-HLA-DPB1*05:01, HLA-DRB1*04:05-HLA-DQB1*04:01-HLA-DPB1*02:01, and HLA-DRB1*04:05-HLA-DQB1*04:01-HLA-DPB1*04:02 (Supplementary Table S3). Among the associated HLA alleles and haplotypes, HLA-DRB1*04:05, HLA-DQB1*04:01 and HLA-DRB1*04:05-HLA-DQB1*04:01 showed Asian-specific geographic distribution similar to that seen for the top peach HLA alleles and haplotype (Supplementary Fig. S5).

Table 4 Shrimp allergy: RPE analysis HLA allele summary.

Full size table

Cross-reactivity for self-reported food reactions

Previous food allergy reports have identified that individuals who react to one food sometimes react to other similar foods⁵ or even other environmental allergens⁴⁶. Therefore, we examined the proportion of individuals reporting reactivity with one of the twenty-seven foods who also reported reactivity with another analyzed food. That analysis is summarized as a heatmap (Fig. 4), which shows three distinct clusters where individuals reported cross-reactivity between members of fruits, nuts, or shellfish food groups. We observed the highest cross-reactivity for the two foods for which we identified significant associations in this report, with 60.5% of individuals reporting a reaction to crab also reactive with shrimp (49.2% of shrimp sensitive subjects reported reaction with crab), and 70.6% of apple reactive subjects also reporting peach sensitivity (41.2% of peach reactive individuals reported sensitivity to apple).

We then examined the top peach and shrimp SNPs in the MHC region for associations with apple or crab, respectively. Both of the top peach and shrimp allergy associated SNPs were nominally significant in apple and crab reactive subjects (Top peach SNP: rs28359884, P_apple = 5.75 × 10⁻⁶; Top shrimp SNP: rs74995702, P_crab = 1.22 × 10⁻³). In addition, out of 18,679 SNPs in the region, about 50% of the top 100 peach or shrimp associated SNPs were also in the top 100 apple or crab associated variants (54 and 48 SNPs were in the top 100 peach and apple or shrimp and crab SNPs, respectively).

Association analysis of previously reported SNPs

Recently, a GWAS of German children identified six SNPs (chr1:rs12123821; chr5:rs11949166; chr6:rs9273440, chr11:rs2212434; chr18:rs12964116, rs1243064) that were significantly associated with overall susceptibility to food allergy or individually to cows milk, hen’s egg, and/or peanuts⁴⁷. Due to our study-design, we were not able to analyse all the Marenholz et al. SNPs for all three foods in our data, but we did analyse association with egg for those variants using the DISTMIX imputed GWAS statistics. Two of the six (rs12123821 and rs12964116) were monomorphic in East Asians, while the other four did not achieve even nominal significance (P < 0.05) in our dataset. We also calculated association statistics for peanut allergy using the regionally imputed data around the HLA for peanut allergy specific rs9273440 but also did not see a significant result (P = 0.1368). One also might expect that SNPs associated with overall susceptibility to food allergy would be associated across multiple other foods. Therefore, we examined all seven of our GWAS foods for association with chr1:rs12123821, chr5:rs11949166, chr11:rs2212434, chr18:rs12964116, and chr18:rs1243064. As noted, the filaggrin (FLG) locus SNP rs12123821, and rs12964116, one of two serpin peptidase inhibitor, clade B, member 7 (SERPINB7) locus SNPs, were monomorphic, and across the remaining three SNPs, we observed nominal significance in just one out of the seven foods for rs2212434 (P_peach = 0.01393) and rs1243064 (P_peach = 0.0390); we did not consider those as having passed multiple testing adjustment for the seven foods (P < 0.007). In addition, two variants had previously been identified in a GWAS by Hong et al. as associated with peanut allergy in U.S. children²⁶. We found that one of the two SNPs was nominally associated with reactivity to peanuts in our dataset (rs7192: P = 0.00804; rs9275596: P = 0.0942). Potentially, the non-replication between our data and the Marenholz et al. study could be due to the low sample size in our data, case heterogeneity in our data due to use of self-reported food reactions rather than oral food challenge, or differences between children and adults with respect to food allergy aetiology.

Discussion

In this study, we solicited 11011 adult females for self-reported allergy (SrFA) to any of 27 different foods and 21.5% reported sensitivity to one or more foods (Supplementary Table S1). That level is much higher than generally accepted estimates of 2–4% for the prevalence of food allergies (FA) based on the diagnostic gold standard of double-blind placebo controlled food challenge (DBPCFC), but not outside of the 10–30% estimates from previous studies that tabulated SrFA status^5,6,48,49,50. Although one might presume that a large part of the difference between prevalence rates from SrFA and DBPCFC stems from patient subjectivity, some of the difference may be related to factors that augment allergic reactivity such as NSAID use, alcohol intake, exercise, menstruation cycle, and current pollen allergy season status⁵¹. For example, in birch pollen allergy patients, markers of gastrointestinal inflammation were significantly higher during the pollen allergy season than in the off-season⁵², suggesting that reactivity to oral food challenge (OFC) could depend upon seasonal fluctuation in inflammatory status. In addition, geographic regions may impart differential exposure to airborne allergens such as birch pollen (https://www.forestry.gov.uk/forestry/infd-8qnk35), which could lead to a larger local prevalence of FA to birch pollen related foods such as hazel nut, apple, peach, pear, etc. Also, exposure to potentially allergenic foods may differ due to regional consumption habits, with fruits and vegetables that are rare in some areas being very common in others and fish and shellfish consumption differing greatly between countries⁵³. As an example, Chinese yam (Japanese yamaimo), which was the highest reported food in our study (Supplementary Table S1: 5.1%), is consumed widely in East Asia and is on the list of 25 foods for which the Japanese government requires labelling on food products¹³ but is not a common food in the United States or European countries. Since it is known to commonly elicit contact dermatitis when grated with bare hands⁵⁴, the reported sensitivity in our study may relate to the external dermatological reaction rather than to a true FA response. Interestingly, although the SrFA rates observed in our study may seem high, the top foods largely mirror those identified as causative foods for new-onset FA and those responsible for accidental food ingestion by the Japanese national physician survey that was described in the Introduction¹³.

Seafood allergy is one of the common food allergies, with crustaceans representing 17.1% and 10.0% of new onset food allergies in Japan in 7–19 years and ≥20 years old age groups, respectively¹³. Interestingly, crustaceans are the second most reported cause of doctor’s visits due to accidental ingestion by Japanese adults, and shellfish are known to be the most common cause of anaphylaxis in adulthood in most Asian countries (Hong Kong (China), Singapore, and Thailand)^55,56,57,58, US⁵⁹ and Australia⁶⁰ as well. In Japan, the most common types of edible shrimp are kuruma prawn (Marsupenaeus japonicus), White Pacific shrimp (Litopenaeus vannamei), black tiger (Penaeus monodon) and Japanese spiny lobster (Panulirus japonicas)⁶¹. Shrimp allergens are some of the most commonly researched among shellfish allergies⁶², with currently identified shrimp allergens including tropomyosin, arginine kinase, myosin light chain 1, myosin light chain 2, sarcoplasmic calcium-binding protein and troponin C and triosephosphate isomerase⁶¹. Among the shrimp allergens, tropomyosin is a pan-allergen that is considered to be the major shellfish allergen, and it has been reported that at least 80% of shrimp-allergy subjects respond to tropomyosin³⁰. Cross-reactions between tropomyosins in the same class of shellfish are frequently reported, and it is known that the risk of reaction to a second species of crustacean is as high as 75%⁶³. Besides that, the risk of dust mite allergy (Der p 10) is increased in those with allergy to crustaceans⁶³. Binding assay of extracts of tropomyosin showed that roughly 75% of tropomyosin binds to shrimp-specific IgE from shrimp-allergic subjects⁶⁴. Shrimp allergy is known to be mediated through the IgE mechanism, with the recognition by naïve CD4 + T cells of antigenic tropomyosin peptides presented by HLA class II molecules known to be the trigger for the IgE mediated hypersensitivity reaction^65,66. Peptide binding assays on European patients showing positive reaction to brown shrimp tropomyosin identified 17 CD4 + T cell shrimp tropomyosin-derived epitopes restricted to DRB*01:01, DRB1*03:01, DRB1*04:01, DRB1*09:01, DQB1*02:01, DQB1*03:02 and DQB1*05:01. DRB1*04:05 and DQB1*04:01 identified in our analysis were not tested in that study as these two alleles are expressed only in Asian populations⁶⁵. Interestingly, all shrimp epitope binding alleles tested in that study that are polymorphic in Asians (DRB*01:01, DRB1*04:01, DRB1*09:01, DQB1*03:02 and DQB1*05:01) were not significantly associated with shrimp allergy in our data. That observation suggests that additional mechanisms besides the presence of a particular allele with the capacity to bind a particular epitope are necessary to elicit an allergic response.

Peach (Prunus persica) is one of the most common fruits found to be related to allergy. The allergens most commonly found in peach include Pru P 1 (Bet v 1-homologous pathogenesis related protein 10 [PR 10]), Pru P 2 (thaumatin-like protein), Pru P 3 (lipid transfer protein), Pru P 4 (profilin), and Pru P 7 (antimicrobial peptide Gibberellin-regulated protein: GRB). Among the allergens, lipid transfer protein Pru P 3 is associated with severe peach allergy symptoms^67,68, while on the other hand, Pru P 1 and Pru P 4 are mostly associated with milder forms of peach allergy (oral allergy symptoms)³². It has been suggested that the difference in severity of peach allergy might be related to the physical and chemical structure of the allergens, with Pru P 1 and Pru P 4 easily deactivated by the higher temperature and acidic environment in our gastrointestinal tract, while Pru P 3 is strongly heat and pepsin resistant⁶⁹. Heat resistant allergens are usually unlikely to be outgrown and are known to be highly allergenic. Interestingly, individuals with the severe form of peach allergy are at higher risk of developing a severe form of tomato allergy⁷⁰, even though the similarity between peach and tomato lipid transfer protein is only 49%⁶⁷. Other reported peach allergy cross-reactivity includes fennel and to a rarer extent rice allergy⁷¹. In addition, a recent Japanese study found that 20% of 100 fruit-sensitive FA patients lacked srIgE reactivity to common birch tree Bet v 1 homologs or profilin, and that 65% of those lacked reactivity with lipid transfer proteins but did react with GRB⁷². Surprisingly, a majority of GRB reactive patients (84.6%) required some augmenting factor such as NSAID or exercise to develop an allergic response, suggesting that for particular antigens, food challenge by itself may not suffice to diagnose FA.

In this study, HLA-DRB1*09:01-HLA-DQB1*03:03 was found to be the top HLA susceptibility haplotype for peach allergy (P = 1.15E-07; OR = 1.68), but in contrast to shrimp allergy, three additional predisposing haplotypes with lower but comparable significance were identified by the RPE analysis (P ≤ 4.31E-05; OR range 1.68–2.46). Besides that, HLA-DRB1*09:01 is famously known to be the susceptibility HLA allele for Japanese rheumatoid arthritis (RA)⁷³. A previous report has suggested a putative link between gut immunity and RA, with cross-reactive food antibodies (IgA, IgG and IgM) reported to be increased in the gut of RA patients⁷⁴. Intervention studies have reported that decreasing animal protein and saturated fat intake and increasing fruit/vegetable, legume, and fish intake is associated with decreased levels of some markers of RA disease status^75,76, which may relate to increased inflammatory status associated with the Western diet. That could suggest that food allergy may also contribute to increased inflammation in the gut and increased gut permeability that has been associated with RA^77,78.

In addition to the food allergy associated HLA alleles and haplotypes, our analysis also found the top associated SNPs to be strongly associated in GTExPortal and HGVD data with the expression of various HLA genes. Among several HLA Class II genes for which SNPs were associated (Supplementary Worksheet S4), our analysis found only HLA-DQA2 expression to be associated with both peach and shrimp allergy associated variants. HLA-DQA2 and its beta-subunit HLA-DQB2, are paralogous genes to HLA-DQA1 and HLA-DQB1 that are only lowly polymorphic but are highly conserved across both Old and New World primates⁷⁹. Despite the strong conservation across species, HLA-DQA2 was originally reported to be only lowly expressed and HLA-DQB2 not expressed in lymphoblastoid cell lines^79,80, and later in vivo footprinting assays found a lack of protein-DNA interactions in the same cell type⁸¹. However, a more recent report comparing the transcriptome of different dendritic cell types (DCs) identified strong expression of both HLA-DQA2 and HLA-DQB2 genes in Epidermal Langerhans Cells (LCs)⁸², in contrast to monocyte-derived, CD1c+, or plasmacytoid DCs, which lacked expression of the two genes. Further, that study identified cell surface expression of HLA-DQA2/HLA-DQB2 heterodimers, and intriguingly, they also identified the presence of mixed HLA-DQA2/HLA-DQB1 heterodimers and concluded that these genes’ expression in LCs could represent a new added level of complexity to the repertoire of antigens that could be presented by these cells. To confirm their results, we used FANTOM5 Phase 1 Cap Analysis of Gene Expression (CAGE) data^83,84 to compare relative expression levels between the four HLA-DQ genes and found very different profiles in LCs compared to other lymphoid and dendritic cell-types (Supplementary Fig. S6). LCs displayed the highest CAGE expression levels of all four genes compared to other cell-types, with HLA-DQB2 expression limited to just LCs and HLA-DQA2 expression in LCs 2–7 times greater than that observed in the top non-LC cell-types. As a presumed subtype of dendritic cells, LCs are considered to be a type of antigen presenting cell that are found within the stratified squamous epithelial cells of the skin epidermis and oral mucosa⁸⁵, as well as in gut-associated lymphoid tissues⁸⁶. LCs have been described as central mediators of skin immunity⁸⁷, have also been implicated in the pathogenesis of atopic dermatitis⁸⁸, and are also a specific target for new allergen immunotherapies⁸⁹. Since LCs appear to express the related HLA-DQ genes at higher levels than found in other cell-types, we suggest that LCs should be a target for future eQTL analyses.

In addition to the concerns noted above with respect to the inaccuracy of SrFA based studies, our questionnaire also did not query the subjects for the timeframe during which they had acquired their FA, whether they were responding to current or also childhood FA, or solicit them for whether their FA was of an immediate or delayed type response. Because of those concerns, we expect that cases in our analysis would represent a heterogeneous group. Some of that could possibly be captured by defining cases as those that experienced severe symptoms, namely loss of consciousness and/or anaphylaxis, but lack of power with such a small sample led us to combine the mild and severe responders into a combined case group. A larger sample set with a more comprehensive FA related questionnaire and follow-up with skin prick tests, srIgE, and DBPCFC in positive responders would likely lead to a finer understanding of whether HLA haplotypes and/or eQTL variants are responsible for different aspects of food allergy aetiology.

Conclusion

This study identified variants in the HLA region that were associated with self-reported peach or shrimp allergy in a Japanese population sample. We found both shrimp and peach allergy to be associated with variants that are eQTLs for HLA-DQA2 gene expression as well as associated with specific HLA-DRB1 and HLA-DQB1 alleles. However, of HLA alleles that were previously investigated for the capacity to bind shrimp epitopes⁶⁵, none were associated in our dataset. Taken together, that may suggest that the genetics of food allergy is more complicated than the mere presence of a particular HLA allele that has a greater capacity to bind a certain food-antigen derived epitope, and that modulation of the expression of related HLA genes by genetic variants could play a role in the aetiology of certain food allergies. The general restriction of HLA-DQA2 expression to LCs and the presence of LCs in various tissue-types that would be readily exposed to food during consumption (i.e., oral mucosa, gut associated lymphoid tissues) suggests that LCs may be a focal point for expanding our understanding of food allergy pathogenesis.

Methods

Subject, sample, and phenotype data collection

Subjects for this study were gathered by requesting users of MTI’s (http://www.mti.co.jp/eng/) “Luna Luna” women’s healthcare-related information website and apps to voluntarily participate in a study run by the MTI subsidiary EverGene to investigate the genetics of various human traits. Subjects were collected in two stages, denoted as LL01 and LL02, with DNA collected using saliva sampling kits (OraGene; DNA Genotek, Inc., Ottawa, Canada) and questionnaires soliciting trait information created using Survey Monkey (http://www.surveymonkey.com) and filled-out by subjects online. In total, we acquired 11379 subjects for analysis (LL01 = 5751, LL02 = 5628). The study design, including the consent form, general questionnaire topics, and genotyping, was approved by the Institutional Review Board at the Tsukuba International Clinical Pharmacology Clinic. The study was executed in accordance with applicable regulations and guidelines, and written informed consent was obtained from each patient for sample collection, genotyping, trait questionnaire, and trait analysis using genome-wide association study analysis.

Sample processing, genotyping and quality control

Saliva sample kits were processed by Takara Bio (Kusatsu, Shiga Prefecture, Japan). LL01 and LL02 stage sample plates were genotyped separately for each stage by Takara Bio on a custom East Asian specific Axiom array (EverGene1). The EverGene1 chip contains 607857 total variants, with most variants chosen from Axiom CHB-1 chip SNPs that had MAF ≥ 0.01 in 1000 Genomes Project Japanese ancestry samples and additional custom variants selected from those with known pathogenic or phenotypic associations. Each stage’s genotypes were divided into separate batches and called separately using Affymetrix Analysis Suite 1.1.0616. Within the samples genotyped during each stage, there were 329 duplicate samples that we used to calculate genotype concordance between the two stages for each SNP. For association analyses, we only included autosomal and chromosome X variants that fulfilled the following criteria in both stages: 1) >99% call-rate, 2) MAF ≥ 0.01, 3) HWE P-value > 1 × 10⁻⁶, and 4) concordance-rate > 90%. After applying those filters, there were 536506 variants, of which 2417 were insertion-deletion polymorphisms (INDEL). Across those QC + variants, the average concordance-rate was 99.85 ± 0.30% (mean ± SD).

Principal component analysis (PCA)

We downloaded genotype data for 2504 samples from the 1000 Genomes Project Phase 3 sample populations (ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20130502/) to help identify any admixed samples in our Japanese sample dataset. We then performed LD-pruning across the 1000 Genomes and LL01 and LL02 (LL01/LL02) genotype datasets using PLINK2 v1.90p (release date 16 Aug 2016)^90,91 with r² < 0.2, which identified 121595 SNPs with no or low-LD. We performed a principal component analysis (PCA)⁹² using PLINK with the LD-pruned SNPs, and after one round of PCA (PCA1; Supplementary Fig. S1a and S1b), we identified a small number of samples that were outliers (n = 19) to the typically recognized East Asian cluster^93,94. Overlap with other populations suggests some level of admixture with European, African, or South Asian ancestry. To reduce downstream biases, we removed the admixed samples and performed a second round of PCA with LL01/LL02 + 1000 G EAS samples to help identify overlap of clustered samples with known East Asian sub-groups (PCA2; Supplementary Fig. 1c). After that, we performed a third round of PCA with just those LL01/LL02 samples so that the top PCs would reflect the main genetic axes of East Asian and Japanese population structure inherent to our population samples (PCA3; Supplementary Fig. 1d).

Identification of duplicated samples

Using the same LD-pruned SNP data, we performed identify-by-descent (IBD) analysis using PLINK2 ver. 1.90p’s to identify potential duplicated samples. For eleven pairs of samples that appeared to be duplicate samples (PI_HAT > 0.8), we removed one from each pair from downstream analyses.

Definition of food allergy cases and controls

Food allergy questionnaire items solicited as to whether an individual had either mild allergic reactions (“itching, hives, swelling of the lips and eyelids, vomiting”) or severe allergic reactions (“consciousness disorder, anaphylactic shock, such as a drop in blood pressure”) to a particular food. We coded food allergy cases as those that provided an affirmative response in either of the two response groups. For all food allergies examined, we used as controls 8350 individuals (LL01 = 4225; LL02 = 4125) that had no affirmative responses for either mild or severe food allergies. Within subjects who passed our sample and genotype QC procedures, there were 11011 samples who also responded to our food allergy questionnaire items and who were included in the GWAS analyses.

Statistical analysis and genotype imputation

Basic data management, statistical analyses, and plotting were performed using the R 3.4.1 statistical programming environment⁹⁵. Import and export of Microsoft Excel formatted data was done using the R package XLConnect⁹⁶. We performed the primary association analysis using PLINK2’s logistic regression analysis method in an additive test for allelic association, with PC1 and PC2 from the PCA3 included as covariates. For seven foods that had more than 100 samples in both LL01 and LL02 stages, we utilized a bi-directional “discovery-evaluation” GWAS strategy³⁶. Using that strategy, we used both sample sets simultaneously as discovery and evaluation and defined candidate SNPs as genotyped SNPs that fulfilled P_discovery ≤ 1 × 10⁻⁴ in one sampleset and P_evaluation < 0.05 and Benjamini-Hochberg FDR < 0.2 in the other sample set (Flow-chart: Supplementary Fig. S3). Meta-analysis statistics were calculated for each SNP using the inverse-variance weighting method⁹⁷ to pool LL01 and LL02 samplesets’ beta-coefficients and standard errors as input. Based on the effective independent SNP count (M_E) for a similarly sized array platform and the JPT population³⁷, genotyped SNPs identified from the discovery-evaluation strategy were then called as nominally associated if they achieved a single GWAS P-value cut-off of P_meta < 1.21 × 10⁻⁷ (0.05/411,521). Those passing a multiple-testing adjusted P-value cut-off of P_meta < 4.4 × 10⁻⁹ (P_meta < 1.21 × 10⁻⁷/27 foods) were then considered as strongly associated. For producing the Manhattan plot (Fig. 1), we used the DISTMIX program⁹⁸ and 1000 Genomes Project Phase 1 Release 3⁹⁹ SNP correlation data to perform genome-wide imputation using the meta-analysis summary statistics; a manual population weight file was used to reflect a predominately Japanese population sample with slight proportions of other East Asian samples (JPT = 0.95, CHB = 0.04, CHS = 0.01). Output was filtered for SNPs that had DISTMIX info values greater than 0.8.

For two foods with SNPs achieving the significance cut-offs, we performed regional genotype imputation around associated variants by pre-phasing the LL01/LL02 genotyping dataset with EAGLE 2.3^100,101 and then imputing using BEAGLE 4.1¹⁰² with the 1000 G Phase 3 reference haplotypes¹⁰³. We imputed SNPs within 2 Mb of each associated signal, and then performed logistic regression analysis conditioning on the top imputed variant to better understand the structure of the association signal and identify un-genotyped top associated variants in each signal that could be candidate causal variants. Compared to BEAGLE imputed data, DISTMIX displayed inflated significance for top shrimp SNPs and depressed significance for associated peach variants.

Linkage disequilibrium (LD) statistics

For sorting and ranking SNPs in LD with a top associated variant, we report two different measures. First, we report the common LD r² measure produced using PLINK’s –r2 command, which calculates correlation of alternate allele dosage at two SNPs. Second, we calculated a simple metric, which we refer to as r²_equiv, of how association statistics decrease after conditioning on a top SNP in a particular genomic region. For a single top SNP in a region, r²_equiv was calculated using −log₁₀ transformed P-values of the top SNP A along with those of a tested SNP B before (B) and after conditioning (B|A). For test statistic Z = −log₁₀(P), then r²_equiv = (Z_B − Z_B|A)/Z_A. Unless otherwise stated, moderate LD to a top SNP was defined as r²_equiv > 0.5 and high LD to a top SNP defined as r²_equiv > 0.8.

In silico functional analysis of associated variants

We annotated variants using HaploReg 4.1³⁸, which includes regulatory annotation for SNPs predicted to modify transcription factor motifs, DNase hypersensitivity sites (DHS), DNA methylation, evolutionary conservation scores (GERP), gene overlap, and eQTL annotations. HaploReg also includes annotation for overlap with previous GWAS data, but to include a more current dataset, we downloaded the NHGRI/EBI GWAS Catalog (dated October 18, 2017)^39,104 and known gene tables from the UCSC Genome Browser web-site^105,106. To examine overlap with transcription factor binding sites (TFBS), we created a .bed file for our variants, moved the genome-build coordinates from hg19 to hg38 using the UCSC liftOver tool, and then annotated them using the ReMap 2018 Annotation Tool¹⁰⁷.

eQTL analysis

We analysed eQTL data for both the European ancestry sample based GTExPortal version 6p^41,108 and the Japanese ancestry sample based Human Genome Variation Database eQTL Release Version 8.1⁴³. HGVD data was downloaded from their website (http://www.hgvd.genome.med.kyoto-u.ac.jp/eQTL/version.8.1/HGVDeQTL-V8_1-cis.tar.gz), and the GTExPortal significant cis-eQTL data file (GTEx_Analysis_v6p_eQTL.tar) was downloaded from https://gtexportal.org/home/datasets.

GTExPortal cis-eQTL data consists of two files each for 44 tissues: one file with a summary for each eQTL gene (*_Analysis.v6p.egenes.txt.gz) and the other with eQTL SNP-gene pairs (*_Analysis.v6p.signif_snpgene_pairs.txt.gz). The data as downloaded had been filtered by GTExPortal to included only significant SNP-gene pairs after permutation testing and application of a false-discovery rate (FDR < 0.05).

HGVD data consists of eQTL statistics files (sva.corrected.pheno.*.cis.gz) and annotation tables for SNPs (snpID.table) and genes (probeID.table). All were imported into R and merged into a single large table to use for analyses. While GTExPortal SNPs were already pre-filtered using the FDR, HGVD variants were not pre-filtered. To select SNPs that were determined by HGVD to account for some portion of the variance in a particular gene’s expression, we select HGVD SNPs that had at least R² > 0.1 in HGVD’s PLINK^90,91 based association analysis output.

SNP rsids used by HGVD and GTExPortal were updated to dbSNP147 when possible.

To identify SNPs associated with peach or shrimp food-allergy as well as with gene expression in GTExPortal and HGVD data, we selected moderate LD GWAS variants as those with either r² > 0.5 or r²_equiv > 0.5 to the top GWAS SNP. Within each eQTL dataset, we calculated eQTL SNPs’ relative signal strength with respect to the minimum P-value at the top SNP for a particular gene as RSS = −log₁₀(P_SNP)/−log₁₀(P_Gene.min). We then labelled the moderate LD GWAS SNPs for whether they were tentative eQTL SNPs with RSS_HGVD > 0.2 or max.RSS_GTEx > 0.2 (max.RSS_GTEx is the maximum RSS value across all associated tissues for a particular SNP-gene pair). That filter is meant to select variants that have relatively strong evidence for being an eQTL compared to the larger set of variants in the GTExPortal and HGVD output. The final set of candidate eQTL regulated genes for each food allergy (Supplementary Worksheet S4) are genes for which at least one SNP was associated with its expression in both eQTL databases: (r² > 0.5 or r²_equiv > 0.5) & RSS_HGVD > 0.2 & RSS_GTEx > 0.2. eQTL data shown in Supplementary Worksheets S5 and S6 is the larger set of gene-SNP pairs for which (r² > 0.5 or r²_equiv > 0.5) & (RSS_HGVD > 0.2 | RSS_GTEx > 0.2); for GTEx Portal data, we filtered for data from higher confidence genes with one or more SNPs associated in a large number of tissue samples (≥10 tissue samples). One caveat for this analysis is that while GTExPortal eQTLs represents a rather comprehensive set of common variants, the HGVD data was based on micro-array genotyping with about 1.4 million analysed SNPs, and therefore, HGVD is missing data at about 75% of positions.

HLA imputation

We extracted genotype data for EverGene1 SNPs between base pairs 25,759,242 and 33,534,827 (GRCh37/hg19 coordinates) on chromosome 6 and then performed HLA imputation for HLA DRB1, DQB1, and DPB1 using the HIBAG R package¹⁰⁹ along with Japanese specific HLA classifier data that we previously published¹¹⁰. Post-imputation quality control (Call Threshold, CT > 0.5) was applied to the imputed HLA alleles. To analyse concordance between imputed and laboratory determined HLA alleles, we selected 126 samples with representative imputed HLA alleles and HLA typing performed by an outside laboratory (Wakunaga Pharmaceutical Co., Ltd., Akitaka City, Hiroshima, Japan).

HLA statistical analysis

Basic immunogenetic analyses of HLA loci and alleles were performed using the R BIGDAWG package¹¹¹. To account for the impact of strong effect alleles, we implemented theses analyses in a Relative Predispositional Effect (RPE) framework^44,45. RPE involves evaluation of a locus’ significance using a chi-square test of the 2xk contingency table of case/control status versus alleles, followed by a Pearson’s chi-square test of the 2 × 2 table of each individual allele’s significance if significance at a locus falls below a chosen cut-off (P_locus < 0.05). After the initial locus and allele evaluation (RPE step 0), the top effect allele (based on lowest P-value) was removed from both cases and controls, and then the locus and allele significance retested. That removal was then redone in a stepwise fashion, with subsequent top effect alleles added to the previous list for removal and re-evaluation until P_locus > = 0.05. Alleles with expected counts less than five in cases or controls were automatically collapsed by the BIGDAWG software into a “binned” category for the analysis.

We performed initial 2-locus (HLA-DRB1-DQB1) or 3-locus (HLA-DRB1-DQB1-DPB1) haplotype analysis using BIGDAWG and then ran an analysis in an RPE framework using the estimated haplotypes. Only samples with no missing HLA alleles were used in the analysis, and HLA haplotypes with <1% frequency in either cases or controls were placed in the “binned” category.

HLA-DQ gene expression in FANTOM5 data

We downloaded FANTOM5 CAGE RLE expression data for HLA-DQA1, HLA-DQB1, HLA-DQA2, HLA-DQB2 genes from the Zenbu browser: (http://fantom.gsc.riken.jp/zenbu/gLyphs/#config = ONHzqgf2E5Xtmnpsh2gURB)^84,112 for the “FANTOM5 CAGE Phase1 CTSS human tracks pooled filtered with 3 or more tags per library” track. We filtered the data for high-expressing tissue samples: those that had non-zero RLE values for a particular gene and then were in the upper-quartile of samples. We then extracted tissues that expressed two or more of the four genes for plotting in Supplementary Fig. S6.

HLA allele and haplotype frequency data

We queried and downloaded frequency data for HLA alleles and haplotypes from The Allele Frequency Net Database (http://www.allelefrequencies.net/hla6006a.asp?)¹¹³ with the following URL command strings following the trailing question-mark:

HLA-DRB1*09:01

hla_allele1 = DRB1*09:01&hla_allele2 = DRB1*09:01

HLA-DRB1*04:05

hla_allele1 = DRB1*04:05&hla_allele2 = DRB1*04:05

HLA-DQB1*03:03

hla_allele1 = DQB1*03:03&hla_allele2 = DQB1*03:03

HLA-DQB1*04:01

hla_allele1 = DQB1*04:01&hla_allele2 = DQB1*04:01

HLA-DRB1*09:01-DQB1*03:03

hla_selection = DRB1*09:01-DQB1*03:03

HLA-DRB1*04:05-DQB1*04:01

hla_selection = DRB1*04:05-DQB1*04:01

Figure plotting

Manhattan plots, regional association figures and other plots were produced using self-written R programs. HLA allele and haplotype frequency distribution maps were plotted using the R package rworldmap Ver.1.3-4¹¹⁴.

Data availability

Original genotype data is not publicly available for the EverGene data due to strict consent requirements to protect subjects’ privacy. In its place, we provide files of genome-wide summary statistics for the genotyped and regionally imputed data for the two foods that possessed genome-wide significant association signals, with peach and shrimp allergy analyses provided as Supplementary Datasets S1 and S2, respectively. Files for the five foods lacking significant signals can be obtained upon request to the corresponding author.

References

Eder, W., Ege, M. J. & von Mutius, E. The asthma epidemic. N Engl J Med 355, 2226–2235 (2006).
Article CAS PubMed Google Scholar
Sicherer, S. H. & Sampson, H. A. Food allergy: Epidemiology, pathogenesis, diagnosis, and treatment. J Allergy Clin Immunol 133, 291–307; quiz 308 (2014).
Prescott, S. & Allen, K. J. Food allergy: riding the second wave of the allergy epidemic. Pediatr Allergy Immunol 22, 155–160 (2011).
Article PubMed Google Scholar
Kamdar, T. A. et al. Prevalence and characteristics of adult-onset food allergy. J Allergy Clin Immunol Pract 3, 114–5.e1 (2015).
Article PubMed Google Scholar
Rentzos, G., Johanson, L., Sjölander, S., Telemo, E. & Ekerljung, L. Self-reported adverse reactions and IgE sensitization to common foods in adults with asthma. Clin Transl Allergy 5, 25 (2015).
Article CAS PubMed PubMed Central Google Scholar
Le, T. M. et al. Food allergy in the Netherlands: differences in clinical severity, causative foods, sensitization and DBPCFC between community and outpatients. Clin Transl Allergy 5, 8 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ballmer-Weber, B. K. Food allergy in adolescence and adulthood. Chem Immunol Allergy 101, 51–58 (2015).
PubMed Google Scholar
Sicherer, S. H. Food allergy. Lancet 360, 701–710 (2002).
Article PubMed Google Scholar
Sforza, S. & Prandi, B. In Encyclopedia of Food and Health (eds Caballero, B., Finglas, P. M. & Toldrá, F.) 743–748 (Academic Press, 2015).
Sicherer, S. H. Food allergy. Mt Sinai J Med 78, 683–696 (2011).
Article PubMed Google Scholar
Ridolo, E., Martignago, I., Senna, G. & Ricci, G. Scombroid syndrome: it seems to be fish allergy but. it isn’t. Curr Opin Allergy Clin Immunol 16, 516–521 (2016).
Article CAS PubMed Google Scholar
Raphael, G., Raphael, M. H. & Kaliner, M. Gustatory rhinitis: a syndrome of food-induced rhinorrhea. J Allergy Clin Immunol 83, 110–115 (1989).
Article CAS PubMed Google Scholar
Ebisawa, M., Ito, K., Fujisawa, T. & Committee for Japanese Pediatric Guideline for Food Allergy, T. J. S. O. P. A. A. C. I., The Japanese Society of Allergology. Japanese guidelines for food allergy 2017. Allergol Int 66, 248–264 (2017).
Kobayashi, T. et al. Eighteen cases of wheat allergy and wheat-dependent exercise-induced urticaria/anaphylaxis sensitized by hydrolyzed wheat protein in soap. Int J Dermatol 54, e302–5 (2015).
Article CAS PubMed Google Scholar
Akiyama, H., Imai, T. & Ebisawa, M. Japan food allergen labeling regulation–history and evaluation. Adv Food Nutr Res 62, 139–171 (2011).
Article CAS PubMed Google Scholar
Levine, B. B., Stember, R. H. & Fotino, M. Ragweed hay fever: genetic control and linkage to HL-A haplotypes. Science 178, 1201–1203 (1972).
Article ADS CAS PubMed Google Scholar
Freidhoff, L. R. et al. Association of HLA-DR3 with human immune response to Lol p I and Lol p II allergens in allergic subjects. Tissue Antigens 31, 211–219 (1988).
Article CAS PubMed Google Scholar
Ansari, A. A., Freidhoff, L. R., Meyers, D. A., Bias, W. B. & Marsh, D. G. Human immune responsiveness to Lolium perenne pollen allergen Lol p III (rye III) is associated with HLA-DR3 and DR5. Hum Immunol 25, 59–71 (1989).
Article CAS PubMed Google Scholar
Fischer, G. F. et al. Association between IgE response against Bet v I, the major allergen of birch pollen, and HLA-DRB alleles. Hum Immunol 33, 259–265 (1992).
Article CAS PubMed Google Scholar
Torgerson, D. G. et al. Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations. Nat Genet 43, 887–892 (2011).
Article CAS PubMed PubMed Central Google Scholar
Moffatt, M. F. et al. A large-scale, consortium-based genomewide association study of asthma. N Engl J Med 363, 1211–1221 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hirota, T. et al. Genome-wide association study identifies three new susceptibility loci for adult asthma in the Japanese population. Nat Genet 43, 893–896 (2011).
Article CAS PubMed PubMed Central Google Scholar
Paternoster, L. et al. Meta-analysis of genome-wide association studies identifies three new risk loci for atopic dermatitis. Nat Genet 44, 187–192 (2012).
Article CAS Google Scholar
Sun, L. D. et al. Genome-wide association study identifies two new susceptibility loci for atopic dermatitis in the Chinese Han population. Nat Genet 43, 690–694 (2011).
Article CAS PubMed Google Scholar
Hirota, T. et al. Genome-wide association study identifies eight new susceptibility loci for atopic dermatitis in the Japanese population. Nat Genet 44, 1222–1226 (2012).
Article CAS PubMed Google Scholar
Hong, X. et al. Genome-wide association study identifies peanut allergy-specific loci and evidence of epigenetic mediation in US children. Nat Commun 6, 6304 (2015).
Article CAS PubMed PubMed Central Google Scholar
Martino, D. J. et al. Genomewide association study of peanut allergy reproduces association with amino acid polymorphisms in HLA-DRB1. Clin Exp Allergy 47, 217–223 (2017).
Article CAS PubMed Google Scholar
Madore, A. M. et al. HLA-DQB1*02 and DQB1*06:03P are associated with peanut allergy. Eur J Hum Genet 21, 1181–1184 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fernandez, C. A. et al. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity. Blood 126, 69–75 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shanti, K. N., Martin, B. M., Nagpal, S., Metcalfe, D. D. & Rao, P. V. Identification of tropomyosin as the major shrimp allergen and characterization of its IgE-binding epitopes. J Immunol 151, 5354–5363 (1993).
CAS PubMed Google Scholar
Pastorello, E. A. et al. Complete amino acid sequence determination of the major allergen of peach (Prunus persica) Pru p 1. Biol Chem 380, 1315–1320 (1999).
Article CAS PubMed Google Scholar
Gamboa, P. M. et al. Two different profiles of peach allergy in the north of Spain. Allergy 62, 408–414 (2007).
Article CAS PubMed Google Scholar
Gaier, S. et al. Purification and structural stability of the peach allergens Pru p 1 and Pru p 3. Mol Nutr Food Res 52(Suppl 2), S220–9 (2008).
PubMed Google Scholar
Wang, J. et al. Correlation of IgE/IgG4 milk epitopes and affinity of milk-specific IgE antibodies with different phenotypes of clinical milk allergy. J Allergy Clin Immunol 125(695–702), 702.e1 (2010).
Google Scholar
Sicherer, S. H. & Sampson, H. A. Food allergy. J Allergy Clin Immunol 125, S116–25 (2010).
Article PubMed Google Scholar
Zeng, Y. et al. Novel loci and pathways significantly associated with longevity. Sci Rep 6, 21243 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, M. X., Yeung, J. M., Cherny, S. S. & Sham, P. C. Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets. Hum Genet 131, 747–756 (2012).
Article CAS PubMed Google Scholar
Ward, L. D. & Kellis, M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res 40, D930–4 (2012).
Article CAS PubMed Google Scholar
Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42, D1001–6 (2014).
Article CAS PubMed Google Scholar
MacArthur, J. et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res 45, D896–D901 (2017).
Article PubMed Google Scholar
Consortium, G. T. E. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article CAS Google Scholar
Higasa, K. et al. Human genetic variation database, a reference database of genetic variations in the Japanese population. J Hum Genet 61, 547–553 (2016).
Article CAS PubMed PubMed Central Google Scholar
Narahara, M. et al. Large-scale East-Asian eQTL mapping reveals novel candidate genes for LD mapping and the genomic landscape of transcriptional effects of sequence variants. PLoS One 9, e100924 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Payami, H. et al. Relative predispositional effects (RPEs) of marker alleles with disease: HLA-DR alleles and Graves disease. Am J Hum Genet 45, 541–546 (1989).
CAS PubMed PubMed Central Google Scholar
Hollenbach, J. A., Mack, S. J., Thomson, G. & Gourraud, P. A. Analytical methods for disease association studies with immunogenetic data. Methods Mol Biol 882, 245–266 (2012).
Article CAS PubMed PubMed Central Google Scholar
Werfel, T. et al. Position paper of the EAACI: food allergy due to immunological cross-reactions with common inhalant allergens. Allergy 70, 1079–1090 (2015).
Article CAS PubMed Google Scholar
Marenholz, I. et al. Genome-wide association study identifies the SERPINB gene cluster as a susceptibility locus for food allergy. Nat Commun 8, 1056 (2017).
Article PubMed PubMed Central Google Scholar
Verrill, L., Bruns, R. & Luccioli, S. Prevalence of self-reported food allergy in U.S. adults: 2001, 2006, and 2010. Allergy Asthma Proc 36, 458–467 (2015).
Article PubMed PubMed Central Google Scholar
Soller, L. et al. Prevalence and predictors of food allergy in Canada: a focus on vulnerable populations. J Allergy Clin Immunol Pract 3, 42–49 (2015).
Article PubMed Google Scholar
Ben-Shoshan, M. et al. A population-based study on peanut, tree nut, fish, shellfish, and sesame allergy prevalence in Canada. J Allergy Clin Immunol 125, 1327–1335 (2010).
Article CAS PubMed Google Scholar
Niggemann, B. & Beyer, K. Factors augmenting allergic reactions. Allergy 69, 1582–1587 (2014).
Article CAS PubMed Google Scholar
Rentzos, G., Lundberg, V., Stotzer, P. O., Pullerits, T. & Telemo, E. Intestinal allergic inflammation in birch pollen allergic patients in relation to pollen season, IgE sensitization profile and gastrointestinal symptoms. Clin Transl Allergy 4, 19 (2014).
Article PubMed PubMed Central CAS Google Scholar
National_Marine_Fisheries_Service. Fisheries Economics of the United States Report. https://www.fisheries.noaa.gov/resource/data/fisheries-economics-united-states-report-2015 (2015).
Liu, Y., Wang, Z. & Zhang, J. Dietary Chinese Herbs (Springer Science & Business Media, 2015).
Lertnawapan, R. & Maek-a-nantawat, W. Anaphylaxis and biphasic phase in Thailand: 4-year observation. Allergol Int 60, 283–289 (2011).
Article PubMed Google Scholar
Smit, D. V., Cameron, P. A. & Rainer, T. H. Anaphylaxis presentations to an emergency department in Hong Kong: incidence and predictors of biphasic reactions. J Emerg Med 28, 381–388 (2005).
Article PubMed Google Scholar
Techapornroong, M., Akrawinthawong, K., Cheungpasitporn, W. & Ruxrungtham, K. Anaphylaxis: a ten years inpatient retrospective study. Asian Pac J Allergy Immunol 28, 262–269 (2010).
PubMed Google Scholar
Thong, B. Y., Cheng, Y. K., Leong, K. P., Tang, C. Y. & Chng, H. H. Anaphylaxis in adults referred to a clinical immunology/allergy centre in Singapore. Singapore Med J 46, 529–534 (2005).
CAS PubMed Google Scholar
Ross, M. P. et al. Analysis of food-allergic and anaphylactic events in the National Electronic Injury Surveillance System. J Allergy Clin Immunol 121, 166–171 (2008).
Article PubMed Google Scholar
Brown, A. F., McKinnon, D. & Chu, K. Emergency department anaphylaxis: A review of 142 patients in a single year. J Allergy Clin Immunol 108, 861–866 (2001).
Article CAS PubMed Google Scholar
Matsuo, H., Yokooji, T. & Taogoshi, T. Common food allergens and their IgE-binding epitopes. Allergol Int 64, 332–343 (2015).
Article CAS PubMed Google Scholar
Leung, N. Y. et al. Current immunological and molecular biological perspectives on seafood allergy: a comprehensive review. Clin Rev Allergy Immunol 46, 180–197 (2014).
Article CAS PubMed Google Scholar
García, B. E. & Lizaso, M. T. Cross-reactivity syndromes in food allergy. J Investig Allergol Clin Immunol 21, 162–70; quiz 2 p following 170 (2011).
Lopata, A. L., O’Hehir, R. E. & Lehrer, S. B. Shellfish allergy. Clin Exp Allergy 40, 850–858 (2010).
Article CAS PubMed Google Scholar
Ravkov, E. V. et al. Identification and validation of shrimp-tropomyosin specific CD4 T cell epitopes. Hum Immunol 74, 1542–1549 (2013).
Article CAS PubMed Google Scholar
Wang, S. et al. Penaeus monodon tropomyosin induces CD4 T-cell proliferation in shrimp-allergic patients. Hum Immunol 73, 426–431 (2012).
Article CAS PubMed Google Scholar
Mascheri, A. et al. Hypersensitivity to Tomato (Lycopersicon esculentum) in Peach-Allergic Patients: rPrup 3 and rPrup 1 Are Predictive of Symptom Severity. J Investig Allergol Clin Immunol 25, 183–189 (2015).
CAS PubMed Google Scholar
Gao, Z. S. et al. IgE-binding potencies of three peach Pru p 1 isoforms. Mol Nutr Food Res 60, 2457–2466 (2016).
Article CAS PubMed Google Scholar
Pastorello, E. A. et al. Anti-rPru p 3 IgE levels are inversely related to the age at onset of peach-induced severe symptoms reported by peach-allergic adults. Int Arch Allergy Immunol 162, 45–49 (2013).
Article CAS PubMed Google Scholar
Pastorello, E. A. et al. Clinical role of a lipid transfer protein that acts as a new apple-specific allergen. J Allergy Clin Immunol 104, 1099–1106 (1999).
Article CAS PubMed Google Scholar
Pastorello, E. A. et al. Rice allergy demonstrated by double-blind placebo-controlled food challenge in peach-allergic patients is related to lipid transfer protein reactivity. Int Arch Allergy Immunol 161, 265–273 (2013).
Article CAS PubMed Google Scholar
Inomata, N., Miyakawa, M. & Aihara, M. High prevalence of sensitization to gibberellin-regulated protein (peamaclein) in fruit allergies with negative immunoglobulin E reactivity to Bet v 1 homologs and profilin: Clinical pattern, causative fruits and cofactor effect of gibberellin-regulated protein allergy. J Dermatol 44, 735–741 (2017).
Article CAS PubMed Google Scholar
Shimane, K. et al. An association analysis of HLA-DRB1 with systemic lupus erythematosus and rheumatoid arthritis in a Japanese population: effects of *09:01 allele on disease phenotypes. Rheumatology (Oxford) 52, 1172–1182 (2013).
Article CAS Google Scholar
Hvatum, M., Kanerud, L., Hällgren, R. & Brandtzaeg, P. The gut-joint axis: cross reactive food antibodies in rheumatoid arthritis. Gut 55, 1240–1247 (2006).
Article CAS PubMed Google Scholar
McKellar, G. et al. A pilot study of a Mediterranean-type diet intervention in female patients with rheumatoid arthritis living in areas of social deprivation in Glasgow. Ann Rheum Dis 66, 1239–1243 (2007).
Article CAS PubMed PubMed Central Google Scholar
Sköldstam, L., Hagfors, L. & Johansson, G. An experimental study of a Mediterranean diet intervention for patients with rheumatoid arthritis. Ann Rheum Dis 62, 208–214 (2003).
Article PubMed PubMed Central Google Scholar
Nissinen, R. et al. Immune activation in the small intestine in patients with rheumatoid arthritis. Ann Rheum Dis 63, 1327–1330 (2004).
Article CAS PubMed PubMed Central Google Scholar
Porzio, V. et al. Intestinal histological and ultrastructural inflammatory changes in spondyloarthropathy and rheumatoid arthritis. Scand J Rheumatol 26, 92–98 (1997).
Article CAS PubMed Google Scholar
Berdoz, J., Tiercy, J. M., Rollini, P., Mach, B. & Gorski, J. Remarkable sequence conservation of the HLA-DQB2 locus (DX beta) within the highly polymorphic DQ subregion of the human MHC. Immunogenetics 29, 241–248 (1989).
Article CAS PubMed Google Scholar
Yu, L. P. & Sheehy, M. J. The cryptic HLA-DQA2 (”DX alpha”) gene is expressed in human B cell lines. J Immunol 147, 4393–4397 (1991).
CAS PubMed Google Scholar
Indovina, P. et al. Absence of in vivo DNA-protein interactions in the DQA2 and DQB2 promoter regions. Hum Immunol 62, 504–508 (2001).
Article CAS PubMed Google Scholar
Lenormand, C. et al. HLA-DQA2 and HLA-DQB2 genes are specifically expressed in human Langerhans cells and encode a new HLA class II molecule. J Immunol 188, 3903–3911 (2012).
Article CAS PubMed Google Scholar
FANTOM_Consortium_and_the_RIKEN_PMI_and_CLST_(DGT) et al. A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
Lizio, M. et al. Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol 16, 22 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jaitley, S. & Saraswathi, T. Pathophysiology of Langerhans cells. J Oral Maxillofac Pathol 16, 239–244 (2012).
Article PubMed PubMed Central Google Scholar
Chang, S. Y. & Kweon, M. N. Langerin-expressing dendritic cells in gut-associated lymphoid tissues. Immunol Rev 234, 233–246 (2010).
Article CAS PubMed Google Scholar
Clausen, B. E. & Kel, J. M. Langerhans cells: critical regulators of skin immunity. Immunol Cell Biol 88, 351–360 (2010).
Article CAS PubMed Google Scholar
Dubrac, S., Schmuth, M. & Ebner, S. Atopic dermatitis: the role of Langerhans cells in disease pathogenesis. Immunol Cell Biol 88, 400–409 (2010).
Article PubMed Google Scholar
Aricigil, M. et al. New routes of allergen immunotherapy. Am J Rhinol Allergy 30, 193–197 (2016).
Article PubMed Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38, 904–909 (2006).
Article CAS PubMed Google Scholar
Yamaguchi-Kabata, Y. et al. Japanese population structure, based on SNP genotypes from 7003 individuals compared to other ethnic groups: effects on population-based association studies. Am J Hum Genet 83, 445–456 (2008).
Article CAS PubMed PubMed Central Google Scholar
Tian, C. et al. Analysis of East Asia genetic substructure using genome-wide SNP arrays. PLoS ONE 3, e3862 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
R_Core_Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/.
Mirai_Solutions_GmbH. XLConnect: Excel Connector for R. https://CRAN.R-project.org/package=XLConnect.
de Bakker, P. I. et al. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 17, R122–8 (2008).
Article PubMed PubMed Central CAS Google Scholar
Lee, D. et al. DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts. Bioinformatics 31, 3099–3104 (2015).
Article CAS PubMed PubMed Central Google Scholar
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Article ADS PubMed CAS Google Scholar
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat Genet 48, 1443–1448 (2016).
Article CAS PubMed PubMed Central Google Scholar
Loh, P. R., Palamara, P. F. & Price, A. L. Fast and accurate long-range phasing in a UK Biobank cohort. Nat Genet 48, 811–816 (2016).
Article CAS PubMed PubMed Central Google Scholar
Browning, S. R. & Browning, B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet 81, 1084–1097 (2007).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L. & Browning, S. R. Genotype Imputation with Millions of Reference Samples. Am J Hum Genet 98, 116–126 (2016).
Article CAS PubMed PubMed Central Google Scholar
Burdett, T. et al. The NHGRI-EBI Catalog of published genome-wide association studies. http://www.ebi.ac.uk/gwas (Accessed February 2, 2017).
Karolchik, D. et al. The UCSC Genome Browser Database: 2008 update. Nucleic Acids Res 36, D773–9 (2008).
Article CAS PubMed Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome Res 12, 996–1006 (2002).
Article CAS PubMed PubMed Central Google Scholar
Griffon, A. et al. Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape. Nucleic Acids Res 43, e27 (2015).
Article PubMed CAS Google Scholar
Carithers, L. J. et al. A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project. Biopreserv Biobank 13, 311–319 (2015).
Article PubMed PubMed Central Google Scholar
Zheng, X. et al. HIBAG–HLA genotype imputation with attribute bagging. Pharmacogenomics J 14, 192–200 (2014).
Article CAS PubMed Google Scholar
Khor, S. S. et al. High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references. Pharmacogenomics J 15, 530–537 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pappas, D. J., Marin, W., Hollenbach, J. A. & Mack, S. J. Bridging ImmunoGenomic Data Analysis Workflow Gaps (BIGDAWG): An integrated case-control analysis pipeline. Hum Immunol 77, 283–287 (2016).
Article CAS PubMed Google Scholar
Severin, J. et al. Interactive visualization and analysis of large-scale sequencing datasets using ZENBU. Nat Biotechnol 32, 217–219 (2014).
Article CAS PubMed Google Scholar
González-Galarza, F. F. et al. Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res 43, D784–8 (2015).
Article PubMed CAS Google Scholar
South, A. rworldmap: A New R package for Mapping GlobalData. The R Journal 3, 35–43 (2011).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, 113-0033, Japan
Seik-Soon Khor & Katsushi Tokunaga
EverGene Ltd., Shinjuku-ku, Tokyo, 163-1435, Japan
Ryoko Morino, Masanori Akita, Maiko Kawajiri, Azusa Kami, Asami Tada & Kenichi Ishikawa
StaGen Co., Ltd., Taito-ku, Tokyo, 111-0051, Japan
Kazuyuki Nakazono, Shigeo Kamitsuji, Naoyuki Kamatani & Todd A. Johnson
Life Science Group, Healthcare Division, Department of Healthcare Business, MTI Ltd., Shinjuku-ku, Tokyo, 163-1435, Japan
Tatsuya Yamasaki & Yuria Hoshi
LunaLuna Division, Department of Healthcare Business, MTI Ltd., Shinjuku-ku, Tokyo, 163-1435, Japan
Maaya Hine, Miki Kobayashi & Nami Kurume

Authors

Seik-Soon Khor
View author publications
You can also search for this author in PubMed Google Scholar
Ryoko Morino
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Nakazono
View author publications
You can also search for this author in PubMed Google Scholar
Shigeo Kamitsuji
View author publications
You can also search for this author in PubMed Google Scholar
Masanori Akita
View author publications
You can also search for this author in PubMed Google Scholar
Maiko Kawajiri
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuya Yamasaki
View author publications
You can also search for this author in PubMed Google Scholar
Azusa Kami
View author publications
You can also search for this author in PubMed Google Scholar
Yuria Hoshi
View author publications
You can also search for this author in PubMed Google Scholar
Asami Tada
View author publications
You can also search for this author in PubMed Google Scholar
Kenichi Ishikawa
View author publications
You can also search for this author in PubMed Google Scholar
Maaya Hine
View author publications
You can also search for this author in PubMed Google Scholar
Miki Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Nami Kurume
View author publications
You can also search for this author in PubMed Google Scholar
Naoyuki Kamatani
View author publications
You can also search for this author in PubMed Google Scholar
Katsushi Tokunaga
View author publications
You can also search for this author in PubMed Google Scholar
Todd A. Johnson
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.S.K. and T.A.J. performed analyses, made figures and tables, and wrote the manuscript. K.N. wrote programs and performed analyses. S.K. supervised the statistical and bioinformatics analyses. R.M. and M.Ka. performed genotyping. R.M. also performed genotype QC. M.Ko. and N.Ku. prepared the questionnaires. T.Y. and A.T. performed management of DNA specimens and ID tracking. M.A. and M.H. performed study design. A.K., Y.H. and K.I. recruited subjects. N.Ka. and K.T. supervised the statistical analysis and writing of the manuscript.

Corresponding author

Correspondence to Todd A. Johnson.

Ethics declarations

Competing Interests

T.A.J., S.K., N.Ka., S.K. are employees of StaGen Co. Ltd. R.M., M.A., M.Ka., T.Y., A.K., M.T., Y.H., A.T., K.I., M.H., M.Ko., and N.Ku. are employees of EverGene/MTI. EverGene/MTI has applied for patents related to the food allergy associations described in this report. N.Ka. is also Chairman of StaGen Co. Ltd. and a Director of the Tsukuba International Clinical Pharmacology Clinic. S.S.K. and K.T. declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Supplementary Worksheets S1-S6

Dataset S1

Dataset S2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Khor, SS., Morino, R., Nakazono, K. et al. Genome-wide association study of self-reported food reactions in Japanese identifies shrimp and peach specific loci in the HLA-DR/DQ gene region. Sci Rep 8, 1069 (2018). https://doi.org/10.1038/s41598-017-18241-w

Download citation

Received: 07 September 2017
Accepted: 07 December 2017
Published: 18 January 2018
DOI: https://doi.org/10.1038/s41598-017-18241-w

This article is cited by

A systematic two-sample and bidirectional MR process highlights a unidirectional genetic causal effect of allergic diseases on COVID-19 infection/severity
- Xiao-tong Chen
- Shuai Zhi
- Shi-tao Rao
Journal of Translational Medicine (2024)
Clinical Manifestations of Pediatric Food Allergy: a Contemporary Review
- Ling-Jen Wang
- Shu-Chi Mu
- Cheng-Hui Lin
Clinical Reviews in Allergy & Immunology (2022)
Fine-mapping and association analysis of candidate genes for papilla number in sea cucumber, Apostichopus japonicus
- Xinghai Zhu
- Ping Ni
- Zhenmin Bao
Marine Life Science & Technology (2022)
Association of HLA-DQ and IL13 gene variants with challenge-proven shrimp allergy in West Bengal, India
- Arghya Laha
- Amlan Ghosh
- Sanjoy Podder
Immunogenetics (2020)
Japanese GWAS identifies variants for bust-size, dysmenorrhea, and menstrual fever that are eQTLs for relevant protein-coding or long non-coding RNAs
- Tetsuya Hirata
- Kaori Koga
- Yutaka Osuga
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

GWAS of self-reported food reactions in the Japanese population

Associations with peach and shrimp allergy in the HLA region

Two-field HLA allele imputation analysis

Cross-reactivity for self-reported food reactions

Association analysis of previously reported SNPs

Discussion

Conclusion

Methods

Subject, sample, and phenotype data collection

Sample processing, genotyping and quality control

Principal component analysis (PCA)

Identification of duplicated samples

Definition of food allergy cases and controls

Statistical analysis and genotype imputation

Linkage disequilibrium (LD) statistics

In silico functional analysis of associated variants

eQTL analysis

HLA imputation

HLA statistical analysis

HLA-DQ gene expression in FANTOM5 data

HLA allele and haplotype frequency data

Figure plotting

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links