Keywords
Allergen, lipocalin, cross-reactivity, docking, in silico
Allergen, lipocalin, cross-reactivity, docking, in silico
Minor changes throughout, and new references to upgrade the discussion were added.
See the authors' detailed response to the review by Anna Pomés
Lipocalins are among the most important indoor/outdoor groups of animal allergens. For some, the protein structure has been resolved, but their functions are still elusive. Lipocalins generally display a low sequence identity between family members1, but the lipocalin allergens are usually well preserved and can present similar patches that, in addition to serum albumins, may contribute to allergic cross-reactions among furry animals2–4. Rodents, especially mice and rats, are cosmopolitan species present in rural, periurban, and urban areas, and are most often considered as pests. In addition, the presence of these species as pets in homes and their permanent use as animal models in research laboratories have allowed constant exposure to their allergens, which is an important source of sensitization5,6.
So far, only one rat allergen has been described, Rat n 1, which is a lipocalin formed by two fractions, a prealbumin and an α-2U-globulin, secreted by the liver and found in high concentrations in urine, but also in saliva and fur7,8. Among patients allergic to rats, 87% reacted to Rat n 1 in dust9. This molecule is glycosylated and, up to now, was known to have two isoforms: Rat n 1.01 (21 kDa) and Rat n 1.02 (17 kDa). Its structure is like a conventional lipocalin with eight antiparallel β chains forming a single beta sheet and an α helix to create a pocket for ligand binding, very similar to that of other allergens, such as Mus m 1, the mouse's main allergen, with a highly conserved identity5,10,11. Four regions with potential immunodominant T cell epitopes have been described, and three of these are co-localized with the conserved regions of lipocalin, similar to the epitopes found in Bos d 2. No B cell epitopes have been reported for Rat n 112.
Although its structure and sensitization capacity have been well described, little is known about its biological functions and how these may be related to hypersensitivity mediated by an IgE measured response and cross-reactivity with the main allergens of the most common domestic animals, dogs, cats, and horses13,14. Therefore, the objective of this study was to explore cross-reactive epitopes among Rat n 1 and homologues in domestic pets through an in silico approach.
The amino acid sequences of lipocalins from 5 domestic animals (Rat n 1, Mus m 1, Fel d 4, Can f 6, and Equ c 1) were selected based on the reported allergenic and phylogenetic capacity15. The sequences were obtained from the UniProt database (Table 1). Sequences that were reported by the World Health Organization (WHO)/International Union of Immunological Societies (IUIS) Allergen Nomenclature Subcommittee and had complete sequences were used. Identity grades among lipocalins used in this study were determined by using the PRALINE web server16. Parameters to perform alignment were set up to use BLOSUM62 as an exchange matrix. Three iterations were used, with an E-value of 0.01. Structural homology and root mean square deviation values were determined using UCSF Chimera (V. 1.13.1) and PDB Viewer software (v.4.10)17.
A model of the Fel d 4 allergen was made by homology using the SWISS-MODEL server. The quality of the model was analyzed by ProSA-web. The model was refined in DeepView v.4.1 (energy minimization and rotamer replacements). Its quality was evaluated by several tools, including Ramachandran graphs, WHATIF, QMEAN4 index, and energy values (GROMOS96 force field). Three-dimensional structures of Rat n 1 (PDB:2A2G), Mus m 1 (1MUP), Can f 6 (6NRE), and Equ c 1 (1EW3) were retrieved from the Protein Data Bank.
ElliPro and BepiPred tools were used to predict discontinuities and lineal epitopes on Rat n 118. With ElliPro, the 3D structure of Rat n 1 was used to predict epitopes. Minimum score and maximum distance (Angstrom) were set to 0.5 and 6.
Preparation of receptors and ligands was carried out using the freely available Discovery Studio Visualizer 2016. Treatment of the receptors consisted of extracting the ligand and eliminating water molecules and cofactors with which their crystalline structures are resolved, followed by preparation of the ligands, making corrections in the structures, generating variations, and eliminating unwanted structures. Adding hydrogen atoms, neutralizing charged groups, generating ionization and tautomer states, obtaining alternative chiralities, and optimizing geometries were carried out.
Using molecules identified as pheromones and the 3-dimensional molecular modeling of odorant binding protein (OBP1), docking studies were performed using SwissDock based on EADock DSS, in the following stages: (1) generation of binding modes in local and blind docking, (2) estimation of CHARMM force field energies with GRID, (3) binding of modes with the most favorable energies with FACTS and clusters, and (4) visualization of the most favorable clusters. The best-scoring docked models exhibiting the best superposition with ligands and lowest binding energy were analyzed and visualized with Chimera (V.1.13.1).
The Rat n 1 3D structure was submitted to the ConSurf server in order to generate evolutionarily related conservation scores to help to identify functional regions in the proteins. Functional and structural critical residues in Rat n 1 sequence were confirmed by the ConSeq server.
Multiple alignment among amino acid sequences from Rat n 1, Can f 6, Equ c 1, Fel d 4, and Mus m 1 was performed. A 62% identity was identified among sequences compared. Residues located on positions 29 to 73 showed the highest identity. The sequence alignments of the lipocalins showed that identical residues formed short continuous segments (Figure 1). A comparison of the secondary structural elements of Rat n 1 with the structures listed in Table 1 revealed backbone atomic RMSD values between 0.3 and 0.95 Å, with Mus m 1 showing the most closely related structure and sequence homology to Rat n 1. For all structures analyzed, the closest structural homology was found on the α-helical amino acid sequence spanning region on Rat n 1 containing nine conserved residues (IKEKFAK-L) (Figure 2). While these proteins showed the same overall fold change, some detailed structures contained differences, such as major structural differences located on loop regions for all allergens in this study.
Using ElliPro and BepiPred servers, four lineal and three discontinuous epitopes on Rat n 1 were predicted (Table 2 and Table 3). The first and third epitopes were located on α-helices, spanning residues 158–165 and 141–148 (Figure 3). Both epitopes had a surface area of 300 Å.
Epitope | Start | End | Peptide | Number of residues | Score |
---|---|---|---|---|---|
LE1 | 158 | 165 | EAHGITRD | 8 | 0.715 |
LE2 | 91 | 97 | YKTPEDG | 7 | 0.679 |
LE3 | 141 | 148 | RTKDLSSD | 8 | 0.659 |
LE4 | 24 | 36 | STRGNLDVAKLNG | 13 | 0.608 |
The third epitope was identified as being cross-reactive among Rat n 1, Mus m 1, Equ c 1, Can f 6, and Fel d 4. From all residues conforming with mapped epitopes, 80% were conserved and surface exposed among lipocalins analyzed in this study. The second and fourth epitopes were located on loop regions, spanning residues 91–97 and 24–36 with surface areas of 262 and 487 Å, respectively (Figure 3). Conservative analysis indicated that both regions were highly conserved in the lipocalin family (Figure 4). According to ConSurf analysis, the region covering the second lineal epitope is conserved among the lipocalin family.
The first and second discontinuous epitopes were constituted by 10 amino acid residues with a surface area of 375 Å; the first discontinuous epitope was distributed on G-H and F-E β-strands and loops connecting them,where as the third epitope was mapped to an α-helical, the same region where the first lineal epitope was located (Figure 5). This epitope contained 12 amino acid residues, and of these, 85% were surface exposed with a surface area of 487 Å.
Docking molecular simulations were conducted to reveal the binding site, identify the binding properties of four pheromones and the binding potential of Rat n 1. In the docked complexes (Figure 6 and Figure 7), the central region of the corresponding structures indicated a step involving cleavage of the protein with aromatic amino acids, specifically Tyr139, Tyr103, Phe73, Phe75, and Phe122. This docked position revealed that the aliphatic structures, pyrazine derivatives, and 2-sec-butyl-4,5-dihydrothiazole (SBT) had the lowest bond energies, and the least in 2-heptanone and 2-ethylhexanol (Table 4). Likewise, in 2,5-dimethylpyrazine the interactions of aromatic and aliphatic residues such as Met61, Leu88, Val101, Val137, and Tyr139 are described, which maintain binding to the pyrazine ring and methyl substituents in C2 and C5 (Figure 6A–B). In the case of pheromones such as 2-ethylhexanol, a structural arrangement at the site was shown to be in contact with the aromatic and hydroxyl groups in the structure, which were shown in residues as Phe73, Phe109, Phe122 and Tyr139, not greater than 3.3 Å in angular distance (Figure 6C). The SBT structure was in a specific orientation with the thiazoline ring in the proximal opening of the binding site. Likewise, the presence of hydrophobic interactions with apolar and aromatic residues with SBT has been established, in which alkyl-type bonds and π-alkyl are described in Met57, Val59, Leu88, and Leu124 with the thiazoline ring and structural side chain (see Figure 6D). In general, the rest of the carbon structure and the radical presentation of a structural orientation in the anterior site of the pocket in the opposite direction to the β-barrel demonstrated a relationship with the apolar residues between ethyl radicals and interactions type alkyl with Leu124, Leu135, and Val137. Similarly, 2-heptanone showed closer interaction with the protein cavity, predominantly by apolar and polar residues through hydrogen bonds, where it is common to see the relationship between the oxygen of the carbonyl group and aromatic residues of Phe75, Tyr103, and Val101; however, hydrophobic type alkylic junctions were shown with residues such as Leu124 and Val137 (Figure 7).
Animal allergens remain an important cause of sensitization and allergic diseases. Rodents such as rats are invasive cosmopolitan species that move between urban, periurban, and urban areas looking for favorable habitats and resources. The allergenicity of these species was first observed in animal caretakers and was considered an important source of occupational sensitization, affecting up to 15% of people in European countries with an active scientific community12. Besides exposure in occupational settings, rodent exposure also occurs in domestic environments, as was shown in inner-city children with asthma in the USA, where rat sensitization rates were 19–21%19. In contrast, a recent study from Europe reported a very low prevalence of sensitization to rodents of 0.59% in urban atopic populations without occupational exposure5. Rat n 1 is the largest allergen of this species and has been well characterized. This protein belongs to the lipocalin family, a transport protein of hydrophobic ligands as lipid signaling molecules. A first approach in epitope prediction on Rat n 1 was made by Bayard et al.7, using in silico tool Chou Fasman, however, it was impossible to determine epitopes. On the other hand, with the use of synthetic octapeptides an IgE linear epitope was mapped spanning sequence STRGNLDVAKLNG, reported in this study as LE4. Authors are clear in that discontinuous epitopes were impossible to identify using same technology.
Several major allergens are members of the lipocalin family, including those from the mouse (Mus m 1), dog (Can f 1 and Can f 2), cat (Fel d 4 and Fel d 7), horse (Equ c 1), cow (Bos d 2 and Bos d 5), hamster (Phod s 1), and rabbit (Ory c 1 and Ory c 4), among others3. The factors that give rise to so many lipocalins becoming inducers of allergy are unclear. Among the allergens, cross-reactivity has been observed, mainly in organisms to which people are most exposed, such as cats, dogs, and horses.
In this study, we observed by in silico analysis possible crossbet-reactivity between Rat n 1, Can f 6, Equ c 1, and Fel d 4, with 62% identity among sequences compared. In previous studies, we found a high conservation state among Rat n 1, Equ c 1, and Fel d 4 (60% identity) and described possible residues with antigenic potential15,20. Nilson et al.21 found cross-reactivity between Can f 6, Fel d 4, and Equ c 1 by inhibition assays, especially in residues located on positions 29 to 73, which showed the highest identity. In these positions, Rat n 1 also presented the highest identity with the other three lipocalins, and here we found a possible lineal epitope (LE4: Start 24 – End 36) that could explain the cross-reactivity. Jeal et al.12 studied a population of individuals exposed to laboratory rats to determinate the proliferative response of peripheral blood mononuclear cells to Rat n 1. They found four regions as possible immunodominant T cell epitopes, and three of them were localized within the conserved regions of the lipocalins. One was also found in our study as a possible lineal epitope (LE2: Start 91 – End 97), with a high identity with Mus m 1, Equ c 1, Can f 6, and Fel d 4. The homologous allergens may contribute to multisensitization and symptoms in individuals allergic to different animals. Also, cross-reactivity to T cell epitope was found between Can f 1 and human tear lipocalin22. This could support the autosensitization and increased inflammatory response mediated by T lymphocyte CD4+. Also, first T cell epitope predicted in this study shares identity with Bos d 2, a major allergen from cow23. This can explain cross reactivity among rat and others allergenic sources, such as: Can f 1 and Equ c 1, which has been related to share identity to T epitope level24,25. However, Bos d 2 has been characterized as a weak inducer of immunological response26.
The docking simulations demonstrated that the Rat n 1 cleft is big enough to accommodate the whole fatty acid molecule. Determining the capacity to bind some ligands by allergens is critical to understand their allergenic capacity. For Bet v 1, an allergen with capacity to link hydrophobic ligands, it has been determined that ligands such as lipids, iron, and calcium modulate its allergenicity capacity27,28. When Bet v 1 is properly loaded with iron, it can promote Th2 response. A similar property is reported for Pru p 3, a peach lipid transfer protein. Results reveal that the ligand is recognized by a type of cellular receptor called CD1d in the cell surface where the antigens appear, that is, substances able to provoke an immune system response to produce antibodies. CD1d is responsible for presenting lipid antigen to activating cells of the immune system called invariant natural killer T (iNKT) cells. Once activated, these iNKT cells produce substances that cause the characteristic symptoms of allergic disorders29. Since many allergens transport diverse compounds, the discovery of Pru p 3 lipid-ligand as an adjuvant to promote allergic sensitization through its recognition by CD1d expression opened new horizons29.
Of the lipocalin family, also, Mus m 1 has been characterized for pheromone linking. Experimental assays revealed that 2-sec-butyl-4,5-dihydrothiazole (SBT), 6-hydroxy-6-methyl-3-heptanone (HMH), and (±)-dehydro-exo-brevicomin (DHB) are ligands for Mus m 1. This was a first step in determining ligands with allergenic capacity30. Also, some ligands could influence the stabilization of IgE conformational epitopes27,31. Here, we predicted three. So, experimental assays are needed to determine the impact of ligands in Rat n 1 on inducing allergic responses. Resolution of 3D structure of Rat n 1 by X-ray, it helped to understand structural basis for linking ligands. According to Bocskei et al.32, residues Tyr24, Val58, Ala107 and Phe94, are critical to lipocalin activity. However, none of these residues was identified in docking assay.
For other allergens, such as Fel d 1, lipid ligands enhance TLR4 activation and innate immune signaling and promote airway hypersensitivity reactions in diseases such as asthma33. For Bla g 4, a lipocalin from Blattella germanica (German cockroach), a capacity to bind hydrophobic ligands such as: tyramine and octopamine has been characterized34. But residues involved in linking ligands in Rat n 1 are not conserved in Bla g 4, this is relevant because suggest a capacity to link different kinds of ligands in both allergens.
The outcomes of the current work include (1) a comprehensive understanding of the structure of Rat n 1 protein and structural similarities and differences between Rat n 1 and other lipocalins, and (2) a structural and molecular basis for the identification of epitopes responsible for cross-allergenicity between rat and domestic animal allergenic lipocalins. These epitopes may contribute significantly to designing rational strategies for diagnosis of and immunotherapy for domestic animal allergies.
All data underlying the results are available as part of the article and no additional source data are required.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Allergen and allergen immunotherapy.
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Allergen characterization and epitope mapping.
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Not applicable
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
No
References
1. Böcskei Z, Groom CR, Flower DR, Wright CE, et al.: Pheromone binding to two rodent urinary proteins revealed by X-ray crystallography.Nature. 1992; 360 (6400): 186-8 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Allergen characterization and epitope mapping.
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 2 (revision) 12 Dec 19 |
read | read |
Version 1 01 Oct 19 |
read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)