Next Article in Journal
Probiotic Supplementation in a Clostridium difficile-Infected Gastrointestinal Model Is Associated with Restoring Metabolic Function of Microbiota
Next Article in Special Issue
Isolation of a Psychrotolerant and UV-C-Resistant Bacterium from Elephant Island, Antarctica with a Highly Thermoactive and Thermostable Catalase
Previous Article in Journal
A Deeper Look into the Biodiversity of the Extremely Acidic Copahue volcano-Río Agrio System in Neuquén, Argentina
Previous Article in Special Issue
Complete Genome Sequence of Acidithiobacillus ferrooxidans YNTRS-40, a Strain of the Ferrous Iron- and Sulfur-Oxidizing Acidophile
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Novel Sequence Feature of SecA Translocase Protein Unique to the Thermophilic Bacteria: Bioinformatics Analyses to Investigate Their Potential Roles

Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8N 3Z5, Canada
*
Author to whom correspondence should be addressed.
Microorganisms 2020, 8(1), 59; https://doi.org/10.3390/microorganisms8010059
Submission received: 19 November 2019 / Revised: 24 December 2019 / Accepted: 25 December 2019 / Published: 29 December 2019
(This article belongs to the Special Issue Extremophiles and Extremozymes in Academia and Industries)

Abstract

:
SecA is an evolutionarily conserved protein that plays an indispensable role in the secretion of proteins across the bacterial cell membrane. Comparative analyses of SecA homologs have identified two large conserved signature inserts (CSIs) that are unique characteristics of thermophilic bacteria. A 50 aa conserved insert in SecA is exclusively present in the SecA homologs from the orders Thermotogales and Aquificales, while a 76 aa insert in SecA is specific for the order Thermales and Hydrogenibacillus schlegelii. Phylogenetic analyses on SecA sequences show that the shared presence of these CSIs in unrelated groups of thermophiles is not due to lateral gene transfers, but instead these large CSIs have likely originated independently in these lineages due to their advantageous function. Both of these CSIs are located in SecA protein in a surface exposed region within the ATPase domain. To gain insights into the functional significance of the 50 aa CSI in SecA, molecular dynamics (MD) simulations were performed at two different temperatures using ADP-bound SecA from Thermotoga maritima. These analyses have identified a conserved network of water molecules near the 50 aa insert in which the Glu185 residue from the CSI is found to play a key role towards stabilizing these interactions. The results provide evidence for the possible role of the 50 aa CSI in stabilizing the binding interaction of ADP/ATP, which is required for SecA function. Additionally, the surface-exposed CSIs in SecA, due to their potential to make novel protein-protein interactions, could also contribute to the thermostability of SecA from thermophilic bacteria.

Graphical Abstract

1. Introduction

Thermophilic organisms (bacteria) are of great scientific interest due to their ability to grow at temperatures well above 60 °C [1,2,3,4,5]. The thermostability of the proteins from model organisms such as Thermotoga maritima, which can survive within a wide temperature range of 55–90 °C, has been an area of intense research interest [5,6,7]. Thermostability of the protein has practical applications in industrial settings, biotechnologies, and bio-refining [1,2,3,4,5,8]. Specifically, within industrial settings, the higher temperature stability of these protein catalysts allows for reactions at higher temperatures resulting in decreased contamination concerns and overall faster reaction speeds [9]. An example in this regard includes widespread use of enzyme Thermus aquaticus (Taq) polymerase in the technique of polymerase chain reaction (PCR) [10]. Several comparative studies have shown that the thermostability of the proteins from thermophilic groups of organisms can be attributed to various characteristics [7,11,12]. One prevalent characteristic is the increase in the presence of ion-pair interactions in thermophilic organisms [11,13]. The increase in ion-pair interactions is due to a higher composition of charged amino acids such as lysine (Lys), arginine (Arg), glutamic acid (Glu) and asparagine (Asn) in the proteins from thermophilic bacteria when compared to those from mesophilic bacteria [11,13]. Sequence and structural characteristics such as presence of insertions and deletions, proline substitutions, closer packing of water-accessible surface residues, and increase in helical contents and hydrogen bonds has also been suggested to contribute towards increase in the thermostability of thermophilic proteins [14,15,16]. Evidently, the characteristics providing thermostability to proteins can exist in various forms, and further understanding protein features and characteristics that likely contribute towards the thermostability of proteins is of much interest [13,17].
Within the domain Bacteria, hyperthermophilic organisms are mainly present in three bacterial phyla viz. Aquificae, DeinococcusThermus, and Thermotogae [4,9,18,19,20,21]. The members from these phyla, which contain some of the most hyperthermophilic organisms known (e.g., Thermotoga maritima, Thermus aquaticus, and Aquifex aeolicus), are notable for their thermostable enzymes [1,2,5,9,22]. However, of these three phyla, while the phylum Aquificae is primarily comprised of thermophilic–hyperthermophilic organisms [18,20], in the other two phyla, hyperthermophilicity is a shared characteristics of species from only some orders (viz. Thermotogales and Thermales) of these bacteria [21,23,24,25,26]. Our earlier comparative analyses on protein sequences from members of these three phyla have identified large numbers of molecular markers in the form of conserved signature indels (CSIs) in different proteins which are specifically shared characteristics of different members from each of these three phyla (viz. Thermotogae, Aquificae, and DeinococcusThermus) of bacteria and their suborders [4,20,23,27,28,29,30]. Although the CSIs that we have previously identified provide very useful means for distinguishing members of these phyla from each other, as well as other groups of bacteria, it is unclear if any of these genetic/biochemical changes play any role in the thermostability of these organisms. However, in the present study we describe the identification and analysis of two large CSIs in the homologs of SecA proteins, which due to their unique shared presence in members from two different phyla of hyperthermophilic bacteria are indicated to play an important role in the thermostability of this protein.
SecA is a conserved ATPase whose homologs are well-preserved among all bacteria [31,32]. It constitutes a major molecular motor for the ATP-driven secretion of pre-proteins across the bacterial membrane in conjunction with the protein translocation complex SecYEG [33,34]. Previous biochemical and structural studies on SecA from thermophilic bacteria have contributed significantly towards understanding the overall architecture and function of the protein [35,36,37]. However, it remains unclear whether the presence of any unique sequence feature(s) in SecA contributes to its stability or for the functioning of SecA translocase at high temperatures. In the present study, we describe the identification and analysis of two large CSIs in the SecA proteins which are uniquely found in the SecA homologs from two different phyla of hyperthermophilic bacteria. One of these CSIs, consisting of an insert of 50 aa in a conserved region, is specifically found in the SecA homologs from all members of the order Thermotogales (phylum Thermotogae) and Aquificales (phylum Aquificiae), while another large 76 aa insert has uniquely shared characteristics of the SecA homologs from the order Thermales (phylum DeinococcusThermus) and in Hydrogenibacillus schlegelii, a thermophilic bacterium from the phylum Firmicutes. The shared presence of these large and unique sequence features in the SecA proteins from only the hyperthermophilic members of these phyla strongly suggests that the identified genetic/biochemical changes are related to the thermophilic characteristics of these organisms and they should be playing an important role in the thermostability of this protein.
We report here the results of phylogenetic studies examining the evolution of these large CSIs in the thermophilic organisms, as well as the sequence compositions, structural features, and location of these CSIs in the SecA protein structure using the homology models and available structural information. Lastly, to gain some insights into the functional significance of the 50 aa CSI in SecA protein (which is uniquely found in members of the orders Thermotogales and Aquificales), we have performed molecular dynamics (MD) simulation at two different temperatures (303.15 K and 363.15 K) using the available SecA structure (TmSecA) from T. maritima, which contains this large CSI. The results from MD simulation studies identify a conserved network of water molecules whose interaction with the bound nucleotide in the SecA protein is mediated/stabilized by certain conserved residues from this CSI in TmSecA. The significance of these observations in the context of the specificity of the identified CSIs for thermophilic organisms are discussed.

2. Materials and Methods

2.1. Identification of Conserved Signature Indels (Insertions/Deletions) and Phylogenetic Analysis

The described CSIs in the SecA proteins were identified as described in earlier work [28,38,39]. In brief, BLASTp searches were carried out on the SecA protein sequences from Thermotogae and other thermophilic organisms. Based on these BLASTp searches, sequences of SecA proteins were retrieved from 10–15 organisms representing different thermophilic phyla as well as a similar number of sequences from members of other bacterial phyla. Multiple sequence alignments (MSAs) of the retrieved sequences were created using ClustalX 2.1 [40,41]. These sequence alignments were examined for the presence of conserved inserts or deletions (i.e., indels), which were specifically found in thermophilic organisms, and which were flanked on both sides by at least 5 conserved residues in the neighboring 30–40 aa. More detailed BLASTp searches on the sequence regions containing the indels of interest and their flanking 40–50 aa were then conducted against the NCBI non-redundant (nr) database to determine the specificity of the identified indels. For the indels of interest, the signature files shown here were created using SIG_CREATE and SIG_STYLE programs (from www.GLEANS.net). Unless otherwise indicated, all of the reported CSIs are specific for the group of interest, and similar CSIs were not observed in homologs from any other bacterial species within the top 500 BLASTp hits examined.
For phylogenetic studies, protein sequences for SecA homologs for species from different groups of interest were obtained by means of BLASTp searches. A multiple sequence alignment of these protein sequences was constructed using ClustalX 2.1 [40]. Poorly aligned regions from the sequence alignment were removed using Gblocks 0.91b program [42], leaving 518 aligned amino acids (without any sequence gaps) for phylogenetic analysis. A maximum-likelihood phylogenetic tree based on 500 bootstrap replicates of this sequence alignment was constructed using MEGA 6 [43] based on the Jones–Taylor–Thornton (JTT) model [44].

2.2. Homology Modelling of SecA Homologs and Structural Analysis of CSIs

The homology models of the Thermotoga maritima and Thermus thermophilus SecA proteins lacking the 50 aa CSI were created using an in-house pipeline, “GlabModeller,” as described in earlier work [45,46,47,48]. The available crystal structures of SecA from T. maritima (PDB ID: 4YS0) and T. thermophilus (PDB ID: 2IPC) were used as templates [35,36]. In brief, for homology modeling, the sequence alignments between target and template proteins were carried out using the align2D module from the MODELLER, which is integrated and streamlined in the GlabModeller tool. The resulting alignments were carefully analyzed and modified manually to ensure the reliability of the location of CSIs. For each target protein, 500 models were ranked on the basis of their Discrete Optimized Protein Energy (DOPE) scores [49]. Selected models were then refined using ModRefiner [50]. The stereo-chemical properties of the final models were assessed using three independent servers which include RAMPAGE [51], ERRAT [52], PROSA [53,54], and VERIFY3D [55]. These applications utilize a dataset of refined structures to evaluate the statistical significance of the model conformation, location, and environment of each amino acid sequence and overall structural stability. The resultant models were then used to explore the structural changes associated with the CSI. The superimposition of the validated models with the template structures was carried out using PyMOL (Version 1.7.4; Schrödinger, LLC.) to examine the structure and location of identified CSIs in the SecA structures.

2.3. Molecular Dynamics Simulations

The all-atom molecular dynamics simulations were performed using (Groningen machine for chemical simulations) GROMACS 5.1.2 software [56,57] with the all-atom CHARMM36 force field for SecA, ADP, ions together with 3-points (TIP3P) water model and added ionic strength to mimic the physiological environment [58]. The atomic coordinates of the T. maritima SecA (PDB ID: 4YS0) resolved at 1.85 Å were obtained from the protein data bank (PDB) and the homology model of TmSecA lacking CSI was utilized. The potential energy of the system was minimized using a 50,000-step steepest descendent method to relax the system and to avoid any steric clashes. Missing amino acid residues in the crystal structure of TmSecA were identified and fixed using the MODELLER implemented in the GlabModeller tool. After energy minimization, the system was equilibrated with isothermal–isochoric/NVT (constant number of particles, volume, and temperature) ensemble and then 100 ns of MD simulation in the isothermal–isobaric/NPT (constant number of particles, pressure, and temperature) ensemble using the Nose–Hoover thermostat and Parrinello–Rahman barostat [59,60,61]. Comparative analysis of the difference in binding affinity of ADP towards T. maritima SecA with CSI and without CSI was carried out. The T. maritima SecA with ADP was simulated under two different reference temperatures of 303.15 K and 363.15 K. These large temperature gaps were selected to investigate the effect of temperature on the TmSecA dynamics during the simulation period. The root mean square deviation (RMSD) and hydrogen bond interaction calculations were carried out using the GROMACS utilities. All MD simulation runs were carried out using our local GROMACS certified graphical processing unit (GPU) accelerated high-performance computing system obtained from EXXACT Corporation [62]. A total of 1000 snapshots were extracted for every 100 ps from the 100 ns MD trajectories to analyze the dynamics of water molecules near the CSI-containing region. The analyses of the structures obtained from trajectories were carried out using the various utilities of the GROMACS [56], VMD [63], and PyMOL (www.pymol.com).

3. Results

3.1. Identification of Conserved Signature Indels in SecA Homologs from Thermotogales, Aquificales, and Thermales and their Phylogenetic Implications

Our comparative analysis of SecA protein sequences from different bacterial groups has identified several CSIs in SecA protein that are specific for particular groups/taxa of bacteria. However, the present study focuses on our identification of two large CSIs in the SecA protein, which are uniquely found in different homologs from the thermophilic–hyperthermophilic phyla of bacteria. The first of these CSIs shown in Figure 1 is a 50 aa insertion in SecA homologs that is uniquely shared by members of the order Thermotogales and Aquificales. As seen from Figure 1, within the phylum Thermotogae, this CSI is a shared characteristic of all members from the order Thermotogales, but it is not found in any of the species from the orders Peterotogales and Kosmotogales. It should be noted in this regard, that within the phylum Thermotogae, the order Thermotogales encompasses all of the thermophilic–hyperthermophilic organisms, whereas the other two orders lacking this CSI are comprised of mesophilic organisms [4,23,26,64]. Thus, within the phylum Thermotogae, this CSI is uniquely found in the organisms which are thermophilic–hyperthermophilic [27,28]. Interestingly, in addition to the Thermotogales, this large CSI is also commonly shared by different species belonging to the order Aquificales from the phylum Aquificae, which is also comprised exclusively of hyperthermophilic organisms. However, within this phylum members belonging to the order Desulfurobacteriales, which are strict anaerobes [20,65], do not contain this large insert.
The second large CSI in SecA protein that we have identified is a 76 aa insertion in a conserved region (see Figure 2), which is commonly shared by all SecA homologs from members of the order Thermales belonging to the phylum DeinococcusThermus. The phylum DeinococcusThermus contains two extensively studied orders of extremophilic microorganisms i.e., Deinococcales and Thermales [21,30]. Of these two orders, the order Thermales is comprised exclusively of organisms that are thermophilic and hyperthermophilic [21,30], whereas members of the order Deinococcales are known for their high degree of radiation resistance [24,30,66]. Interestingly, this large insert in SecA is found only in different members of the order Thermales but not in any of the homologs from Deinococcales. Further, in addition to the members from the order Thermales, this insert in SecA is also commonly shared by the species Hydrogenibacillus schlegelii, which is a thermophilic bacterium belonging to the phylum Firmicutes [67,68,69]. Thus, both these large CSIs in SecA are only found in members of different main orders/phyla of bacteria that contain thermophilic-hyperthermophilic organisms. Except for the thermophilic-hyperthermophilic organisms, these inserts are not present in any other SecA homologs. Thus, the indicated CSIs were further examined in the context of their role in thermostability.

3.2. Phylogenetic Analysis of the SecA Proteins to Investigate the SHARED Presence of CSIs

As both these large CSIs are present in only the thermophilic members from two different phyla of bacteria, it was of much interest to investigate how the shared presence of these genetic changes in two distinct groups/phyla of bacteria could be explained. Based on earlier work, horizontal gene transfers are indicated to occur frequently between members of the bacterial phyla that contain thermophilic–hyperthermophilic organisms [20,26,28,70]. Thus, we have examined whether the shared presence of these CSIs in these two cases is due to horizontal transfers of SecA gene between the two groups of organisms which contain either the 50 aa or the 76 aa inserts. To investigate this, we have constructed a maximum-likelihood phylogenetic tree based on SecA homologs from different relevant bacterial groups/phyla (Figure 3). In this tree, members from the phyla Thermotogae and Aquificae form distinct clades and the two orders of these bacteria viz. Thermotogales and Aquificales, which contained the 50 aa insert are separated from each other by other members of these phyla, which do not contain the 50 aa insert. Similarly, members of the order Thermales and the Firmicutes species H. schlegelli, both of which contained the 76 aa insert, also did not cluster together in the phylogenetic tree. Instead, members of the order Thermales branched with other members from the phylum DeinococcusThermus, whereas H. schlegelli branched within a cluster of other species from the phylum Firmicutes. If the shared presence of the 50 aa CSI and 76 aa CSI in the two indicated groups of organisms was due to horizontal gene transfers, then it was expected that the SecA genes from these CSI-containing organisms should have clustered together in the tree. However, as the observed branching pattern of the CSIs-containing organisms is contrary to this expectation, it strongly suggests that the shared presence of the large CSIs in either the order Thermotogales and Aquificales, or in Thermales and H. schlegelli, is not due to horizontal gene transfers. Instead the results obtained suggest that the genetic changes leading to these large inserts have occurred independently (i.e., convergent evolution) in these lineages due to their presumed selective advantage.

3.3. Computational Analysis of the CSIs in SecA Proteins

As the two large CSIs in the SecA proteins are found exclusively in the bacterial groups/orders that consist entirely of thermophilic and hyperthermophilic organisms, it strongly suggests that they play some role in the thermostability of the SecA protein. Hence, exploration of the functional characteristics of these CSI could provide some insights into the thermostability of the SecA protein. Based on earlier studies, thermostability of proteins is enhanced by an increase in the charged amino acids that facilitate increased ion-pair interactions and stabilize the protein in high entropy environment [11,13]. The results of our analyses (Figure S1) indicate that both the large CSIs in the SecA protein contains a higher proportion of charged amino acids such as Glu, Arg, and Lys, which are known to facilitate increased ion-pair interactions.
The crystal structures of SecA are available from both mesophilic bacteria (Bacillus subtilis and Mycobacterium tuberculosis), and from thermophilic bacteria (Thermus thermophilus (PDB ID: 2IPC) and Thermotoga maritima (PDB ID: 4YS0)) [36,37,71,72,73]. The catalytic core of SecA protein is comprised of five functionally essential domains, which are shown in Figure 4A in the available crystal structure of the T. maritima SecA (TmSecA) [35]. The different domains of SecA protein are highlighted using different color shades: cyan as a nucleotide-binding domain (NBD1), magenta as nucleotide-binding domain 2 (NBD2), red as a pre-protein binding domain (PPXD), yellow as helical wind domain (HWD), and green as helical scaffold domain (HSD). We have also created a homology model of the TmSecA protein lacking the 50 aa CSI, using the protocol described in the Methods section, for the comparative analyses of structural features and location of this conserved insert (Figure 4B). As can be seen, the 50 aa conserved insert in TmSecA specific for Thermotogales and Aquificales is located in a surface-exposed loop region of the NBD1, which forms a part of the ATP-binding site in the protein [35,74]. The insert protrudes to form two additional β-strands at the periphery of the NBD1 domain with two short α-helices connected by a loop [37]. Although this surface-exposed CSI in the structure of TmSecA is located in close proximity to the bound ADP molecule, it does not interact directly with bound ADP, nor it makes any contact with the SecY channel [37,75]. We have also mapped the location of the 76 aa CSI specific for the order Thermales and H. schlegelii using the available crystal structure of SecA from Thermus thermophilus SecA (TtSecA) (PDB: 2IPC) [36]. As can be seen from Figure 4C, this large CSI is also located in a surface-exposed loop region of the TtSecA at the periphery of NBD2. However, unlike the 50 CSI in Thermotogales and Aquificales, this CSI is not in close proximity to the ADP–ATP binding site on the protein. However, due to its location, this CSI, through its role in enabling intramolecular ionic interactions, could be playing a role in stabilizing dimer formation at higher temperatures.

3.4. Molecular Dynamics (MD) Simulation Studies of SecA Containing 50 aa CSI Specific for Thermotogales and Aquificales: Analysis of TmSecA Conformational Stability and Flexibility

In view of the location of the 50 aa CSI in the SecA of Thermotogales and Aquificales in close proximity to the ADP–ATP binding site, we have carried out MD simulation studies to gain some insights into the function of this CSI. In this regard, we have initially investigated the dynamic of the 50 aa CSI’s flexibility from the trajectories obtained from the MD simulation studies on TmSecA structure with CSI (+CSI) and without CSI (−CSI) at two different temperature settings. The detailed protocol of the system setup for MD simulations is described in the Methods section. In total, four simulation runs were carried out using TmSecA (+CSI) and TmSecA (−CSI), each for 100 ns at two temperature settings of 303.15 K (30 °C) and 363.15 K (90 °C). A preliminary analysis was carried out to analyze the overall deviation of TmSecA (+CSI) and TmSecA (−CSI) relative to their native structures as a function of time using the trajectories along the 100 ns time scale at these temperatures. At 303.15 K, TmSecA (+CSI) appear relatively more stable with an average RMSD of <0.20 nm (<2.0 Å) (±0.04 nm) relative to 0.39 nm (3.9 Å) (±0.08 nm) for TmSecA (−CSI) (Figure S2A). Similarly, at 363.15 K, an average RMSD value of 0.34 nm (3.4 Å) (±0.5 nm) was observed for TmSecA (+CSI) relative to an average value of 0.36 nm (3.6 Å) (±0.6 nm) for its insertion truncated homolog TmSecA (−CSI) (Figure S2B). Although at the high temperature the differences in RMSD values are minimal, TmSecA (−CSI) lacking the CSI showed a much higher degree of fluctuation, in comparison to the protein with the CSI, indicating that the TmSecA (+CSI) was more stable at the higher temperature over the simulation period.

3.5. Identification of Conserved CSI-mediated Water Network in TmSecA

The crystal structure of TmSecA with ADP-bound contains a number of bound water molecules near the ADP-binding site [35]. Of these water molecules, few forms an intermediate interaction between the adenine group of ADP to the backbone of the residues (Glu 185 and Val 186) from the 50 aa CSI in TmSecA which is located near the ADP-binding site. This observation is of much interest, as several earlier studies indicate that water molecules make significant contribution towards the binding affinity of the ligand or mediating protein-ligand complexes by forming bridges between the protein and the ligand [76,77,78,79,80]. To investigate this, using the protocol described in the Methods section, computational analyses of the MD simulation trajectories were carried out to determine whether the 50 aa CSI in TmSecA might be interacting with the ADP molecule by forming intermediate interactions with water molecules.
Initially, we analyzed the hydration of the ADP-binding site by calculating the presence of a number of water molecules within the 9 Å from ADP during the entire simulation of 100 ns at 303.15 K and 363.15 K (Figure S3). At 303.15 K, the increase in the number of water molecules increased in case of TmSecA (+CSI) after 50 ns of the simulation, and overall it contained more water molecules (with an average of total 129 molecules) when compared with TmSecA (−CSI) that contained an average total of 103 water molecules. However, at 363.15 K, the number of water molecules around the ADP-binding site for TmSecA (+CSI) decreased after 50 ns of simulation time to an average total of 103 water molecules, whereas their numbers in TmSecA (−CSI) slightly increases to an average total number of 112 water molecules.
Further analyses of the MD trajectories identifies a network of water molecules that shows a high degree of conservancy and stable occupancy near the loop residues (amino acid range 183 to 188) from the 50 aa CSI and these water molecules formed interactions with the adenine group of ADP similar to that observed in the crystal structure of TmSecA [35]. In a series of snapshots extracted at different time intervals from 100 ns MD trajectories of TmSecA (+CSI) at 303.15 K (Figure 5A) and 363.15 K (Figure 5B), we show the coordinates of water molecules, which constantly occupy the location near the backbone of residues (amino acids 185–188) from this insert. For comparisons, observed positions of the water molecules in the TmSecA crystal structure are superimposed in this figure and they are shown as magenta spheres. Any hydrogen bonds formed between simulation water molecules (red and white spheres), and adenine group of ADP or residues from the insert are shown as yellow dash lines. As can be seen, at both temperatures a network involving two to three water molecules are maintained throughout the simulation period. Although most of the water molecules are highly mobile, spending only a fraction of time in that position, interestingly the other water molecules that displace them occupy the same position and forms a network of water-mediated interactions that are highly similar to those observed in the crystal structure of TmSecA protein [35].
The time evolution of the hydrogen bond interactions calculated between Glu185 and water molecules that are within the 4 Å of Glu185 over the course of 100 ns MD trajectory at 303.15 K and 363.15 K is shown in Figure 5C. It is of interest to note that the residue glutamic acid (Glu185) present in the loop region (residues 183–188) of this 50 aa insert is found to be conserved among SecA homologs from all members of the orders Thermotogales and Aquificales. As can be seen in Figure 5C, the backbone of Glu185 residue is involved in bridging the two hydrogen bond interactions with water molecules throughout the simulation at both temperatures.

4. Discussion

Thermophilic bacteria often exhibit many evolutionary adaptations that aids to retain the function of their proteins at very high temperatures [7,11]. These evolutionary adaptions such as ion-pair interactions, insertions and deletions, hydrogen bonds, and salt bridges ultimately result in increased intramolecular interactions and provide protein stability in the high entropy conditions [9,11,13,81,82]. In addition, a higher degree of close packing of water-accessible residues on the surface of the proteins and contact orders has been reported in thermophilic bacteria compared to their mesophilic homologs [7,16]. However, despite the significant progress that has been made toward understanding of the structural peculiarities of thermophilic proteins, no unique or single mechanism has been found responsible, instead, a set of factors or their various combinations has been suggested to contribute towards the thermostability of a protein [5,9,81,82,83,84,85].
In the current work, we describe the identification of two large and unique molecular sequence features in the form of CSIs that are uniquely present in the SecA homologs from different thermophilic and hyperthermophilic members from all three main phyla of bacteria (viz. Thermotogae, Aquificae and DeinococcusThermus), which harbor most such organisms [4,9,18,19,20,21]. Earlier studies on CSIs in different proteins show that these kind of rare genetic changes play important (or essential) roles in the functioning of the proteins within the CSI-containing organisms [66,86] and any significant changes in these genetic characteristics are incompatible with their cellular function/growth [86]. In view of the specificities of the identified CSIs in the SecA homologs for thermophilic–hyperthermophilic organisms, and the importance of such genetic changes, it is of much interest to understand what unique functions/roles these large CSIs play in the indicated groups/phyla of thermophilic bacteria.
Results presented here show that the two large CSIs in SecA protein are commonly shared by members from two different groups/phyla of organisms. While the 50 aa CSI in SecA is a uniquely shared characteristic of the members of the order Thermotogales and Aquificales, the 76 aa CSI is specific for the order Thermales and H. schlegelii, a species from the phylum Firmicutes. In both cases, the identified CSIs are present in two unrelated groups of bacteria in the same location in SecA protein. Further, these CSIs are of the same (or similar) lengths and exhibit a high degree of conservation in their amino acid sequences. As horizontal gene transfers among thermophilic organisms are indicated to occur frequently [20,26,28,70], the simplest explanations to account for the presence of these large genetic characteristics in members from two unrelated phyla of bacteria would be that the genetic changes leading to either the 50 aa or 76 aa CSIs initially occurred in one of the lineages of thermophilic bacteria and then the SecA genes containing these CSIs were laterally transferred to the other phyla of thermophilic bacteria containing very similar CSIs. However, the phylogenetic analyses of SecA protein sequences do not support the view that the shared presence of these CSIs in the unrelated groups of thermophiles is due to lateral gene transfers. To account for the observed results, it is likely that the genetic changes leading to these large CSIs have occurred independently in two unrelated groups of thermophiles due to selective (pressure) advantageous functions of these insertions in the growth/survival of indicated groups of thermophilic organisms at high temperature. However, given the large sizes of both these CSIs and the observed high degree of sequence conservation within their sequences, the possibility that these genetic changes have occurred independently in different lineages is unusual and unexpected. Hence, another possibility to account for these results is that instead of the horizontal transfer of the entire SecA genes, only the genetic exchanges or recombination of specific segments of SecA genes containing these CSIs have occurred between the indicated specific groups of thermophiles organisms [26]. As the rest of the SecA gene has evolved independently in these lineages, it will account for the distinct branching of organisms containing these CSIs in a phylogenetic tree based on SecA protein sequences.
It is of interest that within the phylum Aquificae, the 50 aa CSI is only shared by members of the order Aquificales but not by species from the order Desulfurobacteriales, which are also thermophilic. However, an important difference between members of the orders Aquificales and Desulfurobacteriales is that while the former order is comprised of species that are aerobic or microaerophilic and obtain energy from hydrogen or reduced sulfur compounds through molecular oxygen, the members of the latter order are strict anaerobes and they obtain energy by reduction of sulfate, nitrate, elemental sulfur, or other compounds by molecular hydrogen [20,29,87,88]. Additionally, the members of these two orders are known to possess different metabolic pathways as their environments have different metabolic requirements [20,65]. These observations suggest that the 50 aa CSI in the SecA protein may confer selective advantage (thermostability) only under aerobic conditions.
However, irrespective of the evolutionary mechanisms that underlie the shared presence of these CSIs, based on the specific presence of these large CSIs in SecA homologs from only the thermophilic-hyperthermophilic organisms, it is strongly anticipated that these CSIs should play an important role in the thermostability of this protein at high temperature. Results from our structural analyses show that both these CSIs are located on the surface-exposed loops in a functionally important domain (NBD1) of the SecA protein. Of the two CSIs in SecA, the 50 aa CSI, which is specific for the order Thermotogales and Aquificales, is located in a surface-exposed loop that lies in close proximity to the ADP/ATP-binding site in the protein. Our comparison of the crystal structure of T. maritima SecA containing the 50 aa CSI with its homology model lacking the 50 aa CSI has revealed that a number of water molecules form an intermediate interaction between the residues from this CSI and ADP molecule. Although the roles of conserved water molecules in mediating protein-ligand interactions have been increasingly recognized in recent years [76,78,79,80,89,90,91,92], our understanding of the significance of these conserved waters in SecA, or how other sequence features of the protein contribute towards their conservation, is limited. To investigate this aspect, molecular dynamics (MD) simulations were carried out to examine the structural dynamics of the 50 aa CSI in SecA protein and the water molecules found in its proximity in the crystal structure of TmSecA at two different temperatures (303.15 K, and 363.15 K). The results from MD studies identify a network of highly stable water molecules that form an intermediate interaction between the residues such as Glu185 from the 50 aa CSI and adenine group of ADP at both temperatures. Earlier studies have indicated that the hydrogen bonding capability of water molecules with charged amino acids contributes towards the stability of hydrogen bonds formed between coenzymes/cofactors like ADP/ATP and proteins [14,93,94,95,96,97]. In view of these earlier studies, the high residence time of the water molecules inside the cavity formed by the 50 aa CSI, and their forming a conserved hydrogen bonding network with some conserved residues from the CSI during the course of simulation, strongly suggests that the 50 aa CSI likely plays a role in maintaining the constant stable network of water molecules which likely plays a role in stabilization of the ADP–ATP molecules in the active site of the protein at high temperature. Since ATP binding and hydrolysis is essential for the functioning of SecA protein, it is possible that the observed conserved hydrogen bonding network between the water molecules, residues from the CSI and ADP (ATP), helps in stabilizing the binding of ATP to the protein at high temperature.
However, the suggested role of the 50 aa CSI in stabilizing the binding of ADP/ATP to the protein at high temperature could be only one of the several factors by which this large CSI might be contributing towards the thermostability of the organisms for which they are specific. As noted earlier, these large CSI as well as the 76 aa CSI, specific to the member of the order Thermales and H. schlegelli, are present as surface-exposed loops in the structure of SecA protein. Based on earlier work, the surface loops in proteins are often involved in mediating novel protein-protein or protein-ligand interactions or in maintaining/stabilizing a specific oligomeric state of the proteins [45,46,47,86,98,99,100,101]. Earlier structural and functional studies also indicate that SecA can adopt a wide variety of oligomeric states [102,103,104,105] and it is possible that the presence of these CSI could stabilize certain oligomeric forms that are of relevance for the functioning of this protein in a thermophilic environment. In addition, most thermophilic and hyperthermophilic bacteria lacks SecB, a molecular chaperone protein which plays an important role in transferring pre-protein to SecA and SecYEG translocation system in most other bacteria [106,107,108]. Thus, it is likely that members of thermophilic and hyperthermophilic phyla utilize other chaperons or proteins for similar functions [107]. Although other novel proteins that may be required for the functioning of SecA protein in thermophilic organisms have not yet been characterized, it is possible that the presence of the unique surface-exposed loops formed by the CSIs in the SecA proteins from thermophilic organisms could serve as a platform for the unique binding of these proteins with the SecA. Overall, the results provided in this study highlights the unique sequence and structural features of SecA protein specific to the thermophilic and hyperthermophilic bacterial group. Our structural and computational analysis of these novel sequence features also provides some insights into the possible functions of one of these large CSIs in the context of the thermostability of the protein. However, further understanding of the functional significance of these large CSIs in the functioning of this protein in thermophilic organisms and conferring thermostability will only emerge from future detailed genetic and biochemical studies.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2607/8/1/59/s1. Figure S1: A Web logo representation of the sequence characteristics of the amino acids in the 50 aa and 76 aa CSIs. Figure S2: The root-mean-square-deviation (RMSD) values calculated relative to the starting structure for the Thermotoga maritima SecA (TmSecA) with 50 insert (+CSI) and without 50 aa insert at 305.15 K and 363.15 K, over the 100 ns of molecular dynamics (MD) simulation trajectories. Figure S3: Trajectories for the occupancy of number of water molecules calculated by measuring the number of water molecules that are located within the 9 Å from ADP during the entire molecular dynamics (MD) simulation period of 100 ns. Figure S4A,B: Randomly picked snapshots at different time intervals extracted from the 100 ns MD trajectories of TmSecA (+CSI) at 303.15 K and 363.15 K showing the occupancy and interaction of water molecules with residue (Glu185) from 50 aa CSI and ADP.

Author Contributions

B.K. and D.P. carried out identification of CSIs, phylogenetic analysis, homology modelling and localization of the CSIs in protein structures, and preparation of a draft manuscript; B.K. carried out the molecular dynamics (MD) simulation and analysis of the results. R.S.G., planning and supervision of the work, checking the identified CSIs for their specificities, obtained funding for the project, and writing and finalizing of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the research grant RGPIN-2019-06397 from the Natural Sciences and Engineering Research Council of Canada to Radhey S. Gupta.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Wiegel, J.; Ljungdhal, L.G.; Demain, A.L. The Importance of Thermophilic Bacteria in Biotechnology. Crit. Rev. Biotechnol. 1985, 3, 39–108. [Google Scholar] [CrossRef]
  2. Lasa, I.; Berenguer, J. Thermophilic Enzymes and Their Biotechnological Potential. Microbiologia 1993, 9, 77–89. [Google Scholar] [PubMed]
  3. Turner, P.; Mamo, G.; Karlsson, E.N. Potential and Utilization of Thermophiles and Thermostable Enzymes in Biorefining. Microb. Cell Fact. 2007, 6, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Bhandari, V.; Gupta, R.S. Molecular Signatures for the Phylum (class) Thermotogae and a Proposal for Its Division into Three Orders (Thermotogales, Kosmotogales ord. nov. and Petrotogales ord. nov.) Containing Four Families (Thermotogaceae, Fervidobacteriaceae fam. nov., Kosmotogaceae fam. nov. and Petrotogaceae fam. nov.) and a New Genus Pseudothermotoga gen. nov. with Five New Combinations. Antonie Van Leeuwenhoek 2014, 105, 143–168. [Google Scholar]
  5. Podar, M.; Reysenbach, A.L. New Opportunities Revealed by Biotechnological Explorations of Extremophiles. Curr. Opin. Biotechnol. 2006, 17, 250–255. [Google Scholar] [CrossRef]
  6. Huber, R.; Langworthy, T.A.; König, H.; Thomm, T.; Woese, C.R.; Sleytr, U.B.; Stetter, K.O. Thermotoga Maritima sp. nov. Represents a New Genus of Unique Extremely Thermophilic Eubacteria Growing up to 90 °C. Arch. Microbiol. 1986, 144, 324–333. [Google Scholar] [CrossRef]
  7. Robinson-Rechavi, M.; Godzik, A. Structural Genomics of Thermotoga Maritima Proteins Shows that Contact Order is a Major Determinant of Protein Thermostability. Structure 2005, 13, 857–860. [Google Scholar] [CrossRef] [Green Version]
  8. Sadaf, A.; Fatima, S.W.; Khare, S.K. Sporotrichum Thermophile Xylanases and Their Biotechnological Applications. In Fungi in Extreme Environments: Ecological Role and Biotechnological Significance; Tiquia-Arashiro, S., Grube, M., Eds.; Springer: Cham, Switzerland, 2019; pp. 307–328. [Google Scholar]
  9. Vieille, C.; Zeikus, G.J. Hyperthermophilic Enzymes: Sources, Uses, and Molecular Mechanisms for Thermostability. Microbiol. Mol. Biol. Rev. 2001, 65, 1–43. [Google Scholar] [CrossRef] [Green Version]
  10. Chien, A.; Edgar, D.B.; Trela, J.M. Deoxyribonucleic acid Polymerase from the Extreme Thermophile Thermus aquaticus. J. Bacteriol. 1976, 127, 1550–1557. [Google Scholar]
  11. Sterner, R.; Liebl, W. Thermophilic Adaptation of Proteins. Crit. Rev. Biochem. Mol. Biol. 2001, 36, 39–106. [Google Scholar] [CrossRef]
  12. Zhou, X.X.; Wang, Y.B.; Pan, Y.J.; Li, W.F. Differences in Amino acids Composition and Coupling Patterns between Mesophilic and Thermophilic Proteins. Amino. Acids 2008, 34, 25–33. [Google Scholar] [CrossRef] [PubMed]
  13. Szilagyi, A.; Zavodszky, P. Structural Differences between Mesophilic, Moderately Thermophilic and Extremely Thermophilic Protein Subunits: Results of a Comprehensive Survey. Structure 2000, 8, 493–504. [Google Scholar] [CrossRef] [Green Version]
  14. Vogt, G.; Woell, S.; Argos, P. Protein Thermal Stability, Hydrogen Bonds, and Ion Pairs. J. Mol. Biol. 1997, 269, 631–643. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Russell, R.J.; Gerike, U.; Danson, M.J.; Hough, D.W.; Taylor, G.L. Structural Adaptations of the Cold-Active Citrate Synthase from an Antarctic Bacterium. Structure 1998, 6, 351–361. [Google Scholar] [CrossRef]
  16. Glyakina, A.V.; Garbuzynskiy, S.O.; Lobanov, M.Y.; Galzitskaya, O.V. Different Packing of External Residues Can Explain Differences in the Thermostability of Proteins from Thermophilic and Mesophilic Organisms. Bioinformatics 2007, 23, 2231–2238. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Gonzalez-Siso, M.I. Editorial for the Special Issue: Thermophiles and Thermozymes. Microorganisms 2019, 7, 62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Reysenbach, A.-L. Phylum BI. Aquificae phy. nov. In Bergey’s Manual of Systematic Bacteriology; Boone, D.R., Castenholz, R.W., Eds.; Springer: Berlin, Germany, 2001; pp. 359–367. [Google Scholar]
  19. Reysenbach, A.-L. Phylum BII. Thermotogae phy. nov. In Bergey’s Manual of Systematic Bacteriology; Boone, D.R., Castenholz, R.W., Eds.; Springer: Berlin, Germany, 2001; pp. 369–387. [Google Scholar]
  20. Gupta, R.S. The Phylum Aquificae. In The Prokaryotes: Prokaryotic Biology and Symbiotic Associations; Rosenberg, E., DeLong, E.F., Thompson, F., Lory, S., Stackebrand, E., Eds.; Springer: Berlin, Germany, 2014; pp. 418–445. [Google Scholar]
  21. Albuquerque, L.; Costa, M.S. The Family Thermaceae. In The Prokaryotes-Other Major Lineages of Bacteria and the Archaea; Rosenberg, E., DeLong, E., Lory, S., Stackebrandt, E., Thompson, F., Eds.; Springer: New York, NY, USA, 2014; pp. 955–987. [Google Scholar]
  22. Escuder-Rodriguez, J.J.; DeCastro, M.E.; Cerdan, M.E.; Rodriguez-Belmonte, E.; Becerra, M.; Gonzalez-Siso, M.I. Cellulases from Thermophiles Found by Metagenomics. Microorganisms 2018, 6, 66. [Google Scholar] [CrossRef] [Green Version]
  23. Bhandari, V.; Gupta, R.S. Phylum Thermotogae. In The Prokaryotes—Other Major Lineages of Bacteria and the Archea; Rosenberg, E., DeLong, E.F., Lory, S., Stackebrandt, E., Thompson, F., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 989–1015. [Google Scholar] [CrossRef]
  24. Rosenberg, E. The family Deinococcaceae. In The Prokaryotes- Other Major Lineages of Bacteria and the Archaea; Rosenberg, E., DeLong, E., Lory, S., Stackebrandt, E., Thompson, F., Eds.; Springer: New York, NY, USA, 2014; pp. 613–615. [Google Scholar]
  25. Huber, R.; Hannig, M. Thermotogales. Prokaryotes 2006, 7, 899–922. [Google Scholar]
  26. Pollo, S.M.; Zhaxybayeva, O.; Nesbo, C.L. Insights into Thermoadaptation and the Evolution of Mesophily from the Bacterial Phylum Thermotogae. Can. J. Microbiol. 2015, 61, 655–670. [Google Scholar] [CrossRef] [Green Version]
  27. Griffiths, E.; Gupta, R.S. Molecular Signatures in Protein Sequences that are Characteristics of the Phylum Aquificae. Int. J. Syst. Evol. Microbiol. 2006, 56, 99–107. [Google Scholar] [CrossRef]
  28. Gupta, R.S.; Bhandari, V. Phylogeny and Molecular Signatures for the Phylum Thermotogae and Its Subgroups. Antonie Van Leeuwenhoek 2011, 100, 1–34. [Google Scholar] [CrossRef] [PubMed]
  29. Gupta, R.S.; Lali, R. Molecular Signatures for the Phylum Aquificae and Its Different Clades: Proposal for Division of the phylum Aquificae into the Emended Order Aquificales, Containing the Families Aquificaceae and Hydrogenothermaceae, and a New Order Desulfurobacteriales ord. nov., containing the family Desulfurobacteriaceae. Antonie Van Leeuwenhoek 2013, 104, 349–368. [Google Scholar] [PubMed]
  30. Ho, J.; Adeolu, M.; Khadka, B.; Gupta, R.S. Identification of Distinctive Molecular Traits that are Characteristic of the Phylum “Deinococcus-Thermus” and Distinguish Its Main Constituent Groups. Syst. Appl. Microbiol. 2016, 39, 453–463. [Google Scholar] [CrossRef] [PubMed]
  31. Rapoport, T.A.; Jungnickel, B.; Kutay, U. Protein Transport across the Eukaryotic Endoplasmic Reticulum and Bacterial Inner Membranes. Annu. Rev. Biochem. 1996, 65, 271–303. [Google Scholar] [CrossRef] [PubMed]
  32. Fekkes, P.; Driessen, A.J. Protein Targeting to the Bacterial Cytoplasmic Membrane. Microbiol. Mol. Biol. Rev. 1999, 63, 161–173. [Google Scholar] [PubMed]
  33. Lill, R.; Cunningham, K.; Brundage, L.A.; Ito, K.; Oliver, D.; Wickner, W. SecA Protein Hydrolyzes ATP and is an Essential Component of the Protein Translocation ATPase of Escherichia Coli. EMBO J. 1989, 8, 961–966. [Google Scholar] [CrossRef]
  34. Collinson, I. SecA-a New Twist in the Tale. J. Bacteriol. 2017, 199. [Google Scholar] [CrossRef] [Green Version]
  35. Chen, Y.; Bauer, B.W.; Rapoport, T.A.; Gumbart, J.C. Conformational Changes of the Clamp of the Protein Translocation ATPase SecA. J. Mol. Biol. 2015, 427, 2348–2359. [Google Scholar] [CrossRef] [Green Version]
  36. Vassylyev, D.G.; Mori, H.; Vassylyeva, M.N.; Tsukazaki, T.; Kimura, Y.; Tahirov, T.H.; Ito, K. Crystal Structure of the Translocation ATPase SecA from Thermus Thermophilus Reveals a Parallel, Head-To-Head Dimer. J. Mol. Biol. 2006, 364, 248–258. [Google Scholar] [CrossRef]
  37. Zimmer, J.; Rapoport, T.A. Conformational Flexibility and Peptide Interaction of the Translocation ATPase SecA. J. Mol. Biol. 2009, 394, 606–612. [Google Scholar] [CrossRef] [Green Version]
  38. Griffiths, E.; Gupta, R.S. Signature Sequences in Diverse Proteins Provide Evidence for the Late Divergence of the Order Aquificales. Int. Microbiol. 2004, 7, 41–52. [Google Scholar]
  39. Gupta, R.S. Identification of Conserved Indels that are Useful for Classification and Evolutionary Studies. In Methods in Microbiology New Approaches to Prokaryotics Systematics; Goodfellow, M., Sutcliffe, I.C., Chun, J., Eds.; Elsevier: London, UK, 2014; pp. 153–182. [Google Scholar]
  40. Larkin, M.A.; Blackshields, G.; Brown, N.P.; Chenna, R.; McGettigan, P.A.; McWilliam, H.; Valentin, F.; Wallace, I.M.; Wilm, A.; Lopez, R.; et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007, 23, 2947–2948. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Goujon, M.; McWilliam, H.; Li, W.; Valentin, F.; Squizzato, S.; Paern, J.; Lopez, R. A New Bioinformatics Analysis Tools Framework at EMBL-EBI. Nucleic Acids Res. 2010, 38, W695–W699. [Google Scholar] [CrossRef] [PubMed]
  42. Castresana, J. Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis. Mol. Biol. Evol. 2000, 17, 540–552. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Tamura, K.; Stecher, G.; Peterson, D.; Filipski, A.; Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol. Biol. Evol. 2013, 30, 2725–2729. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Jones, D.T.; Taylor, W.R.; Thornton, J.M. The Rapid Generation of Mutation Data Matrices from Protein Sequences. Comput. Appl. Biosci. 1992, 8, 275–282. [Google Scholar] [CrossRef]
  45. Alnajar, S.; Khadka, B.; Gupta, R.S. Ribonucleotide Reductases from Bifidobacteria Contain Multiple Conserved Indels Distinguishing Them from All Other Organisms: In Silico Analysis of the Possible Role of a 43 aa Bifidobacteria-Specific Insert in the Class III RNR Homolog. Front. Microbiol. 2017, 8, 1409. [Google Scholar] [CrossRef] [Green Version]
  46. Gupta, R.S.; Nanda, A.; Khadka, B. Novel Molecular, Structural and Evolutionary Characteristics of the Phosphoketolases from Bifidobacteria and Coriobacteriales. PLoS ONE 2017, 12, e0172176. [Google Scholar] [CrossRef]
  47. Khadka, B.; Gupta, R.S. Identification of a Conserved 8 aa Insert in the PIP5K Protein in the Saccharomycetaceae Family of Fungi and the Molecular Dynamics Simulations and Structural Analysis to Investigate Its Potential Functional Role. Proteins 2017, 85, 1454–1467. [Google Scholar] [CrossRef]
  48. Khadka, B.; Gupta, R.S. Novel Molecular Signatures in the PIP4K/PIP5K Family of Proteins Specific for Different Isozymes and Subfamilies Provide Important Insights into the Evolutionary Divergence of this Protein Family. Genes 2019, 10, 312. [Google Scholar] [CrossRef] [Green Version]
  49. Shen, M.Y.; Sali, A. Statistical Potential for Assessment and Prediction of Protein Structures. Protein Sci. 2006, 15, 2507–2524. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Lee, G.R.; Heo, L.; Seok, C. Effective Protein Model Structure Refinement by Loop Modeling and Overall Relaxation. Proteins 2016, 84 (Suppl. 1), 293–301. [Google Scholar] [CrossRef]
  51. Lovell, S.C.; Davis, I.W.; Arendall, W.B., III; de Bakker, P.I.; Word, J.M.; Prisant, M.G.; Richardson, J.S.; Richardson, D.C. Structure Validation by Calpha Geometry: Phi, psi and Cbeta deviation. Proteins 2003, 50, 437–450. [Google Scholar] [CrossRef] [PubMed]
  52. Colovos, C.; Yeates, T.O. Verification of Protein Structures: Patterns of Nonbonded Atomic Interactions. Protein Sci. 1993, 2, 1511–1519. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Sippl, M.J. Recognition of Errors in Three-Dimensional Structures of Proteins. Proteins 1993, 17, 355–362. [Google Scholar] [CrossRef] [PubMed]
  54. Wiederstein, M.; Sippl, M.J. ProSA-Web: Interactive Web Service for the Recognition of Errors in Three-Dimensional Structures of Proteins. Nucleic Acids Res. 2007, 35, W407–W410. [Google Scholar] [CrossRef] [Green Version]
  55. Eisenberg, D.; Luthy, R.; Bowie, J.U. VERIFY3D: Assessment of Protein Models with Three-Dimensional Profiles. Methods Enzym. 1997, 277, 396–404. [Google Scholar]
  56. Van Der, S.D.; Lindahl, E.; Hess, B.; Groenhof, G.; Mark, A.E.; Berendsen, H.J. GROMACS: Fast, Flexible, and Free. J. Comput. Chem. 2005, 26, 1701–1718. [Google Scholar] [CrossRef]
  57. Abrahama, M.J.; Schulzb, R.; Pálla, S.; Smith, J.C.; Hessa, B.; Lindahla, E. GROMACS: High Performance Molecular Simulations through Multi-Level Parallelism from Laptops to Supercomputers. SoftwareX 2016, 1–2, 19–25. [Google Scholar] [CrossRef] [Green Version]
  58. Jorgensen, W.L.; Chandrasekhar, J.; Madura, J.D.; Impey, R.W.; Klein, M.L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79, 926–935. [Google Scholar] [CrossRef]
  59. Bussi, G.; Donadio, D.; Parrinello, M. Canonical Sampling through Velocity Rescaling. J. Chem. Phys. 2007, 126, 014101. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Nosé, S.; Klein, M.L. Constant Pressure Molecular Dynamics for Molecular Systems. Mol. Phys. 1983, 50, 1055–1076. [Google Scholar] [CrossRef]
  61. Parrinello, M.; Rahman, A. Polymorphic Transitions in Single crYstals: A New Molecular Dynamics Method. J. Appl. Phys. 1981, 52, 7182–7190. [Google Scholar] [CrossRef]
  62. Kutzner, C.; Pall, S.; Fechner, M.; Esztermann, A.; de Groot, B.L.; Grubmuller, H. Best Bang for Your Buck: GPU Nodes for GROMACS Biomolecular Simulations. J. Comput. Chem. 2015, 36, 1990–2008. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual Molecular Dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef]
  64. Nesbo, C.L.; Bradnan, D.M.; Adebusuyi, A.; Dlutek, M.; Petrus, A.K.; Foght, J.; Doolittle, W.F.; Noll, K.M. Mesotoga Prima gen. nov., sp. nov., the First Described Mesophilic Species of the Thermotogales. Extremophiles 2012, 16, 387–393. [Google Scholar] [CrossRef]
  65. L’Haridon, S.; Reysenbach, A.L.; Tindall, B.J.; Schonheit, P.; Banta, A.; Johnsen, U.; Schumann, P.; Gambacorta, A.; Stackebrandt, E.; Jeanthon, C. Desulfurobacterium Atlanticum sp. nov., Desulfurobacterium Pacificum sp. nov. and Thermovibrio Guaymasensis sp. nov., Three Thermophilic Members of the Desulfurobacteriaceae fam. nov., a deep Branching Lineage within the Bacteria. Int. J. Syst. Evol. Microbiol. 2006, 56, 2843–2852. [Google Scholar] [CrossRef] [Green Version]
  66. Hassan, F.M.N.; Gupta, R.S. Novel Sequence Features of DNA Repair Genes/Proteins from Deinococcus Species Implicated in Protection from Oxidatively Generated Damage. Genes 2018, 9, 149. [Google Scholar] [CrossRef] [Green Version]
  67. Bonjour, F.; Graber, A.; Aragno, M. Isolation of Bacillus schlegelii, a Thermophilic, Hydrogen Oxidizing, Aerobic Autotroph, from Geothermal and Nongeothermal Environments. Microb. Ecol. 1988, 16, 331–337. [Google Scholar] [CrossRef]
  68. Kampfer, P.; Glaeser, S.P.; Busse, H.J. Transfer of Bacillus Schlegelii to a Novel Genus and Proposal of Hydrogenibacillus Schlegelii gen. nov., comb. nov. Int. J. Syst. Evol. Microbiol. 2013, 63, 1723–1727. [Google Scholar] [CrossRef] [Green Version]
  69. Maker, A.; Hemp, J.; Pace, L.A.; Ward, L.M.; Fischer, W.W. Draft Genome Sequence of Hydrogenibacillus schlegelii MA48, a Deep-Branching Member of the Bacilli Class of Firmicutes. Genome Announc. 2017, 5, e00380-16. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  70. Zhaxybayeva, O.; Swithers, K.S.; Lapierre, P.; Fournier, G.P.; Bickhart, D.M.; Deboy, R.T.; Nelson, K.E.; Nesbo, C.L.; Doolittle, W.F.; Gogarten, J.P.; et al. On the Chimeric Nature, Thermophilic Origin, and Phylogenetic Placement of the Thermotogales. Proc. Natl. Acad. Sci. USA 2009, 106, 5865–5870. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Hunt, J.F.; Weinkauf, S.; Henry, L.; Fak, J.J.; McNicholas, P.; Oliver, D.B.; Deisenhofer, J. Nucleotide Control of Interdomain Interactions in the Conformational Reaction Cycle of SecA. Science 2002, 297, 2018–2026. [Google Scholar] [CrossRef] [PubMed]
  72. Papanikolau, Y.; Papadovasilaki, M.; Ravelli, R.B.; McCarthy, A.A.; Cusack, S.; Economou, A.; Petratos, K. Structure of Dimeric SecA, the Escherichia Coli Preprotein Translocase Motor. J. Mol. Biol. 2007, 366, 1545–1557. [Google Scholar] [CrossRef]
  73. Sharma, V.; Arockiasamy, A.; Ronning, D.R.; Savva, C.G.; Holzenburg, A.; Braunstein, M.; Jacobs, W.R., Jr.; Sacchettini, J.C. Crystal Structure of Mycobacterium Tuberculosis SecA, a Preprotein Translocating ATPase. Proc. Natl. Acad. Sci. USA 2003, 100, 2243–2248. [Google Scholar]
  74. Vrontou, E.; Economou, A. Structure and Function of SecA, the Preprotein Translocase Nanomotor. Biochim. Biophys Acta 2004, 1694, 67–80. [Google Scholar]
  75. Zimmer, J.; Nam, Y.; Rapoport, T.A. Structure of a Complex of the ATPase SecA and the Protein-Translocation Channel. Nature 2008, 455, 936–943. [Google Scholar] [CrossRef]
  76. Barillari, C.; Taylor, J.; Viner, R.; Essex, J.W. Classification of Water Molecules in Protein Binding Sites. J. Am. Chem. Soc. 2007, 129, 2577–2587. [Google Scholar] [CrossRef]
  77. Lu, Y.; Wang, R.; Yang, C.Y.; Wang, S. Analysis of Ligand-Bound Water Molecules in High-Resolution Crystal Structures of Protein-Ligand Complexes. J. Chem. Inf. Model. 2007, 47, 668–675. [Google Scholar] [CrossRef]
  78. Homans, S.W. Water, Water Everywhere--Except Where it Matters? Drug Discov. Today 2007, 12, 534–539. [Google Scholar] [CrossRef]
  79. Singh, N.; Briggs, J.M. Molecular Dynamics Simulations of Factor Xa: Insight into Conformational Transition of Its Binding Subsites. Biopolymers 2008, 89, 1104–1113. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  80. Schiebel, J.; Gaspari, R.; Wulsdorf, T.; Ngo, K.; Sohn, C.; Schrader, T.E.; Cavalli, A.; Ostermann, A.; Heine, A.; Klebe, G. Intriguing Role of Water in Protein-Ligand Binding Studied by Neutron Crystallography on Trypsin Complexes. Nat. Commun. 2018, 9, 3559. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  81. Vieille, C.; Burdette, D.S.; Zeikus, J.G. Thermozymes. Biotechnol. Annu. Rev. 1996, 2, 1–83. [Google Scholar] [PubMed]
  82. Kumar, S.; Tsai, C.J.; Nussinov, R. Factors Enhancing Protein Thermostability. Protein Eng. 2000, 13, 179–191. [Google Scholar] [CrossRef] [Green Version]
  83. Berezovsky, I.N.; Shakhnovich, E.I. Physics and Evolution of Thermophilic Adaptation. Proc. Natl. Acad. Sci. USA 2005, 102, 12742–12747. [Google Scholar] [CrossRef] [Green Version]
  84. Mizuguchi, K.; Sele, M.; Cubellis, M.V. Environment Specific Substitution Tables for Thermophilic Proteins. BMC Bioinform. 2007, 8 (Suppl. 1), S15. [Google Scholar] [CrossRef] [Green Version]
  85. Taylor, T.J.; Vaisman, I.I. Discrimination of Thermophilic and Mesophilic Proteins. BMC Struct. Biol. 2010, 10 (Suppl. 1), S5. [Google Scholar] [CrossRef] [Green Version]
  86. Singh, B.; Gupta, R.S. Conserved Inserts in the Hsp60 (GroEL) and Hsp70 (DnaK) Proteins are Essential for Cellular Growth. Mol. Genet. Genom. 2009, 281, 361–373. [Google Scholar] [CrossRef]
  87. Huber, R.; Eder, W. Aquificales. In the Prokaryotes; Dworkin, M., Falkow, S., Rosenberg, E., Schleifer, E., Stackebrandt, E., Eds.; Springer: New York, NY, USA, 2006. [Google Scholar]
  88. Bonch-Osmolovskaya, E. Aquificales. In Encylopedia of Life Sciences; Wieley: Hoboken, NJ, USA, 2008; pp. 1–7. [Google Scholar]
  89. Knight, J.D.; Hamelberg, D.; McCammon, J.A.; Kothary, R. The Role of Conserved Water Molecules in the Catalytic Domain of Protein Kinases. Proteins 2009, 76, 527–535. [Google Scholar] [CrossRef] [Green Version]
  90. Nygaard, R.; Valentin-Hansen, L.; Mokrosinski, J.; Frimurer, T.M.; Schwartz, T.W. Conserved Water-Mediated Hydrogen Bond Network between TM-I, -II, -VI, and -VII in 7TM Receptor Activation. J. Biol. Chem. 2010, 285, 19625–19636. [Google Scholar] [CrossRef] [Green Version]
  91. Kaur, M.; Bahia, M.S.; Silakari, O. Exploring the Role of Water Molecules for Docking and Receptor Guided 3D-QSAR Analysis of Naphthyridine Derivatives as Spleen Tyrosine Kinase (Syk) Inhibitors. J. Chem. Inf. Model. 2012, 52, 2619–2630. [Google Scholar] [CrossRef] [PubMed]
  92. Jeszenoi, N.; Balint, M.; Horvath, I.; Van Der, S.D.; Hetenyi, C. Exploration of Interfacial Hydration Networks of Target-Ligand Complexes. J. Chem. Inf. Model. 2016, 56, 148–158. [Google Scholar] [CrossRef] [PubMed]
  93. Lett, C.M.; Berghuis, A.M.; Frey, H.E.; Lepock, J.R.; Guillemette, J.G. The Role of a Conserved Water Molecule in the Redox-Dependent Thermal Stability of Iso-1-Cytochrome c. J. Biol. Chem. 1996, 271, 29088–29093. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  94. Shi, R.; Lin, S.X. Cofactor Hydrogen Bonding onto the Protein Main Chain is Conserved in the Short Chain Dehydrogenase/Reductase Family and Contributes to Nicotinamide Orientation. J. Biol. Chem. 2004, 279, 16778–16785. [Google Scholar]
  95. Sterpone, F.; Stirnemann, G.; Hynes, J.T.; Laage, D. Water Hydrogen-Bond Dynamics around Amino Acids: The Key Role of Hydrophilic Hydrogen-Bond Acceptor Groups. J. Phys. Chem. B 2010, 114, 2083–2089. [Google Scholar] [CrossRef]
  96. Huggins, D.J.; Tidor, B. Systematic Placement of Structural Water Molecules for Improved Scoring of Protein-Ligand Interactions. Protein Eng. Des. Sel. 2011, 24, 777–789. [Google Scholar] [CrossRef]
  97. Milenkovic, S.; Bondar, A.N. Mechanism of Conformational Coupling in SecA: Key Role of Hydrogen-Bonding Networks and Water Interactions. Biochim. Biophys. Acta 2016, 1858, 374–385. [Google Scholar] [CrossRef]
  98. Geszvain, K.; Gruber, T.M.; Mooney, R.A.; Gross, C.A.; Landick, R. A Hydrophobic Patch on the Flap-Tip Helix of E.coli RNA Polymerase Mediates Sigma(70) Region 4 Function. J. Mol. Biol. 2004, 343, 569–587. [Google Scholar] [CrossRef]
  99. Akiva, E.; Itzhaki, Z.; Margalit, H. Built-In Loops Allow Versatility in Domain-Domain Interactions: Lessons from Self-Interacting Domains. Proc. Natl. Acad. Sci. USA 2008, 105, 13292–13297. [Google Scholar]
  100. Hashimoto, K.; Panchenko, A.R. Mechanisms of protein oligomerization, the critical role of insertions and deletions in maintaining different oligomeric states. Proc. Natl. Acad. Sci. USA 2010, 107, 20352–20357. [Google Scholar] [CrossRef] [Green Version]
  101. Schoeffler, A.J.; May, A.P.; Berger, J.M. A Domain Insertion in Escherichia Coli GyrB Adopts a Novel Fold that Plays a Critical Role in Gyrase Function. Nucleic Acids Res. 2010, 38, 7830–7844. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  102. Gouridis, G.; Karamanou, S.; Sardis, M.F.; Scharer, M.A.; Capitani, G.; Economou, A. Quaternary Dynamics of the SecA Motor Drive Translocase Catalysis. Mol. Cell 2013, 52, 655–666. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  103. Singh, R.; Kraft, C.; Jaiswal, R.; Sejwal, K.; Kasaragod, V.B.; Kuper, J.; Burger, J.; Mielke, T.; Luirink, J.; Bhushan, S. Cryo-Electron Microscopic Structure of SecA Protein Bound to the 70S Ribosome. J. Biol. Chem. 2014, 289, 7190–7199. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  104. Wowor, A.J.; Yan, Y.; Auclair, S.M.; Yu, D.; Zhang, J.; May, E.R.; Gross, M.L.; Kendall, D.A.; Cole, J.L. Analysis of SecA Dimerization in Solution. Biochemistry 2014, 53, 3248–3260. [Google Scholar] [CrossRef] [PubMed]
  105. Collinson, I.; Corey, R.A.; Allen, W.J. Channel Crossing: How are Proteins Shipped across the Bacterial Plasma Membrane? Philos. Trans. R. Soc. Lond. B Biol. Sci. 2015, 370. [Google Scholar] [CrossRef]
  106. Fekkes, P.; de Wit, J.G.; van der Wolk, J.P.; Kimsey, H.H.; Kumamoto, C.A.; Driessen, A.J. Preprotein Transfer to the Escherichia coli Translocase Requires the Co-Operative Binding of SecB and the Signal Sequence to SecA. Mol. Microbiol. 1998, 29, 1179–1190. [Google Scholar] [CrossRef] [Green Version]
  107. Pretz, M.G.; Remigy, H.; Swaving, J.; Albers, S.V.; Garrido, V.G.; Chami, M.; Engel, A.; Driessen, A.J. Functional and Structural Characterization of the Minimal Sec Translocase of the Hyperthermophile Thermotoga maritima. Extremophiles 2005, 9, 307–316. [Google Scholar] [CrossRef]
  108. Sala, A.; Calderon, V.; Bordes, P.; Genevaux, P. TAC from Mycobacterium Tuberculosis: A Paradigm for Stress-Responsive Toxin-Antitoxin Systems Controlled by SecB-like Chaperones. Cell Stress Chaperones 2013, 18, 129–135. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Excerpts from the sequence alignment of SecA protein showing a 50 aa conserved insert that is a distinctive characteristic of species from the orders Thermotogales and Aquificales but absent in the homologs from all other bacteria. The numbers in the parenthesis below the orders’ names indicate the number of species from these orders for which SecA homologs were identified and how many of them contained the indicated insert. The dashes (-) in the sequence alignment denote sequence identity with the amino acid shown on the top line. The accession numbers of the protein sequences are provided in the second column and numbers on top of the sequence alignment indicate the position of this sequence in T. maritima.
Figure 1. Excerpts from the sequence alignment of SecA protein showing a 50 aa conserved insert that is a distinctive characteristic of species from the orders Thermotogales and Aquificales but absent in the homologs from all other bacteria. The numbers in the parenthesis below the orders’ names indicate the number of species from these orders for which SecA homologs were identified and how many of them contained the indicated insert. The dashes (-) in the sequence alignment denote sequence identity with the amino acid shown on the top line. The accession numbers of the protein sequences are provided in the second column and numbers on top of the sequence alignment indicate the position of this sequence in T. maritima.
Microorganisms 08 00059 g001
Figure 2. Partial sequence alignment of SecA showing a 76 aa conserved insert that is a unique characteristic of species from the order Thermales and H. schlegelii but absent from the SecA homologs from all other bacteria. Except for H. schlegelii, no other Firmicutes species contained this insert. Other information concerning this sequence alignment is the same as in Figure 1.
Figure 2. Partial sequence alignment of SecA showing a 76 aa conserved insert that is a unique characteristic of species from the order Thermales and H. schlegelii but absent from the SecA homologs from all other bacteria. Except for H. schlegelii, no other Firmicutes species contained this insert. Other information concerning this sequence alignment is the same as in Figure 1.
Microorganisms 08 00059 g002
Figure 3. A maximum-likelihood phylogenetic tree based on SecA protein sequences from representative bacterial phyla. The groups of species containing the 50 aa conserved signature inserts (CSI) are marked by red arrows, whereas those containing the 76 aa CSI are denoted by blue arrows. The numbers on the nodes indicate bootstrap values for the observed groupings.
Figure 3. A maximum-likelihood phylogenetic tree based on SecA protein sequences from representative bacterial phyla. The groups of species containing the 50 aa conserved signature inserts (CSI) are marked by red arrows, whereas those containing the 76 aa CSI are denoted by blue arrows. The numbers on the nodes indicate bootstrap values for the observed groupings.
Microorganisms 08 00059 g003
Figure 4. Cartoon and transparent surface representation of the 3D structure of (A) Thermotoga maritima SecA (PDB ID: 4YS0) shown as green and the various structural domains present in this protein are colored and labeled; (B) homology model of T. maritima SecA lacking the 50 aa CSI. The bound adenosine diphosphate (ADP) in the structure is shown as a blue stick; (C) Structure of SecA dimer from Thermus thermophilus (TtSecA) (PDB ID: 2IPC). The 50 aa CSI and the 76 CSI are highlighted in red in these structures.
Figure 4. Cartoon and transparent surface representation of the 3D structure of (A) Thermotoga maritima SecA (PDB ID: 4YS0) shown as green and the various structural domains present in this protein are colored and labeled; (B) homology model of T. maritima SecA lacking the 50 aa CSI. The bound adenosine diphosphate (ADP) in the structure is shown as a blue stick; (C) Structure of SecA dimer from Thermus thermophilus (TtSecA) (PDB ID: 2IPC). The 50 aa CSI and the 76 CSI are highlighted in red in these structures.
Microorganisms 08 00059 g004
Figure 5. Snapshots from different time intervals extracted from the 100 ns molecular dynamic (MD) trajectories of TmSecA (+CSI) showing the coordinates of water molecules from the simulation (red and white spheres) that constantly occupy the location near the backbone of residue Glu185 from the 50 aa CSI in TmSecA at (A) 303.15 K, and (B) 363.15 K. Each of the extracted structures representing the different time scale is represented with different colors and the times are indicated in the snapshots. The magenta spheres represent the coordinates for crystallographic water molecules which are from the corresponding region in the crystal structure of TmSecA and they are superimposed to each of the extracted structures to show the occupancy of the simulation water molecules of that region. Any hydrogen bonds formed between simulation water molecules and adenine group of ADP or residues from the insert are shown as yellow dash lines. Other randomly picked snapshots at different time intervals that show the occupancy and interaction of water molecules with residue (Glu185) from 50 aa CSI and ADP in TmSecA are provided in Figure S4A,B. (C) Trajectories showing the time evolution of the number of hydrogen bond interactions formed between Glu185 residue backbone atom (O) from the 50 aa CSI and water molecules that are within 4 Å of Glu185 at 303.15 K (blue) and (D) 363.15 K (red), calculated over the 100 ns time period.
Figure 5. Snapshots from different time intervals extracted from the 100 ns molecular dynamic (MD) trajectories of TmSecA (+CSI) showing the coordinates of water molecules from the simulation (red and white spheres) that constantly occupy the location near the backbone of residue Glu185 from the 50 aa CSI in TmSecA at (A) 303.15 K, and (B) 363.15 K. Each of the extracted structures representing the different time scale is represented with different colors and the times are indicated in the snapshots. The magenta spheres represent the coordinates for crystallographic water molecules which are from the corresponding region in the crystal structure of TmSecA and they are superimposed to each of the extracted structures to show the occupancy of the simulation water molecules of that region. Any hydrogen bonds formed between simulation water molecules and adenine group of ADP or residues from the insert are shown as yellow dash lines. Other randomly picked snapshots at different time intervals that show the occupancy and interaction of water molecules with residue (Glu185) from 50 aa CSI and ADP in TmSecA are provided in Figure S4A,B. (C) Trajectories showing the time evolution of the number of hydrogen bond interactions formed between Glu185 residue backbone atom (O) from the 50 aa CSI and water molecules that are within 4 Å of Glu185 at 303.15 K (blue) and (D) 363.15 K (red), calculated over the 100 ns time period.
Microorganisms 08 00059 g005

Share and Cite

MDPI and ACS Style

Khadka, B.; Persaud, D.; Gupta, R.S. Novel Sequence Feature of SecA Translocase Protein Unique to the Thermophilic Bacteria: Bioinformatics Analyses to Investigate Their Potential Roles. Microorganisms 2020, 8, 59. https://doi.org/10.3390/microorganisms8010059

AMA Style

Khadka B, Persaud D, Gupta RS. Novel Sequence Feature of SecA Translocase Protein Unique to the Thermophilic Bacteria: Bioinformatics Analyses to Investigate Their Potential Roles. Microorganisms. 2020; 8(1):59. https://doi.org/10.3390/microorganisms8010059

Chicago/Turabian Style

Khadka, Bijendra, Dhillon Persaud, and Radhey S. Gupta. 2020. "Novel Sequence Feature of SecA Translocase Protein Unique to the Thermophilic Bacteria: Bioinformatics Analyses to Investigate Their Potential Roles" Microorganisms 8, no. 1: 59. https://doi.org/10.3390/microorganisms8010059

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop