The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication

Siddiqui, Irum Javaid; Pervaiz, Nashaiman; Abbasi, Amir Ali

doi:10.1038/srep24475

Download PDF

Article
Open access
Published: 15 April 2016

The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication

Irum Javaid Siddiqui¹,
Nashaiman Pervaiz¹ &
Amir Ali Abbasi¹

Scientific Reports volume 6, Article number: 24475 (2016) Cite this article

14k Accesses
92 Citations
29 Altmetric
Metrics details

Subjects

Abstract

After Alzheimer, Parkinson’s disease (PD) is the second most common neurodegenerative disorder. Alpha synuclein (SNCA) is deemed as a major component of Lewy bodies, a neuropathological feature of PD. Five point mutations in SNCA have been reported so far, responsible for autosomal dominant PD. This study aims to decipher evolutionary and structural insights of SNCA by revealing its sequence and structural evolutionary patterns among sarcopterygians and its paralogous counterparts (SNCB and SNCG). Rate analysis detected strong purifying selection on entire synuclein family. Structural dynamics divulges that during the course of sarcopterygian evolutionary history, the region encompassed 32 to 58 of N-terminal domain of SNCA has acquired its critical functional significance through the epistatic influence of the lineage specific substitutions. In sum, these findings provide an evidence that the region from 32 to 58 of N-terminal lipid binding alpha helix domain of SNCA is the most critical region, not only from the evolutionary perspective but also for the stability and the proper conformation of the protein as well as crucial for the disease pathogenesis, harboring critical interaction sites.

Alzheimer-related genes show accelerated evolution

Article Open access 13 March 2020

Anne Nitsche, Christian Arnold, … Thomas Arendt

A naturally occurring variant of SHLP2 is a protective factor in Parkinson’s disease

Article Open access 03 January 2024

Su-Jeong Kim, Brendan Miller, … Pinchas Cohen

A short motif in the N-terminal region of α-synuclein is critical for both aggregation and function

Article 09 March 2020

Ciaran P. A. Doherty, Sabine M. Ulamec, … David J. Brockwell

Introduction

PD is the second most common neurodegenerative disorder after Alzheimer which affects 1–2% of the population above age 65 and 4–5% above age 85¹. It is characterized by the loss of dopaminergic neurons from substantia nigra (a brain structure located in the mesencephalon that plays an important role in reward, addiction, and movement) and the presence of intracellular inclusions i.e. lewy bodies and lewy neuritis². Symptoms of PD include shaking, tremors, bradykinesia and difficulty with walking and gait. Other symptoms include sensory, sleep and emotional problems. SNCA is considered as the major causative gene involved in the early onset of familial Parkinson’s disease (FPD) characterized by five missense mutations identified so far i.e. A30P³, E46K⁴, H50Q⁵, G51D⁶ and A53T⁷. SNCA is also deemed to be involved in various other neurodegenerative disorders i.e. Alzheimer’s disease (AD), Lewy bodies’ disease (LBD) and Muscular System Atrophy (MSA)⁸.

SNCA is a 14.5 kDa, 140 a.a protein encoded by 5 exons with total transcript length of 3041 bps maps on 4q21.3-q22. The other members of synuclein family are SNCB and SNCG mapped to human chromosome 5q35 and 10q23.2-q23.3 respectively⁹. Architecture of SNCA protein reveals the presence of N-terminal region composed of incomplete KXKEGV motifs, extremely hydrophobic NAC domain and highly acidic C-terminal domain⁸. At physiological conditions, SNCA is believed to be intrinsically disordered monomer or helically folded tetramer¹⁰. Oligomeric structure of SNCA is considered as a toxic form but recent observation abolished this hypothesis^10,11. During the past two decades several hypotheses exist about toxic structural form of SNCA, but none of them are completely consensual. However, neurotoxic form of SNCA aggregates within neuron and spreads across the anatomically interconnected regions of PD brain through interneuronal transmission using various mechanisms¹². Although SNCA is expressed predominately in brain, it is also expressed in heart, skeletal muscle and pancreas^8,13. Molecular function of SNCA is quite ambiguous. Based on its structure, physical properties and interacting partners, several hypotheses for the normal function of SNCA have been proposed. It is considered to be involved in regulation of dopamine release and transport, induces fibrillization of microtubule associated protein tau, and exert neuroprotective phenotype in non-dopaminergic neurons by inhibiting both p53 expression and transactivation of proapoptotic genes leading to decreased caspase-3 activation^{8,13,14,15,16}. However, missense mutations (especially A53T) in SNCA abolished the neuroprotective effect of SNCA and promote apoptosis by reversing the expression of p53^14,15.

Due to the significant role of SNCA in FPD and other neurodegenerative disorders, this study was premeditated to decipher the molecular evolution of SNCA, which infers the phylogenetic history of synuclein family with the help of its putative orthologs and paralogs. Analysis revealed the sarcopterygian specific origin of SNCA which suggested its lineage specific functional role. On account of this interest, a comparative sequence and structural analysis was performed to estimate the selection and functional constraints on SNCA. Evolutionary rate difference was coupled with structural information to infer potential functional changes and the impact of lineage specific substitutions. In addition variations in domain topologies were explored by comparative analysis of known functional domains of SNCA protein. In light of the findings, it was hypothesized that the region from 32 to 58 of N-terminal lipid binding domain is the most “critical region” of SNCA from evolutionary, functional and disease pathogenesis perspective.

Materials and Methods

Sequence collection

Putative paralogs of human SNCA were determined by Ensembl Genome Browser¹⁷ using Ensembl paralogy prediction. The closest putative orthologs were obtained by BLASTp¹⁸ bidirectional searches against protein database available at Ensembl and National Centre for Biotechnology Information (NCBI). Confirmation about ancestral-descendents relationship among putative orthologs was done through clustering of homologous proteins within phylogenetic trees. Sequences whose position within a tree was sharply in conflict with the uncontested animal phylogeny were excluded. The list of all used sequences (protein sequence data) is given as Supplementary Data File 1.

Species that were selected includes Homo sapiens (Human), Pan troglodytes (Chimpanzee), Mus musculus (Mouse), Rattus norvegicus (Rat), Gallus gallus (Chicken), Canis familiaris (Dog), Equus caballus (Horse), Loxodonta Africana (Elephant), Dasypus novemcinctus (Armadillo), Anolis carolinensis (Lizard), Pelodiscus sinensis (Chinese softshell turtle), Xenopustropicalis (Frog), Latimeria chalumnae (Coelacanth), Danio rerio (Zebrafish), Takifugu rubripes (Fugu), Tetraodon nigroviridis (Tetraodon), Gasterosteus aculeatus (Stickleback), Oryzias latipes (Medaka).

Sequence analysis

The phylogenetic tree of SNCA family was reconstructed by using the neighbor-joining (NJ) method^19,20, complete deletion option was used to exclude any site which postulated a gap in the sequences. Poisson corrected (PC) amino acid distance and uncorrected proportion (p) of amino acid difference were used as amino acid substitution models. Because both methods produced similar results, only results from NJ tree based on uncorrected p-distance are presented here. Reliability of the resulting tree topology was tested by the bootstrap method²¹ (at1000 pseudo replicates) which generated the bootstrap probability for each interior branch in the tree. Maximum Likelihood (ML) tree was also constructed by using the Whelan and Goldman²² (WAG) model of amino acid replacement (see Supplementary Fig. S1).

Ancestral SNCA sequences were inferred by using ML method and WAG model of amino acid substitution. To investigate selection constraint within hominoids (human, chimpanzee, gorilla, orangutan), non-hominoids (macaque, marmoset, squirrel monkey, bushbaby), non-primate placental mammals (mouse, dog, cow, elephant) and non-mammalian tetrapods (chicken, turtle, frog, coelacanth), z-test was implemented with MEGA²³. dN-dS rate analysis was also conducted for each of the above mentioned group with the help of Goldman And Yang²⁴ (GY-94) method in Hyphy which estimates synonymous and non-synonymous substitution rates through codon based model²⁴. Evolutionary rate differences of putative paralogs (SNCB & SNCG) were also assessed among sarcopterygians.

Domains, motifs and sub-motifs have been assigned to human SNCA as described previously^25,26. ClustalW2 based multiple sequence alignments were used to map the putative positioning of these domains and motifs to paralogs of SNCA protein in human and its orthologs in various sarcopterygian species²⁷. Location and the positioning of the identified domains and motifs were also plotted. Substitutions that have occurred within sarcopterygians during evolution were assigned to human SNCA with the help of ancestor reconstruction technique. Previously reported human specific mutations involved in familial Parkinson’s disease were also mapped. Negatively constrained residues of SNCA among sarcopterygians were estimated with Hyphy by implementing Single Likelihood Ancestor Counting (SLAC) method which uses global codon model and maximum likelihood to reconstruct the evolutionary history²⁴. Impact of the substitutions that have occurred during evolution within sarcopterygians with K_a/K_s <1 were also classified on the basis of their physicochemical properties i.e. charge, volume, polarity into neutral or radical^28,29. Human paralogs of SNCA (SNCB & SNCG) were likewise compared and paralogs specific substitutions were identified.

Structural analysis

For the structural analysis with evolutionary and functional perspective, NMR structure of human SNCA (1XQ8) was attained from Protein Data Bank (PDB)³⁰ and used as a reference for comparative analysis of the structural deviations. After inferring ancestral sequences, sarcopterygian ancestral, mammalian and non-primate placental mammal’s specific SNCA proteins were modeled with the help of Modeller³¹. Structures of the putative paralogs (SNCB & SNCG) of human SNCA were also modelled by comparative homology modelling. Best structures were scrutinized on the basis of Discrete Optimized Protein Energy (DOPE) score, and then further energy minimization protocol was implemented with chimera³² in order to improve the quality of the modeled structures. Quality of the modeled structures were also investigated by Ramachandran³³ and Errat³⁴ plots (see Supplementary Fig. S2A,B, S5A,B). Superimposition of the modeled structures with 1XQ8 was carried out with chimera³² and root mean square deviation (RMSD) values were calculated. Quantification of the structural deviations observed in chimera³² were further reconnoitered by SPINEX³⁵ in terms of quantification of the deviations identified in the backbone torsion angles (phi Φ°, psi Ψ°) of the modeled ancestral proteins. Impact of the lineage specific substitutions on the modeled ancestral SNCA proteins were investigated with the help of MuPro³⁶ in terms of sequence and structural stability.In order to inspect the structural deviations in the human specific mutations i.e. A30P, E46K, H50Q, G51D, A53T involved in FPD, mutant models were also generated by Modeller³¹, analyzed and minimized by chimera³² and estimated with Ramachandran³³ and Errat³⁴ (see Supplementary Figs S3 and S4A,B).

Interaction study

To conduct the interaction study, Cluspro protein-protein docking server³⁷ was utilized. NMR structure of coiled-coil domain (2KES) of the interacting partner, synphilin-1 (SNCAIP) was obtained from PDB³⁸. Domains and motifs reported in literature were assigned to human SNCAIP. Interactions between human specific (1XQ8), sarcopterygian ancestral, mammalian, non-primate placental mammal’s specific and human specific mutant models of SNCA proteins were examined with the help of Ligplot³⁹ and PyMol⁴⁰.

Results

Phylogenetic analysis

Evolutionary relationship between SNCA and its putative paralogs was estimated by NJ and ML methods (Fig. 1, see Supplementary Fig. S1). Phylogenetic analysis of synuclein family revealed that two duplication events have contributed in diversification of this family. The first duplication event has transpired at the root of vertebrates, prior to tetrapod-teleost split, deduced in SNCG paralog and ancestor of SNCA/SNCB. Whereas the second duplication event has occurred at the root of sarcopterygian’s lineage, after speciation of the teleost fish (SNCA/B) resulted in SNCA and SNCB paralogs. It is also worth noting that the branch lengths for SNCG proteins are longer than those of other two paralogs, suggesting that this paralog may have rapidly evolved in comparison with SNCA and SNCB. Blast based bidirectional similarity searches fail to identify any ortholog of this family among invertebrates which reinforces the assumption of vertebrate specific origin of this family (Fig. 1, see Supplementary Fig. S1).

**Figure 1: Neighbor Joining tree of the synuclein family members.**

Comparing evolutionary rate of SNCA gene among sarcopterygians

In order to estimate the evolutionary rate differences of SNCA gene among various clades of sarcopterygians, the orthologs of SNCA from representative members of hominoids (human, chimpanzee, gorilla, orangutan), non-hominoids (macaque, marmoset, squirrel monkey, bushbaby), non-primate placental mammals (mouse, dog, cow, elephant) and non-mammalian tetrapods (chicken, turtle, frog, coelacanth) were obtained. Non synonymous (K_a/dN) and synonymous substitution rates (K_s/dS) were estimated for each group. And then z-test was applied to check selection constraint on the above mentioned groups.

The K_a-K_s (dN-dS) difference for hominoids was found as −2.241 (P = 0.014), non-hominoids was −4.716 (P = 0), non-primate placental mammals was −6.1777 (P = 0) and non-mammalian tetrapods was −7.085 (P = 0) (see Supplementary Table S1). In general, K_a value lower than K_s (K_a < K_s) suggests negative selection, i.e. non silent substitutions have been purged by natural selection, whereas the inverse scenario (K_a > K_s) implies positive selection i.e. advantageous mutations have accumulated during the course of evolution. However the evidence for positive or negative selection requires the value to be significantly different from each other^41,42. Results deduced by z-test (Ka-Ks < 0, p < 0.05) suggests that SNCA has deviated from neutrality during evolution showing the signature of negative selection constraint within sarcopterygian lineage (see Supplementary Table S1).

The evolutionary rate analysis were also performed for putative paralogous copies of SNCA, i.e. SNCB and SNCG. The K_a-K_s (dN-dS) difference for SNCB was identified as −1.661(P = 0.05) for hominoids, −4.708(P = 0) for non-hominoids primates, −5.212(P = 0) for non-primate placental mammals and −2.992(P = 0.002) for non-mammalian tetrapods (see Supplementary Table S2). The K_a-K_s (dN-dS) difference for SNCG was identified as −1.658(P = 0.05) for hominoids, −4.064(P = 0) for non-hominoid primates, −5.485(P = 0) for non-primate placental mammals, −6.306(P = 0) non-mammalian tetrapods and −6.341(P = 0) for fishes (fugu, tetraodon, stickleback, medaka) (see Supplementary Table S3). This data specifies that not only SNCA but other two members of synuclein family have also been retained under strong purifying selection pressure among the analyzed sarcopterygians.

Domain organization of SNCA

In order to gain an insight into comparative domain organization, complete domain annotation of SNCA gene was carried out entailing the orthologs representative from sarcopterygians (human, mouse, dog, chicken, coelacanth) as well as the paralogous copies in human (SNCB and SNCG). This annotation revealed the distinctive architecture of SNCA gene which is comprised of N-terminal A₂ lipid binding alpha helix domain (1–60), Non-amyloid β component (NAC) domain (61–95) and C-terminal acidic domain (96–140) (Fig. 2a).

N-terminal lipid binding domain consists of 5 KXKEGV imperfect repeats²⁶, and these repeats are identified as highly conserved among analyzed orthologs and paralogs of human SNCA in terms of number and position (Fig. 2a). This region is predicted to form amphipathic α-helices, and considered to be involved in interacting with phospholipids²⁶.

NAC domain forms the amyloidogenic core of SNCA²⁶. NAC comprised of GAV motif with VGGAVVTGV(66–74) consensus sequence and three GXXX sub-motifs (where X is any of Gly, Ala,Val, Ile, Leu, Phe, Tyr, Trp, Thr, Ser or Met)²⁶. Among the analyzed orthologs, NAC was identified as highly conserved. Whereas, among its counter paralogous copies, in SNCB the length of NAC (61–84) was reduced due to the absence of one GXXX motif and KXKEGV repeat. While no GXXX sub-motif was identified in SNCG. Among the three putative paralogs, the GAV motif was explicitly present in SNCA only (Fig. 2a). NAC is considered as extremely necessary for SNCA aggregation and fibrillation²⁶. Whereas GAV is surmised as signature motif responsible for this aggregation process²⁵.

C-terminal acidic domain harbors copper binding motif containing DPDNEA(119–124) consensus sequence⁴³ which was found highly conserved among the analyzed orthologs of SNCA. Multiple sequence alignment failed to identify the conservation of this motif among SNCB and SNCG paralogs (Fig. 2a). This domain of SNCA is enriched in acidic residues and prolines. Three highly conserved tyrosine residues, which are considered as a subfamily signature of SNCA and SNCB, are also located in this region²⁶. It is also proposed that copper binding accelerates the aggregation of SNCA and influences its pathological effects⁴⁴.

Lineage specific substitutions that have occurred during evolution among analyzed sarcopterygians have been mapped on human SNCA with the help of ancestor reconstruction technique. It classified that nine substitutions have occurred at the root of mammalian ancestor. While two substitutions occurred at the root of non-primate placental mammal’s lineage and one occurred specifically to the catarrhini’s (hominoids and old world monkeys) lineage (Fig. 2a) (Table 1). Out of these 12 substitutions identified, five (S64T, G68E, N87S, L94F, V95G) were found to be confined towards NAC, while six (A101G, F107A, M112I, M113L, P129S, E132G) were residing in the C-terminal acidic domain. Only 1 substitution (T53A) was identified in the N-terminal region (Fig. 2a). Physicochemical properties of these amino acid substitutions were then analyzed which illustrated that approximately all the substitutions that have occurred during evolution were of radical type except T53A and A101G (Table 1). This analysis highlights N-terminal lipid binding domain as highly conserved among the analyzed orthologs and paralogs. This finding was further strengthened by the physical positioning of previously reported five human specific mutations associated with familial Parkinson’s disease i.e. A30P³, E46K⁴, H50Q⁵, G51D⁶, and A53T⁷ on human SNCA. Results revealed the confinement of these mutations explicitly towards the N-terminal domain which in turn implies the significance of high conservation of this region not only with functional perspective but also for pathogenesis of FPD (Fig. 2a). With the help of SLAC-window analysis, it appears that N-terminal domain comprised of 15 negatively constrained sites which further advocates that strong selective constraints are operating their role in preserving this region during sarcopterygians evolution (Fig. 2b, see Supplementary Table S4).

Table 1 After the divergence from sarcopterygian ancestor, nine fixed amino-acid changes occurred at the root of mammals, whereas two occurred specifically in non-primate placental mammal’s lineage while one occurred specific in primate’s (catarrhini) lineage.

Full size table

Structural evolution of SNCA

To further inspect how purifying selection is performing its role in defining the spatial constraints on ancestral SNCA proteins at structural level, a comparative structural study was conducted. NMR structure of human SNCA (1XQ8) was taken as a reference and compared with modeled ancestral proteins (Fig. 3). Structural deviations were examined with the help of RMSD values (Fig. 3, see Supplementary Fig. S2A,B). Results revealed very remarkable aspects that were not anticipated by comparative analysis at sequence level. Comparative structural analysis suggests that structure of SNCA has passed through series of transitions to acquire its favored conformation. Superimposed models of ancestral SNCA proteins and 1XQ8 revealed common deviated region encompassed 32 to 58 of N-terminal lipid binding domain of SNCA (Table 2). These structural deviations were also measured with the help of backbone torsions quantification which highlighted the fact that the region 32 to 58 of SNCA is continuously evolving at structural level, despite of its high sequence conservation (Table 2). It was also identified that destabilizing substitutions have been incorporated during SNCA evolution with the aim to achieve its intrinsic disordered conformation which implies the functional constraints behind it (Table 1). So it seems logical to speculate from this structural comparison approach that during SNCA evolution those substitutions have been incorporated which not only caused destabilization of SNCA but also brought drastic structural shift in the identified region. This critical region (32–58) was also recognized crucial for the proper conformation of not only N-terminal A₂ alpha helix domain but for NAC domain as well. With the help of electron and X-ray diffraction techniques it has been reported that normal SNCA assembles through its N-terminal region which again highlights the significant role of this domain⁴⁵.

Table 2 Analysis of the lineage specific structural deviations in the backbone torsion angles of the ancestral SNCA proteins by incorporating the clade specific substitutions.

Full size table

Intriguingly, all human specific mutations involved in FPD pathogenesis reside in this crucial region which signifies that any change in this region will be deleterious because of the strong selection and functional constraints imposed on it. Superimposed mutant models with 1XQ8 identified major shifts toward lipid binding domain in A30P and H50Q, whereas major change was observed in lipid binding and NAC domains in case of E46K and A53T. Only G51D showed altered NAC region only (see Supplementary Fig.S4A,B). All five mutant models were having highly deviated region from 32 to 58 in common. It can be postulated from this comparative structural analysis that the primary effects and the role of these five SNCA mutations in FPD pathogenesis can be different because of their differential structural morphologies.

Further to investigate the structural differences among the human paralogs of SNCA, a comparative structural analysis was also conducted. As the NMR structures of human SNCB and SNCG have not been reported so far, their structures were modelled by taking NMR structure of human SNCA (1XQ8) as reference and the structural deviations were assessed (see Supplementary Fig.S5A,B). It appears that SNCB and SNCG structures are highly deviated from SNCA at N-terminal and NAC domain (Fig. 4).

**Figure 4: Structural divergence among human paralogs of SNCA.**

Analysis of the interactions between SNCA and coiled-coil domain of SNCAIP

In order to investigate further the importance of this critical region, its functional significance was then deciphered with the help of interaction study. For this purpose, synphilin-1 (SNCAIP) was considered, as domain annotation of synphilin-1 revealed that it is 919 a.a (3745 bp) protein encoded by 10 exons¹⁷, encompassed six ankyrin like repeats and one central coiled-coil domain (510–557) (Fig. 5a). It has been corroborated from biochemical and NMR techniques that SNCAIP interacts with N-terminal region of SNCA³⁸. Although the normal cellular function of these interacting partners is still unknown but it has been reported that SNCAIP is developmentally localized to synaptic terminals and its association with synaptic vesicles is modulated by SNCA. In this context, SNCAIP is regarded as synaptic partner of SNCA, implying that this interaction mediates the synaptic functions of SNCA, possibly by anchoring SNCA to the vesicle membrane⁴⁶.

In order to explore the role of critical region in interaction, docking analysis was conducted. Interactions have been identified between sarcopterygian ancestral, mammalian specific and non- primate placental mammal’s specific SNCA proteins, which revealed that interaction between SNCA and coiled-coil domain of SNCAIP has evolved with the passage of time i.e. lineage specific interactions have emerged during sarcopterygian evolutionary history (Fig. 5b). Interaction analysis between human specific SNCA and coiled-coil domain of SNCAIP revealed that at the root of catarrhines, few lineage specific interactions have evolved i.e. Lys32, Tyr39, and Lys45 (see Supplementary Fig. S6, see Supplementary Table S5). Interestingly these human (catarrhines) specific interactions reside in the identified critical region which strengthens our hypothesis of the structural and functional significance of this region and its vital role in FPD pathogenesis (Fig. 5c).

Interaction analysis between mutant models of human specific SNCA with SNCAIP revealed altered interaction patterns while some of the wild type interactions were retained too which signifies that SNCA and coiled-coil domain of SNCAIP not only interact in normal individuals but also in the FPD patients, however the pattern of interactions was found altered. Docked complexes of A30P-SNCAIP and E46K-SNCAIP revealed that the interactions shifted entirely to the NAC domain whereas H50Q-SNCAIP and G51D-SNCAIP complexes showed altered interactions involving N-terminal and NAC domains. Only interactions in A53T-SNCAIP were confined entirely to N-terminal domain but the pattern was found altered. It can be assumed that the interactions altered due to differential interaction pattern of SNCA and SNCAIP thus affecting their binding affinities which in turn influence SNCA aggregation (Fig. 5d, see Supplementary Fig.S7, see Supplementary Table S6).

Discussion

Advent of high throughput annotation of genes has enabled bioinformatics analysis of genes of interest to provide important insight into their evolutionary link with particular phenotypic trait and association with human disease⁴¹. SNCA is deemed as one of the major causative genes in different neurodegenerative disorders. Five point mutations in SNCA have been reported so far, responsible for autosomal dominant Parkinson’s disease. This novel study primarily highlights the sequence and structural mechanisms of SNCA protein evolution within sarcopterygian lineage, in particular for the first time, with the precise role of the lineage specific substitutions.

The ML and NJ gene phylogenies of synuclein family revealed that SNCA is a sarcopterygian specific gene and thus pinpoints its functional specificity to this class. The three putative paralogs of this family form the (SNCA,SNCB)(SNCG) topology indicating that SNCA and SNCB are closely related duplicate genes. Both of these paralogs share 63% identity which is also depicted in their cellular localization and function¹⁷ (Fig. 1, see Supplementary Fig. S1). For instance, SNCA and SNCB are expressed predominately in brain, particularly enriched at presynaptic terminals performing membrane associated processes⁹. SNCG share 53% sequence identity with SNCA¹⁷. The divergent phylogenetic positioning of SNCG might account for differences in the functional aspects of this protein and its putative paralogous counter parts in vertebrates⁹. Blast searches complemented by phylogenetic data confirm the absence of ortholog of this family in invertebrates.

Branching pattern of the phylogenetic tree of synuclein family revealed that the two putative paralogs, SNCA and SNCB are evolving relatively at a slower rate as compared to SNCG. This finding led us to examine the molecular evolution of SNCA and its putative paralogous genes (SNCB and SNCG) specifically in sarcopterygian lineage. For this purpose the average K_a (dN) and K_s (dS) values were calculated within different phylogenetic groups of sarcopterygians. Estimation of statistical significance of difference between average K_a and K_s (z-test) within each phylogenetic groups of sarcopterygians have shown signature of strong negative selection on SNCA and its putative paralogous copies (SNCB & SNCG) (see Supplementary Table S1, S2, S3) which corresponds to the structural and functional constrains on synuclein family. Particularly, the selective constraints on SNCA (being a major causative player in FPD) during sarcopterygians evolution is also depicted in highly conserved domain organization among its orthologous and paralogous proteins. Surveying domain topologies revealed highly preserved domain features: N-terminal A₂ alpha helix lipid binding domain, hydrophobic NAC domain and an acidic C-terminal domain. Explicitly, the N-terminus was identified as highly preserved among the analyzed orthologs and paralogs. This finding has directed this study to inspect the role of this region in defining the spatial constraints during the structural evolution of SNCA protein (Figs 2a,b).

Comparative structural analysis of human SNCA with its ancestral and mutant proteins and also with its paralogous proteins in human (SNCB and SNCG) revealed significant structural deviations encompassing region 32 to 58 of N-terminal lipid binding domain of SNCA, despite of its high sequence conservation (Figs 3 and 4, Table 2, see Supplementary Figs S2A,B, S4A,B and S5A,B). The destabilization of SNCA observed through structural evolution can be best explained by speculating that such intrinsic disordered proteins undergo transitions to more ordered states upon binding to their targets (Table 1). So it can be assumed that lineage specific substitutions (associated with putative conformational remodeling and functional diversification) occurred at the expense of protein stability. These substitutions which were confined to the C-terminal have actually regulated the structural dynamics of the identified N-terminal region (32–58) which is termed as “critical region” of SNCA by the phenomena called as “epistasis”. The interaction study conducted between SNCA and coiled-coil domain of SNCAIP revealed that the critical region harbors crucial interaction sites which enlightened the vital role of this region in normal cellular as well as disease processes (Fig. 5a–d). The functional significance of this identified region was further advocated by various evidences reported previously. Rasia et al.⁴⁷, stated an hypothesis that Cu(II) binding perturbs N-terminal long range interactions that are critical for stabilizing a soluble native like conformation of SNCA. Region 49–52 was reported as the strongest potential site for copper binding according to this study⁴⁷. Recently, it was reported that interface for dopamine mediated SNCA dimerisation encompassed region 43–60 of SNCA⁴⁸. These evidences strengthen our stated hypothesis that the critical region is not only performing its role in structural remodeling but also harbors crucial interaction sites for protein-protein or protein-metal interactions. It has been identified that SNCA mutants have a greater propensity to interact with and penetrate the lipid membrane through the domain from 36 to 45. This interaction causes an increase in SNCA oligomerization and toxicity which reinforces our assumption of the crucial role of this critical region in FPD pathogenesis¹¹.

Although the normal cellular function and the metabolic pathways in which SNCA is involved are quite ambiguous, it has been reported that oxidative stress and proteasome inhibition are implicated in PD and other neurodegenerative disorders^49,50. Apoptosis can be caused by proteasome inhibition and increased intracellular reactive oxygen species (ROS) levels can cause DNA and protein damage leading to cell death⁵¹. Various reports have shown that both proteasome inhibition and ROS can also trigger mitochondrial and endoplasmic reticulum (ER) stress cell death pathways⁵². PD patients have been observed to show increased levels of oxidative damage to DNA, lipids and proteins⁵³. It has been accounted that oxidative stress increases SNCA aggregation. Oxidatively modified SNCA is more prone to aggregation than native protein. It has been identified that A53T mutant of SNCA causes increase in ER stress and elevate caspase-3, caspase-9 and caspase-12 activity⁵⁴. A53T also causes mitochondrial depolarization resulting in accumulation of cytochrome c in cytosol leading to cell death⁵⁴. Intriguingly, this human deleterious mutation (A53T) was discovered as a wild type in other sarcopterygian lineages analyzed i.e. non-hominoids, non-primate placental mammals, non-mammalian tetrapods (Table 1). These findings led us to hypothesize that the identified critical region plays a crucial mechanistic role in normal cellular functioning of SNCA. Furthermore, the reason behind observing “parkinsonism” only in humans can be attributed to human specific oxidative challenges and/or compensatory evolution in non-catarrhini sarcopterygians.

Conclusion

Keeping in view the indispensable role of SNCA in the neurodegenerative processes, it is concluded that there are selective forces at work in sarcopterygians regulating the molecular and cellular mechanisms of SNCA. If this is the case, then fine tuning of these mechanisms through subtle changes in protein activity might be one of the contributing factors in bringing vital evolutionary changes to match the different environmental and ecological needs. The result of present study provides evidence that during the course of evolution, the region encompassed 32 to 58 of N-terminal lipid binding domain has acquired its critical significance in normal cellular function of SNCA and disease pathogenesis. The epistatic influence of the lineage specific substitutions might have led to the structural remodeling and functional innovation of SNCA. i.e., any mutation in this critical region is deleterious. These findings pave the way to investigate further the vital role of identified critical region of SNCA in different interaction studies and also with drug discovery perspective to target this region for the treatment of FPD.

Additional Information

How to cite this article: Siddiqui, I. J. et al. The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication. Sci. Rep. 6, 24475; doi: 10.1038/srep24475 (2016).

References

Bisaglia, M., Mammi, S. & Bubacco, L. Structural insights on physiological functions and pathological effects of α-synuclein. The FASEB Journal 23, 329–340 (2009).
Article CAS Google Scholar
Vilar, M. et al. The fold of α-synuclein fibrils. Proceedings of the National Academy of Sciences 105, 8637–8642 (2008).
Article CAS ADS Google Scholar
Kruger, R. et al. Ala30Pro mutation in the gene encoding α-synuclein in Parkinson’s disease. Nat Genet 18, 106–108 (1998).
Article CAS Google Scholar
Zarranz, J. J. et al. The new mutation, E46K, of α-synuclein causes parkinson and Lewy body dementia. Annals of neurology 55, 164–173 (2004).
Article CAS Google Scholar
Silke, A. C. et al. Alpha synuclein p. H50Q, a novel pathogenic mutation for Parkinson’s disease. Movement Disorders 28, 811–813 (2013).
Article Google Scholar
Lesage, S. et al. G51D α synuclein mutation causes a novel Parkinsonian–pyramidal syndrome. Annals of neurology 73, 459–471 (2013).
Article CAS Google Scholar
Polymeropoulos, M. H. et al. Mutation in the α-synuclein gene identified in families with Parkinson’s disease. science 276, 2045–2047 (1997).
Article CAS Google Scholar
Hashimoto, M. & Masliah, E. Alpha synuclein in Lewy Body Disease and Alzheimer’s Disease. Brain pathology 9, 707–720 (1999).
Article CAS Google Scholar
George, J. M. The synucleins. Genome Biol 3(3002). 3001-3002.3006 (2002).
Dettmer, U. et al. Parkinson-causing α-synuclein missense mutations shift native tetramers to monomers as a mechanism for disease initiation. Nature Communications 6, doi: 10.1038/ncomms8314 (2015).
Tsigelny, I. F. et al. Molecular Determinants of α-Synuclein Mutants’ Oligomerization and Membrane Interactions. ACS Chemical Neuroscience 6, 403–416 (2015).
Article CAS Google Scholar
Recasens, A. & Dehay, B. Alpha-synuclein spreading in Parkinson’s disease. Front Neuroanat 8, doi: 10.3389/fnana.2014.00159 (2014).
Lücking, C. & Brice, A. Alpha-synuclein and Parkinson’s disease. Cellular and Molecular Life Sciences CMLS 57, 1894–1908 (2000).
Article Google Scholar
da Costa, C. A., Ancolio, K. & Checler, F. Wild-type but not Parkinson’s disease-related ala-53 → Thr mutant α-synuclein protects neuronal cells from apoptotic stimuli. Journal of Biological Chemistry 275, 24065–24069 (2000).
Article CAS Google Scholar
da Costa, C. A., Paitel, E., Vincent, B. & Checler, F. α-Synuclein Lowers p53-dependent Apoptotic Response of Neuronal Cells ABOLISHMENT BY 6-HYDROXYDOPAMINE AND IMPLICATION FOR PARKINSON′ S DISEASE. Journal of Biological Chemistry 277, 50980–50984 (2002).
Article Google Scholar
Tang, Y., Zhao, W., Chen, Y., Zhao, Y. & Gu, W. Acetylation is indispensable for p53 activation. Cell 133, 612–626 (2008).
Article CAS Google Scholar
Cunningham, F. et al. Ensembl 2015. Nucleic acids research 43, 662–669 (2015). URL http://asia.ensembl.org/index.html.
Article Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. Journal of molecular biology 215, 403–410 (1990).
Article CAS Google Scholar
Saitou, N. & Nei, M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular biology and evolution 4, 406–425 (1987).
CAS PubMed Google Scholar
Russo, C. A. Efficiencies of different statistical tests in supporting a known vertebrate phylogeny. Molecular biology and evolution 14, 1078–1080 (1997).
Article CAS Google Scholar
Felsenstein, J. Confidence Limits on Phylogenies: An Approach Using the Bootstrap. Evolution 39, 783–791 (1985).
Article Google Scholar
Whelan, S. & Goldman, N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Molecular biology and evolution 18, 691–699 (2001).
Article CAS Google Scholar
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: molecular evolutionary genetics analysis version 6.0. Molecular biology and evolution 30, 2725–2729 (2013). URL http://www.megasoftware.net/.
Article CAS Google Scholar
Goldman, N. & Yang, Z. A codon-based model of nucleotide substitution for protein-coding DNA sequences. Molecular biology and evolution 11, 725–736 (1994).
CAS PubMed Google Scholar
Du, H. N. et al. A peptide motif consisting of glycine, alanine, and valine is required for the fibrillization and cytotoxicity of human α-synuclein. Biochemistry 42, 8870–8878 (2003).
Article CAS Google Scholar
Uverskya, V. N. & Finka, A. L. Amino acid determinants of alpha synuclein aggregation: putting together pieces of the puzzle. FEBS Letters 522, 9–13 (2002).
Article Google Scholar
Thomopson, J., Higgins, D. G. & Gibson, T. ClustalW. Nucleic Acids Res 22, 4673–4680 (1994).
Article Google Scholar
Grantham, R. Amino acid difference formula to help explain protein evolution. science 185, 862–864 (1974).
Article CAS ADS Google Scholar
Betts, M. J. & Russell, R. B. Amino acid properties and consequences of substitutions. Bioinformatics for geneticists 317, 289–298 (2003).
Article Google Scholar
Ulmer, T. S., Bax, A., Cole, N. B. & Nussbaum, R. L. Structure and dynamics of micelle-bound human α-synuclein. Journal of Biological Chemistry 280, 9595–9603 (2005).
Article CAS Google Scholar
Webb, B. & Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr Protoc Bioinformatics 47, doi: 10.1002/0471250953.bi0506s47 (2014). URL http://salilab.org/modeller/.
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. Journal of computational chemistry 25, 1605–1612 (2004). URL http://www.cgl.ucsf.edu/chimera/.
Article CAS Google Scholar
Sheik, S., Sundararajan, P., Hussain, A. & Sekar, K. Ramachandran plot on the web. Bioinformatics 18, 1548–1549 (2002).
Article CAS Google Scholar
Colovos, C. & Yeates, T. O. Verification of protein structures: patterns of nonbonded atomic interactions. Protein Science 2, 1511–1519 (1993).
Article CAS Google Scholar
Faraggi, E., Zhang, T., Yang, Y., Kurgan, L. & Zhou, Y. SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles. Journal of computational chemistry 33, 259–267 (2012). URL http://sparks.informatics.iupui.edu/.
Article CAS Google Scholar
Cheng, J., Randall, A. & Baldi, P. Prediction of protein stability changes for single‐site mutations using support vector machines. Proteins: Structure, Function, and Bioinformatics 62, 1125–1132 (2006).
Article CAS Google Scholar
Comeau, S. R., Gatchell, D. W., Vajda, S. & Camacho, C. J. ClusPro: a fully automated algorithm for protein–protein docking. Nucleic acids research 32, 96–99 (2004). URL http://cluspro.bu.edu/.
Article Google Scholar
Xie, Y. Y. et al. Interaction with synphilin-1 promotes inclusion formation of α-synuclein: mechanistic insights and pathological implication. The FASEB Journal 24, 196–205 (2010).
Article Google Scholar
Wallace, A. C., Laskowski, R. A. & Thornton, J. M. LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions. Protein engineering 8, 127–134 (1995). URL http://www.ebi.ac.uk/thornton-srv/software/LIGPLOT/.
Article CAS Google Scholar
DeLano, W. L. The PyMOL molecular graphics system. (2002).
Abbasi, A. A. Molecular evolution of HR, a gene that regulates the postnatal cycle of the hair follicle. Scientific Reports 1, doi: 10.1038/srep00032 (2011).
Abbasi, A. A., Goode, D. K., Amir, S. & Grzeschik, K. H. Evolution and functional diversification of the GLI family of transcription factors in vertebrates. Evolutionary bioinformatics online 5, 5–13 (2009).
CAS PubMed PubMed Central Google Scholar
Binolfi, A. et al. Interaction of α-synuclein with divalent metal ions reveals key differences: A link between structure, binding specificity and fibrillation enhancement. Journal of the American Chemical Society 128, 9893–9901 (2006).
Article CAS Google Scholar
Breydo, L., Wu, J. W. & Uversky, V. N. Αlpha synuclein misfolding and Parkinson’s disease. Biochimica et Biophysica Acta (BBA)-Molecular Basis of Disease 1822, 261–285 (2012).
Article CAS Google Scholar
Serpell, L. C., Berriman, J., Jakes, R., Goedert, M. & Crowther, R. A. Fiber diffraction of synthetic α-synuclein filaments shows amyloid-like cross-β conformation. Proceedings of the National Academy of Sciences 97, 4897–4902 (2000).
Article CAS ADS Google Scholar
Ribeiro, C. S., Carneiro, K., Ross, C. A., Menezes, J. R. & Engelender, S. Synphilin-1 is developmentally localized to synaptic terminals, and its association with synaptic vesicles is modulated by α-synuclein. Journal of Biological Chemistry 277, 23927–23933 (2002).
Article CAS Google Scholar
Rasia, R. M. et al. Structural characterization of copper (II) binding to α-synuclein: Insights into the bioinorganic chemistry of Parkinson’s disease. Proceedings of the National Academy of Sciences of the United States of America 102, 4294–4299 (2005).
Article CAS ADS Google Scholar
Leong, S. L. et al. The N-Terminal Residues 43 to 60 Form the Interface for Dopamine Mediated α-Synuclein Dimerisation. Plos one 10, e0116497 (2015).
Article Google Scholar
Zhang, Y., Dawson, V. L. & Dawson, T. M. Oxidative stress and genetics in the pathogenesis of Parkinson’s disease. Neurobiology of disease 7, 240–250 (2000).
Article CAS Google Scholar
Ischiropoulos, H. & Beckman, J. S. Oxidative stress and nitration in neurodegeneration: cause, effect, or association? Journal of Clinical Investigation 111, 163–169 (2003).
Article CAS Google Scholar
Friedman, J. & Xue, D. To live or die by the sword: the regulation of apoptosis by the proteasome. Developmental cell 6, 460–461 (2004).
Article CAS Google Scholar
Hayashi, T. et al. Oxidative damage to the endoplasmic reticulum is implicated in ischemic neuronal cell death. Journal of Cerebral Blood Flow & Metabolism 23, 1117–1128 (2003).
Article CAS Google Scholar
Hald, A. & Lotharius, J. Oxidative stress and inflammation in Parkinson’s disease: is there a causal link? Experimental neurology 193, 279–290 (2005).
Article CAS Google Scholar
Smith, W. W. et al. Endoplasmic reticulum stress and mitochondrial cell death pathways mediate A53T mutant alpha-synuclein-induced toxicity. Human molecular genetics 14, 3801–3811 (2005).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by Higher Education Commission (HEC) of Pakistan and the National Center for Bioinformatics, Quaid-i-Azam University, Islamabad.

Author information

Authors and Affiliations

National Center for Bioinformatics, Program of Comparative and Evolutionary Genomics, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, 45320, Pakistan
Irum Javaid Siddiqui, Nashaiman Pervaiz & Amir Ali Abbasi

Authors

Irum Javaid Siddiqui
View author publications
You can also search for this author in PubMed Google Scholar
Nashaiman Pervaiz
View author publications
You can also search for this author in PubMed Google Scholar
Amir Ali Abbasi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.A.A. conceived the project and designed the experiments. I.J.S. performed the experiments. A.A.A., I.J.S. and N.P. analyzed the data. A.A.A., I.J.S. and N.P. wrote the paper.

Corresponding author

Correspondence to Amir Ali Abbasi.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 2828 kb)

Supplementary Data File (DOC 44 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Siddiqui, I., Pervaiz, N. & Abbasi, A. The Parkinson Disease gene SNCA: Evolutionary and structural insights with pathological implication. Sci Rep 6, 24475 (2016). https://doi.org/10.1038/srep24475

Download citation

Received: 11 October 2015
Accepted: 30 March 2016
Published: 15 April 2016
DOI: https://doi.org/10.1038/srep24475

This article is cited by

Personalized Protein-Protein Interaction Networks Towards Unraveling the Molecular Mechanisms of Alzheimer’s Disease
- Betül CEYLAN
- Elif DÜZ
- Tunahan ÇAKIR
Molecular Neurobiology (2024)
MLKL deficiency alleviates neuroinflammation and motor deficits in the α-synuclein transgenic mouse model of Parkinson’s disease
- Lu Geng
- Wenqing Gao
- Jixi Li
Molecular Neurodegeneration (2023)
Intramolecular interaction kinetically regulates fibril formation by human and mouse α-synuclein
- Takashi Ohgita
- Hiroki Kono
- Hiroyuki Saito
Scientific Reports (2023)
Ethnicity- and sex-specific genome wide association study on Parkinson’s disease
- Kye Won Park
- Ho-Sung Ryu
- Sun Ju Chung
npj Parkinson's Disease (2023)
Studying the effect of alpha-synuclein and Parkinson’s disease linked mutants on inter pathway connectivities
- Sagnik Sen
- Ashmita Dey
- Ujjwal Maulik
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.