Abstract
Amino acid sequence analysis corresponding to the PPE proteins in H37Rv and CDC 1551 strains of the Mycobacterium tuberculosis genomes resulted in the identification of a previously uncharacterized 225 amino acid-residue common region in 22 proteins. The pairwise sequence identities were as low as 18%. Conservation of amino acid residues was observed at fifteen positions that were distributed over the whole length of the region. The secondary structure corresponding to this region is predicted to be a mixture of α-helices and β-strands. Although the function is not known, proteins with this region specific to mycobacterial species may be associated with a common function. We further observed another group of 20 PPE proteins corresponding to the conserved C-terminal region comprising 44 amino acid residues with GFxGT and PxxPxxW sequence motifs. This region is preceded by a hydrophobic region, comprising 40–100 amino acid residues, that is flanked by charged amino acid residues. Identification of conserved regions described above may be useful to detect related proteins from other genomes and assist the design of suitable experiments to test their corresponding functions. Amino acid sequence analysis corresponding to the PE proteins resulted in the identification of tandem repeats comprising 41–43 amino acid residues in the C-terminal variable regions in two PE proteins (Rv0978 and Rv0980). These correspond to the AB repeats that were first identified in some proteins of the Methanosarcina mazei genome, and were demonstrated as surface antigens. We observed the AB repeats also in several other proteins of hitherto uncharacterized function in Archaea and Bacteria genomes. Some of these proteins are also associated with another repeat called the C-repeat or the PKD-domain comprising 85 amino acid residues. The secondary structure corresponding to the AB repeat is predicted mainly as 4 β-strands.
We suggest that proteins with AB repeats in Mycobacterium tuberculosis and other genomes may be associated as surface antigens. The M. leprae genome, however, does not contain either the AB or C-repeats and different proteins may therefore be recruited as surface antigens in the M. leprae genome compared to the M. tuberculosis genome.
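The GFxGT and PxxPxxW motifs named in the abstract can be turned into a simple pattern search. The sketch below is only an illustration and is not the authors' method: it assumes that "x" stands for any single residue, the function name `find_motifs` is hypothetical, and the toy sequence is made up rather than taken from a real PPE protein.

```python
import re

# Motifs from the abstract, reading "x" as "any one residue" (an assumption).
# Each pattern is wrapped in a lookahead so overlapping matches are reported.
MOTIFS = {
    "GFxGT": re.compile(r"(?=GF.GT)"),
    "PxxPxxW": re.compile(r"(?=P..P..W)"),
}


def find_motifs(sequence):
    """Return {motif name: [0-based start positions]} for one protein sequence."""
    seq = sequence.upper()
    return {
        name: [match.start() for match in pattern.finditer(seq)]
        for name, pattern in MOTIFS.items()
    }


# Made-up toy sequence, not a real PPE protein.
toy = "MAAGFSGTLLPAAPGSWKK"
print(find_motifs(toy))  # {'GFxGT': [3], 'PxxPxxW': [10]}
```

A plain pattern match like this only flags candidate occurrences; it says nothing about whether the surrounding 44-residue region is conserved, which would need an alignment-based comparison.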
Cite this article
Adindla, S., Guruprasad, L. Sequence analysis corresponding to the PPE and PE proteins in Mycobacterium tuberculosis and other genomes. J Biosci 28, 169–179 (2003). https://doi.org/10.1007/BF02706216
DOI: https://doi.org/10.1007/BF02706216