Skip to main content
Log in

Sequence analysis corresponding to the PPE and PE proteins inMycobacterium tuberculosis and other genomes

  • Articles
  • Published:
Journal of Biosciences Aims and scope Submit manuscript

Abstract

Amino acid sequence analysis corresponding to the PPE proteins in H37Rv and CDC 1551 strains of theMycobacterium tuberculosis genomes resulted in the identification of a previously uncharacterized 225 amino acid-residue common region in 22 proteins. The pairwise sequence identities were as low as 18%. Conservation of amino acid residues was observed at fifteen positions that were distributed over the whole length of the region. The secondary structure corresponding to this region is predicted to be a mixture of a-helices and β-strands. Although the function is not known, proteins with this region specific to mycobacterial species may be associated with a common function. We further observed another group of 20 PPE proteins corresponding to the conserved C-terminal region comprising 44 amino acid residues with GFxGT and PxxPxxW sequence motifs. This region is preceded by a hydrophobic region, comprising 40–100 amino acid residues, that is flanked by charged amino acid residues. Identification of conserved regions described above may be useful to detect related proteins from other genomes and assist the design of suitable experiments to test their corresponding functions. Amino acid sequence analysis corresponding to the PE proteins resulted in the identification of tandem repeats comprising 41-43 amino acid residues in the C-terminal variable regions in two PE proteins (Rv0978 and Rv0980). These correspond to the AB repeats that were first identified in some proteins of theMethanosarcina mazei genome, and were demonstrated as surface antigens. We observed the AB repeats also in several other proteins of hitherto uncharacterized function inArchaea andBacteria genomes. Some of these proteins are also associated with another repeat called the C-repeat or the PKD-domain comprising 85 amino acid residues. The secondary structure corresponding to the AB repeat is predicted mainly as 4 β-strands. We suggest that proteins with AB repeats inMycobacterium tuberculosis and other genomes may be associated as surface antigens. TheM. leprae genome, however, does not contain either the AB or C-repeats and different proteins may therefore be recruited as surface antigens in theM. leprae genome compared to theM. tuberculosis genome.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Altschul S F, Gish W, Miller W, Myers E W and Lipman D J 1990 Basic local alignment search tool;J. Mol. Biol. 215 403–410

    PubMed  CAS  Google Scholar 

  • Altschul S F, Madden T L, Schäffer A A, Zhang J, Zhang Z, Miller W and Lipman D J 1997 Gapped BLAST and PSI-BLAST: a new generation of protein database search programs;Nucleic Acids Res. 25 3389–3402

    Article  PubMed  CAS  Google Scholar 

  • Bateman A, Birney E, Durbin R, Eddy S R, Howe K L and Sonnhammer E L 2000 The Pfam protein families database;Nucleic Acids Res. 28 263–266

    Article  PubMed  CAS  Google Scholar 

  • Bycroft M, Bateman A, Clarke J, Hamill S J, Sandford R, Thomas R L and Chothia C 1999 The structure of a PKD domain from polycystin-1: implications for polycystic kidney disease;EMBO J. 18 297–305

    Article  PubMed  CAS  Google Scholar 

  • Cole S Tet al 1998 Deciphering the biology ofMycobacterium tuberculosis from the complete genome sequence;Nature (London) 393 537–544

    Article  CAS  Google Scholar 

  • Cole S Tet al 2001 Massive gene decay in the leprosy bacillus;Nature (London) 409 1007–1011

    Article  CAS  Google Scholar 

  • Conway de Macario E and Macario A J L 1986 Immunology of Archea-bacteria: identification, antigenic relationships and immunochemistry of surface structures;Syst. Appl. Microbiol. 7 320–324

    Google Scholar 

  • Dybvig K 1993 DNA rearrangements and phenotypic switching in prokaryotes;Mol. Microbiol. 10 465–471

    Article  PubMed  CAS  Google Scholar 

  • Fleischmann R Det al 2001 Whole genome comparison ofMycobacterium tuberculosis clinical and laboratory strains; http://www.ncbi.nlm.nih.gov/cgi-bin/Entrez/

  • Galagan J Eet al 2002 The genome ofM. acetivorans reveals extensive metabolic and physiological diversity;Genome Res. 12 532–542

    Article  PubMed  CAS  Google Scholar 

  • Henning H, Fiona L and Rolf A 2003 CCP11 newsletter; http:// wserv1.dl.ac.uk/CCP/CCP1 1/newsletter/vol2_3/sptr.html

  • Kehoe M 1994 Cell wall associated proteins in Gram-positive bacteria; inBacterial cell wall, new comprehensive biochemistry (eds) J M Ghuysen and R Hakenbeck (New York: Elsevier) vol. 27, pp 217–261

    Google Scholar 

  • Lupas A, Englehardt H, Peters J, Santarius U, Volker S and Baumeister W 1994 Domain structure of theAcetogenium kivui surface layer revealed by electron crystallography and sequence analysis;J. Bacteriol. 176 1224–1233

    PubMed  CAS  Google Scholar 

  • Lemaire M, Ohayon H, Gounnon P, Fujino T and Beguin P 1995 OlpB. A new outer layer protein ofClostridium thermocellum and binding of its S-layer-like domains to components of the cell envelope;J. Bacteriol. 177 2451–2459

    PubMed  CAS  Google Scholar 

  • Matuschek M, Sahm K, Zibat A and Bahl H 1996 Characterization of genes fromThermoanaerobacterium thermosulfurigenes EM1 that encode two glycosyl hydrolases with conserved S-layer-like domains;Mol. Gen. Genet. 252 493–496

    PubMed  CAS  Google Scholar 

  • Mayerhofer L E, de Macario E C and Macario A J L 1995 Conservation and variability in Archaea: Protein antigens with tandem repeats encoded by a cluster of genes with common motifs inMethanosarcina mazei S-6;Gene 165 87–91

    Article  PubMed  CAS  Google Scholar 

  • Rost B, Sander C and Schneider R 1994 PHD — an automatic mail server for protein secondary structure prediction;CABIOS 10 53–60

    PubMed  CAS  Google Scholar 

  • Schaftenaar G, Cuelenaere K, Noordik J H and Etzold T 1996 A Tcl-based SRS v. 4 interface;Comput. Appl. Biosci. 12 151–155

    PubMed  CAS  Google Scholar 

  • Slack F J and Ruvkun G 1998 A novel repeat domain that is often associated with RING finger and B-box motifs;Trends Biochem. Sci. 23 474–475

    Article  PubMed  CAS  Google Scholar 

  • Thompson J D, Higgins D G and Gibson T J 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice;Nucleic Acids Res. 22 4673–4680

    Article  PubMed  CAS  Google Scholar 

  • Vega Lopez F, Brooks L A, Dockrell H M, De Smet K A, Thompson J K, Hussain R and Stoker N G 1993 Sequence and immunological characterization of a serine-rich antigen fromMycobacterium leprae;Infect. Immun. 61 2145–2153

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Adindla, S., Guruprasad, L. Sequence analysis corresponding to the PPE and PE proteins inMycobacterium tuberculosis and other genomes. J Biosci 28, 169–179 (2003). https://doi.org/10.1007/BF02706216

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02706216

Keywords

Navigation