Article
Conservation of Intrinsic Disorder in Protein Domains and Families: I. A Database of Conserved Predicted Disordered Regions
Purdue University, Indianapolis, Indiana 46202, and Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine.
Molecular Kinetics, Inc.
School of Informatics, Indiana University
Purdue University.
To whom correspondence should be addressed. Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, 714 N. Senate St., Suite 250, Indianapolis, IN 46202. Phone: (317) 278-9650. Fax: (317) 278-9217. E-mail: vuversky@iupui.edu (V.N.U.) or kedunker@iupui.edu (A K.D).
Institute for Biological Instrumentation, Russian Academy of Sciences.
Abstract

Many protein regions have been shown to be intrinsically disordered, lacking unique structure under physiological conditions. These intrinsically disordered regions are not only very common in proteomes, but also crucial to the function of many proteins, especially those involved in signaling, recognition, and regulation. The goal of this work was to identify the prevalence, characteristics, and functions of conserved disordered regions within protein domains and families. A database was created to store the amino acid sequences of nearly one million proteins and their domain matches from the InterPro database, a resource integrating eight different protein family and domain databases. Disorder prediction was performed on these protein sequences. Regions of sequence corresponding to domains were aligned using a multiple sequence alignment tool. From this initial information, regions of conserved predicted disorder were found within the domains. The methodology for this search consisted of finding regions of consecutive positions in the multiple sequence alignments in which a 90% or more of the sequences were predicted to be disordered. This procedure was constrained to find such regions of conserved disorder prediction that were at least 20 amino acids in length. The results of this work included 3653 regions of conserved disorder prediction, found within 2898 distinct InterPro entries. Most regions of conserved predicted disorder detected were short, with less than 10% of those found exceeding 30 residues in length.
Keywords: intrinsic disorder • protein structure−function • disorder prediction • PONDR
View: Full Text HTML | Hi-Res PDF
Article Tools
History
- Published In Issue April 07, 2006
- Received February 15, 2006
Cart



