Abstract
It becomes too expensive computationally to compare a query protein with protein structures in a 3D structure databases for determining their similarity. Therefore, we emphasize that solving structural similarity search is to develop fast structure comparison algorithms. We propose a new method for comparing the structural similarity in protein structure databases with a given query protein by using topological pattern of proteins. In our approach, the geometry of SSEs (Secondary Structure Elem ents) is represented by spatial data types and indexed using an Rtree. We discover topological patterns of SSEs in 3D space using 9IM topological relations ac celerated by Rtree index join to all the structures in 3D structure databases. A similarity search algorithm compares topological patterns of a query protein with those of proteins in the structure database. Experimental re sults show that execution time of our method is 3 times faster than DALITE while keeping the accuracy similar. This study identifies that similarity search based on spatial databases can find the similar structures rapidly and generate smal l candidate sets for the generalized alignment tools such as DALI and SSAP.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Park, S.H., Ryu, K.H., Son, H.S.: Modeling Protein Structures with Spatial Model for Structure Comparison. In: Liu, J., Cheung, Y.-m., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 490–497. Springer, Heidelberg (2003)
Camoglu, O., Kahveci, T., Singh, A.K.: PSI: indexing protein structures for fast similarity search. Bioinformatics 19, 81i-83i (2003)
Oracle Spatial User’s Guide and Reference: Loading and Indexing Spatial Object Types. Rclcasc8.1.5, Oracle (2001)
Holm, L., Park, J.: DaliLite workbench for protein structure comparison. Bioinformatics 16, 566–567 (2000)
Holm, L., Sander, C.: Proein structure comparison by alignment of distance matrices. J. of Molecular Biology 233, 123–138 (1993)
Holm, L., Sander, C.: Proein structure comparison by alignment of distance matrices. J. of Molecular Biology 233, 123–138 (1993)
Clementini, E., Felice, P., van Oostrom, P.: A small set of formal topological relationships suitable for end-user interaction. In: Proc. of Spatial Databases Symp., Singapore, pp. 277–295 (1993)
Singh, A.P., Brutlag, D.L.: Protein Structure Alignment: A Comparison of Methods, Dept. of Biochemistry, Stanford Univ (2000)
Gibrat, J.-F., Madej, T., Bryant, H.: Surprising similarities in structure comparison. Current Opinion in Structural Biology 6, 377–385 (1996)
Martin, A.: The ups and downs of protein topology: rapid comparison of protein structure. Protein Engineering 13, 829–837 (2000)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data bank. J. Nucleic Acids Research 28, 235–242 (2000)
Rigaus, P., School, M., Voisard, A.: Spatial Databases with application to G1S, pp. 29–61. Academic Press, San Diego (2002)
Higgins, D., Tailor, W.: Bioinfomatics: Sequence, Structure, and databanks, 1st edn. Oxford University Press, New York (2000)
Witten, I.H., Moffat, A., Bell, T.C.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Morgan Kaufmann Publishing, San Francisco (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, SH., Ryu, K.H. (2004). Fast Similarity Search for Protein 3D Structure Databases Using Spatial Topological Patterns. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_74
Download citation
DOI: https://doi.org/10.1007/978-3-540-30075-5_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22936-0
Online ISBN: 978-3-540-30075-5
eBook Packages: Springer Book Archive