Skip to main content

Fast Similarity Search for Protein 3D Structure Databases Using Spatial Topological Patterns

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3180))

Abstract

It becomes too expensive computationally to compare a query protein with protein structures in a 3D structure databases for determining their similarity. Therefore, we emphasize that solving structural similarity search is to develop fast structure comparison algorithms. We propose a new method for comparing the structural similarity in protein structure databases with a given query protein by using topological pattern of proteins. In our approach, the geometry of SSEs (Secondary Structure Elem ents) is represented by spatial data types and indexed using an Rtree. We discover topological patterns of SSEs in 3D space using 9IM topological relations ac celerated by Rtree index join to all the structures in 3D structure databases. A similarity search algorithm compares topological patterns of a query protein with those of proteins in the structure database. Experimental re sults show that execution time of our method is 3 times faster than DALITE while keeping the accuracy similar. This study identifies that similarity search based on spatial databases can find the similar structures rapidly and generate smal l candidate sets for the generalized alignment tools such as DALI and SSAP.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Park, S.H., Ryu, K.H., Son, H.S.: Modeling Protein Structures with Spatial Model for Structure Comparison. In: Liu, J., Cheung, Y.-m., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 490–497. Springer, Heidelberg (2003)

    Google Scholar 

  2. Camoglu, O., Kahveci, T., Singh, A.K.: PSI: indexing protein structures for fast similarity search. Bioinformatics 19, 81i-83i (2003)

    Google Scholar 

  3. Oracle Spatial User’s Guide and Reference: Loading and Indexing Spatial Object Types. Rclcasc8.1.5, Oracle (2001)

    Google Scholar 

  4. Holm, L., Park, J.: DaliLite workbench for protein structure comparison. Bioinformatics 16, 566–567 (2000)

    Article  Google Scholar 

  5. Holm, L., Sander, C.: Proein structure comparison by alignment of distance matrices. J. of Molecular Biology 233, 123–138 (1993)

    Article  Google Scholar 

  6. Holm, L., Sander, C.: Proein structure comparison by alignment of distance matrices. J. of Molecular Biology 233, 123–138 (1993)

    Article  Google Scholar 

  7. Clementini, E., Felice, P., van Oostrom, P.: A small set of formal topological relationships suitable for end-user interaction. In: Proc. of Spatial Databases Symp., Singapore, pp. 277–295 (1993)

    Google Scholar 

  8. Singh, A.P., Brutlag, D.L.: Protein Structure Alignment: A Comparison of Methods, Dept. of Biochemistry, Stanford Univ (2000)

    Google Scholar 

  9. Gibrat, J.-F., Madej, T., Bryant, H.: Surprising similarities in structure comparison. Current Opinion in Structural Biology 6, 377–385 (1996)

    Article  Google Scholar 

  10. Martin, A.: The ups and downs of protein topology: rapid comparison of protein structure. Protein Engineering 13, 829–837 (2000)

    Article  Google Scholar 

  11. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data bank. J. Nucleic Acids Research 28, 235–242 (2000)

    Article  Google Scholar 

  12. Rigaus, P., School, M., Voisard, A.: Spatial Databases with application to G1S, pp. 29–61. Academic Press, San Diego (2002)

    Google Scholar 

  13. Higgins, D., Tailor, W.: Bioinfomatics: Sequence, Structure, and databanks, 1st edn. Oxford University Press, New York (2000)

    Google Scholar 

  14. Witten, I.H., Moffat, A., Bell, T.C.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Morgan Kaufmann Publishing, San Francisco (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, SH., Ryu, K.H. (2004). Fast Similarity Search for Protein 3D Structure Databases Using Spatial Topological Patterns. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_74

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30075-5_74

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22936-0

  • Online ISBN: 978-3-540-30075-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics