Normalized Similarity of RNA Sequences

Backofen, Rolf; Hermelin, Danny; Landau, Gad M.; Weimann, Oren

doi:10.1007/11575832_40

Rolf Backofen¹⁸,
Danny Hermelin¹⁹,
Gad M. Landau^20,21 &
…
Oren Weimann¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3772))

Included in the following conference series:

International Symposium on String Processing and Information Retrieval

1510 Accesses
2 Citations

Abstract

We introduce a normalized version of the LCS metric as a new local similarity measure for comparing two RNAs. An \(\mathcal{O}(n^{2}m{\rm lg}m)\) time algorithm is presented for computing the maximum normalized score of two RNA sequences, where n and m are the lengths of the sequences and n ≤ m. This algorithm has the same time complexity as the currently best known global LCS algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alber, J., Gramm, J., Guo, J., Niedermeier, R.: Towards optimally solving the longest common subsequence problem for sequences with nested arc annotations in linear time. In: Apostolico, A., Takeda, M. (eds.) CPM 2002. LNCS, vol. 2373, pp. 99–114. Springer, Heidelberg (2002)
Chapter Google Scholar
Apostolico, A., Guerra, C.: The longest common subsequence problem revisited. Algorithmica 2, 315–336 (1987)
Article MATH MathSciNet Google Scholar
Arslan, A.N., Eǧecioğlu, Ö., Pevzner, P.A.: A new approach to sequence alignment: normalized sequence alignment. Bioinformatics 17(4), 327–337 (2001)
Article Google Scholar
Bille, P.: A survey on tree edit distance and related problems. Theoretical Computer Science 337, 217–239 (2005)
Article MATH MathSciNet Google Scholar
Chartrand, P., Meng, X.-H., Singer, R.H., Long, R.M.: Structural elements required for the localization of ASH1 mRNA and of a green fluorescent protein reporter particle in vivo. Current Biology 9, 333–336 (1999)
Article Google Scholar
Couzin, J.: Breakthrough of the year. Small RNAs make big splash. Science 298(5602), 2296–2297 (2002)
Article Google Scholar
Efraty, N., Landau, G.M.: Sparse normalized local alignment. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 333–346. Springer, Heidelberg (2004)
Chapter Google Scholar
Evans, P.A.: Algorithms and complexity for annotated sequence analysis. PhD thesis, University of Alberta (1999)
Google Scholar
Gramm, J., Guo, J., Niedermeier, R.: Pattern matching for arc annotated sequences. In: Agrawal, M., Seth, A.K. (eds.) FSTTCS 2002. LNCS, vol. 2556, pp. 182–193. Springer, Heidelberg (2002)
Chapter Google Scholar
Hirschberg, D.S.: Algorithms for the longest common subsequence problem. Journal of the ACM 24(4), 664–675 (1977)
Article MATH MathSciNet Google Scholar
Hunt, J.W., Szymanski, T.G.: A fast algorithm for computing longest common subsequences. Communications of the ACM 20(5), 350–353 (1977)
Article MATH MathSciNet Google Scholar
Jiang, T., Lin, G.-H., Ma, B., Zhang, K.: The longest common subsequence problem for arc-annotated sequences. In: Giancarlo, R., Sankoff, D. (eds.) CPM 2000. LNCS, vol. 1848, pp. 154–165. Springer, Heidelberg (2000)
Chapter Google Scholar
Klein, P.N.: Computing the Edit-Distance between Unrooted Ordered Trees. In: Bilardi, G., Pietracaprina, A., Italiano, G.F., Pucci, G. (eds.) ESA 1998. LNCS, vol. 1461, pp. 91–102. Springer, Heidelberg (1998)
Chapter Google Scholar
Moore, P.B.: Structural motifs in RNA. Annual review of biochemistry 68, 287–300 (1999)
Article Google Scholar
Shasha, D., Zhang, K.: Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal on Computing 18(6), 1245–1262 (1989)
Article MATH MathSciNet Google Scholar
Smith, T.F., Waterman, M.S.: The identification of common molecular subsequences. Journal of Molecular Biology 147, 195–197 (1981)
Article Google Scholar
Zhang, K.: Computing similarity between RNA secondary structures. In: Proc. of the IEEE joint symposium on Intelligence and Systems conference, pp. 126–132 (1998)
Google Scholar
Zuker, M.: On finding all suboptimal foldings of an RNA molecule. Science 244(4900), 48–52 (1989)
Article MathSciNet Google Scholar
Zuker, M., Stiegler, P.: Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Research 9(1), 133–148 (1981)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Friedrich-Schiller Universität Jena, Jena Center for Bioinformatics, Germany
Rolf Backofen
Department of Computer Science, University of Haifa, Israel
Danny Hermelin & Oren Weimann
Department of Computer Science, University of Haifa, Haifa, Israel
Gad M. Landau
Department of Computer and Information Science, Polytechnic University, New York, USA
Gad M. Landau

Authors

Rolf Backofen
View author publications
You can also search for this author in PubMed Google Scholar
Danny Hermelin
View author publications
You can also search for this author in PubMed Google Scholar
Gad M. Landau
View author publications
You can also search for this author in PubMed Google Scholar
Oren Weimann
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Toronto,
Mariano Consens
Dept. of Computer Science, University of Chile,
Gonzalo Navarro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Backofen, R., Hermelin, D., Landau, G.M., Weimann, O. (2005). Normalized Similarity of RNA Sequences. In: Consens, M., Navarro, G. (eds) String Processing and Information Retrieval. SPIRE 2005. Lecture Notes in Computer Science, vol 3772. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11575832_40

Download citation

DOI: https://doi.org/10.1007/11575832_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29740-6
Online ISBN: 978-3-540-32241-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics