skip to main content
10.1145/1060590.1060623acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article

Low distortion embeddings for edit distance

Published:22 May 2005Publication History

ABSTRACT

We show that 0,1d endowed with edit distance embeds into l1 with distortion 2O(√log dlog log d). We further show efficient implementations of the embedding that yield solutions to various computational problems involving edit distance. These include sketching, communication complexity, nearest neighbor search. For all these problems, we improve upon previous bounds.

References

  1. A. Andoni, M. Deza, A. Gupta, P. Indyk, and S. Raskhodnikova. Lower bounds for embedding edit distance into normed spaces. In Proc. SODA 2003, pages 523--526. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Z. Bar-Yossef, T.S. Jayram, R. Krauthgamer, and R. Kumar. Approximating edit distance efficiently. In Proc. FOCS 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Batu, F. Ergun, J. Kilian, A. Magen, S. Raskhodnikova, R. Rubinfeld, and R. Sami. A sublinear algorithm for weakly approximating edit distance. In Proc. STOC 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Broder, S. Glassman, M. Manasse, and G. Zweig. Syntactic clustering of the Web. In Proc. of the 6th Int'l World Wide Web Conf., 1997, pages 391--404. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Cormode and S. Muthukrishnan. The string edit distance matching problem with moves. In Proc. SODA 2002, pages 667--676. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. Cormode, M. Paterson, C.S. Sahinalp, and U. Vishkin. Communication complexity of document exchange. In Proc. SODA 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Indyk. Approximate nearest neighbor under edit distance via product metrics. In Proc. SODA 2004, pages 646--650. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Indyk and R. Motwani. Approximate nearest neighbors: towards removing the curse of dimensionality. In Proc. STOC 1998, pages 604--613. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. E. Kushilevitz, R. Ostrovsky, and Y. Rabani. Efficient search for approximate nearest neighbor in high dimensional spaces. SIAM Journal on Computing, 30(2):457--474, 2000. (Preliminary version appeared in Proc. STOC 1998.) Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V.I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Doklady Akademii Nauk SSSR, 163(4):845--848, 1965 (Russian). English translation in Soviet Physics Doklady, 10(8):707--710, 1966.Google ScholarGoogle Scholar
  11. W.J. Masek and M.S. Paterson. A faster algorithm for computing string edit distnace. Journal of Computer and Systems Sciences, 20(1):18-31, 1980.Google ScholarGoogle ScholarCross RefCross Ref
  12. S. Muthukrishnan and S.C. Sahinalp. Approximate nearest neighbors and sequence comparison with block operations. In Proc. STOC 2000, pages 416--424. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Low distortion embeddings for edit distance

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        STOC '05: Proceedings of the thirty-seventh annual ACM symposium on Theory of computing
        May 2005
        778 pages
        ISBN:1581139608
        DOI:10.1145/1060590

        Copyright © 2005 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 May 2005

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate1,469of4,586submissions,32%

        Upcoming Conference

        STOC '24
        56th Annual ACM Symposium on Theory of Computing (STOC 2024)
        June 24 - 28, 2024
        Vancouver , BC , Canada

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader