research-article

Efficient RkNN retrieval with arbitrary non-metric similarity measures

Published: 01 September 2010

Abstract

An RkNN (reverse k-nearest-neighbor) query returns all objects whose k nearest neighbors contain the query object. In this paper, we consider RkNN query processing in the case where the distances between attribute values are not necessarily metric. Dissimilarities between objects can then be a monotonic aggregate of the dissimilarities between their attribute values, with the aggregation function specified at query time. We outline real-world cases that motivate RkNN processing in such scenarios. We consider the AL-Tree index and its applicability to RkNN query processing, and we develop an approach that exploits the group-level reasoning that the AL-Tree enables. We evaluate our approach against a naive approach that performs sequential scans on contiguous data and against an improved block-based approach that we provide. Our experiments use real-world datasets and synthetic data with varying characteristics. This extensive empirical evaluation shows that our approach is better than existing methods in terms of computational and disk access costs, leading to significantly better response times.
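To make the query definition concrete, the following is a minimal brute-force sketch of RkNN over per-attribute dissimilarity tables. It is not the AL-Tree approach described in the abstract; it only illustrates the query semantics in the spirit of a sequential-scan baseline. All names (`table_dissim`, `dissimilarity`, `naive_rknn`), the toy data, and the tie-handling rule (ties are resolved in favor of the query object) are assumptions made for illustration.

```python
from typing import Callable, Dict, FrozenSet, List, Sequence

Obj = Sequence[str]
AttrDissim = Callable[[str, str], float]
Aggregate = Callable[[List[float]], float]


def table_dissim(table: Dict[FrozenSet[str], float]) -> AttrDissim:
    """Build a symmetric per-attribute dissimilarity from a lookup table.
    The table need not satisfy the triangle inequality."""
    def d(x: str, y: str) -> float:
        return 0.0 if x == y else table[frozenset((x, y))]
    return d


def dissimilarity(a: Obj, b: Obj, attr_dissims: Sequence[AttrDissim],
                  aggregate: Aggregate) -> float:
    """Aggregate the attribute-level dissimilarities between two objects."""
    return aggregate([d(x, y) for d, x, y in zip(attr_dissims, a, b)])


def naive_rknn(query: Obj, data: Sequence[Obj], k: int,
               attr_dissims: Sequence[AttrDissim],
               aggregate: Aggregate) -> List[int]:
    """Return indices of objects that have `query` among their k nearest
    neighbors. Quadratic scan; ties count in favor of the query (assumption)."""
    result = []
    for i, obj in enumerate(data):
        d_q = dissimilarity(obj, query, attr_dissims, aggregate)
        # Count the other data objects strictly closer to obj than the query.
        closer = sum(
            1 for j, other in enumerate(data)
            if j != i
            and dissimilarity(obj, other, attr_dissims, aggregate) < d_q
        )
        if closer < k:
            result.append(i)
    return result


# Toy tables: red-yellow (0.9) exceeds red-orange + orange-yellow (0.2),
# so the color dissimilarity is non-metric; size behaves the same way.
color = table_dissim({
    frozenset(("red", "orange")): 0.1,
    frozenset(("orange", "yellow")): 0.1,
    frozenset(("red", "yellow")): 0.9,
})
size = table_dissim({
    frozenset(("small", "medium")): 0.2,
    frozenset(("medium", "large")): 0.2,
    frozenset(("small", "large")): 1.0,
})
attr_dissims = [color, size]

data = [
    ("red", "small"),
    ("orange", "small"),
    ("yellow", "large"),
    ("yellow", "medium"),
]
query = ("orange", "medium")

# The aggregate is chosen at query time; sum and max are both monotonic.
print(naive_rknn(query, data, 1, attr_dissims, sum))  # [3]
print(naive_rknn(query, data, 1, attr_dissims, max))  # [2, 3]
```

In this toy data, switching the aggregate from `sum` to `max` changes the answer for the same query and the same k. This suggests why a structure built around one fixed distance function may not fit the setting, although the sketch itself says nothing about how the AL-Tree addresses it.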



Published in

Proceedings of the VLDB Endowment, Volume 3, Issue 1-2
September 2010
1658 pages

Publisher: VLDB Endowment
