skip to main content
10.1145/775047.775143acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Non-linear dimensionality reduction techniques for classification and visualization

Published:23 July 2002Publication History

ABSTRACT

In this paper we address the issue of using local embeddings for data visualization in two and three dimensions, and for classification. We advocate their use on the basis that they provide an efficient mapping procedure from the original dimension of the data, to a lower intrinsic dimension. We depict how they can accurately capture the user's perception of similarity in high-dimensional data for visualization purposes. Moreover, we exploit the low-dimensional mapping provided by these embeddings, to develop new classification techniques, and we show experimentally that the classification accuracy is comparable (albeit using fewer dimensions) to a number of other classification procedures.

References

  1. R. Agrawal, C. Faloutsos, and A. Swami. Efficient Similarity Search in Sequence Databases. In Proc. of the 4th FODO, pages 69--84, Oct. 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Beckmann, H. Kriegel, and R. Schnei. The r * -tree: an efficient and robust access method for points and rectangles. In Proceedings of ACM SIGMOD Conference, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Bellman. Adaptive Control Processes. Princeton Univ. Press, 1961.Google ScholarGoogle ScholarCross RefCross Ref
  4. C. Bentley and M. O. Ward. Animating multidimensional scaling to visualize n-dimensional data sets. In In Proc. of lnfo Vis, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. K. Chan and A. W.-C. Fu. Efficient Time Series Matching by Wavelets. In Proc. of ICDE, pages 126--133, Mar. 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. T. Cover and P. Hart. Nearest Neighbor Pattern Classification. IEEE Trans. on Information Theory, pp. 21--27, 1967.Google ScholarGoogle ScholarCross RefCross Ref
  7. C. Domeniconi, J. Peng, and D. Gunopulos. An Adaptive Metric Machine for Pattern Classification. Advances in Neural Information Processing Systems, 2000.Google ScholarGoogle Scholar
  8. C. Faloutsos and K.-I. Lin. FastMap: A fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets. In Proc. ACM SIGMOD, pages 163--174, May 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Faloutsos, M. Ranganathan, and I. Manolopoulos. Fast Subsequence Matching in Time Series Databases. In Proceedings of ACM SIGMOD, pages 419--429, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Friedman. Flexible Metric Nearest Neighbor Classification. Tech. Report, Dept. of Statistics, Stanford University, 1994.Google ScholarGoogle Scholar
  11. T. Hastie and R. Tibshirani. Discriminant Adaptive Nearest Neighbor Classification. IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 18, No. 6, pp. 607--615, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Haykin. Neural Networks: A Comprehensive Foundation. Macmillan College Publishing Company New York, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Ho. Nearest Neighbors in Random Subspaces. Lecture Notes in Computer Science: Advances in Pattern Recognition, pp. 640--648, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Inselberg and B. Dimsdale. Parallel coordinates: A tool for visualizing multidimensional geometry. In In Proc. of IEEE Visualization, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. C. L. J. B. Tenenbaum, V. de Silva. A global geometric framework for nonlinear dimensionality reduction. Science v. 290 no. 5500, pages 2319--2323, 2000.Google ScholarGoogle Scholar
  16. I. T. Jolliffe. Principal Component Analysis. Springer-Verlag, New York, 1989.Google ScholarGoogle Scholar
  17. E. Keogh, K. Chakrabarti, S. Mehrotra, and M. Pazzani. Locally adaptive dimensionality reduction for indexing large time series databases. In Proc. of ACM SIGMOD, pages 151--162, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. S. S. D. D. Lee. The manifold ways of perception. Science, v. 290 no. 5500, pages 2268--2269.Google ScholarGoogle Scholar
  19. R. C. T. Lee, J. R. Slagle, and H. Blum. A triangulation method for the sequential mapping of points from N-space to two-space. IEEE Transactions on Computers, pages 288--92, Mar. 1977.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Lowe. Similarity Metric Learning for a Variable-Kernel Classifier. Neural Computation, 7(1):72--85, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. G. McLachlan. Discriminant Analysis and Statistical Pattern Recognition. New York: Wiley, 1992.Google ScholarGoogle Scholar
  22. C. Merz and P. Murphy. UCI Repository of Machine Learning databases. http://www.ics.uci.edu/mlearn/MLRepository.html, 1996.Google ScholarGoogle Scholar
  23. T. Poggio and F. Girosi. Networks for approximation and learning, proc. IEEE 78, 1481, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  24. M. Polito and P. Perona. Grouping and dimensionality reduction by locally linear embedding. In NIPS, 2001.Google ScholarGoogle Scholar
  25. J. Quinlan. C4.5: Programs for Machine Learning. Morgan-Kaufmann Publishers, Inc., 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. S. R. L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science v. 290 no. 5500, pages 2223--2326, 2000.Google ScholarGoogle Scholar

Index Terms

  1. Non-linear dimensionality reduction techniques for classification and visualization

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
                July 2002
                719 pages
                ISBN:158113567X
                DOI:10.1145/775047

                Copyright © 2002 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 23 July 2002

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                KDD '02 Paper Acceptance Rate44of307submissions,14%Overall Acceptance Rate1,133of8,635submissions,13%

                Upcoming Conference

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader