Non-linear dimensionality reduction techniques for classification and visualization

Authors:
Michail Vlachos, UC Riverside
Carlotta Domeniconi, UC Riverside
Dimitrios Gunopulos, UC Riverside
George Kollios, Boston University
Nick Koudas, AT&T Labs Research

KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, July 2002, Pages 645–651. https://doi.org/10.1145/775047.775143

Published: 23 July 2002

ABSTRACT

In this paper we address the use of local embeddings for data visualization in two and three dimensions and for classification. We advocate their use on the basis that they provide an efficient mapping procedure from the original dimension of the data to a lower intrinsic dimension. We show how they can accurately capture the user's perception of similarity in high-dimensional data for visualization purposes. Moreover, we exploit the low-dimensional mapping provided by these embeddings to develop new classification techniques, and we show experimentally that the classification accuracy is comparable to that of a number of other classification procedures, while using fewer dimensions.
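The abstract does not say which local embedding is used. As an illustration only, the sketch below assumes locally linear embedding (LLE, Roweis and Saul) as a representative method and shows its local step: each point is written as a weighted combination of its nearest neighbours, with the weights constrained to sum to one. The function names (`k_nearest`, `solve`, `reconstruction_weights`) and the regularization constant are hypothetical choices for this sketch, not taken from the paper, and the global step that turns the weights into low-dimensional coordinates is omitted.

```python
import math


def k_nearest(points, i, k):
    """Indices of the k Euclidean nearest neighbours of points[i], excluding i."""
    dists = sorted((math.dist(points[i], p), j)
                   for j, p in enumerate(points) if j != i)
    return [j for _, j in dists[:k]]


def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[r]] for r, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def reconstruction_weights(x, neighbours, reg=1e-3):
    """Weights w (summing to 1) that minimise |x - sum_j w_j * neighbours[j]|^2.

    reg is an assumed regularization factor that keeps the local Gram
    matrix invertible when the neighbours are linearly dependent.
    """
    k = len(neighbours)
    diffs = [[xi - ni for xi, ni in zip(x, nb)] for nb in neighbours]
    gram = [[sum(p * q for p, q in zip(diffs[r], diffs[c])) for c in range(k)]
            for r in range(k)]
    trace = sum(gram[r][r] for r in range(k))
    eps = reg * trace if trace > 0 else reg
    for r in range(k):
        gram[r][r] += eps
    w = solve(gram, [1.0] * k)
    total = sum(w)
    return [wi / total for wi in w]


if __name__ == "__main__":
    pts = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (10.0, 10.0)]
    nbrs = k_nearest(pts, 1, 2)
    print(nbrs, reconstruction_weights(pts[1], [pts[j] for j in nbrs]))
```

In a full LLE pipeline these weights would be computed for every point and then reused to find low-dimensional coordinates that preserve the same local reconstructions; a simple classifier such as nearest neighbour could then operate in that reduced space. Whether this matches the classification techniques developed in the paper cannot be determined from the abstract.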

Published in
KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
July 2002
719 pages
ISBN: 158113567X
DOI: 10.1145/775047
Conference Chair: Osmar R. Zaïane (University of Alberta, Canada)
General Chair: Randy Goebel (University of Alberta, Canada)
Program Chairs: David Hand (Imperial College, UK), Daniel Keim (AT&T), Raymond Ng (University of British Columbia, Canada)
Copyright © 2002 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Acceptance Rates
KDD '02 Paper Acceptance Rate: 44 of 307 submissions, 14%
Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%