Abstract
Visualization provides insight through images and can be considered as a collection of application specific mappings: ProblemDomain → VisualRange. For the visualization of multivariate problems a multidimensional system of parallel coordinates (abbr. ∥-coords) is constructed which induces a one-to-one mapping between subsets of N-space and subsets of 2-space. The result is a rigorous methodology for doing and seeing N-dimensional geometry. Starting with an the overview of the mathematical foundations, it is seen that the display of high-dimensional datasets and search for multivariate relations among the variables is transformed into a 2-D pattern recognition problem. This is the basis for the application to Visual Data Mining which is illustrated with a real dataset of VLSI (Very Large Scale Integration — “chip”) production. Then a recent geometric classifier is presented and applied to 3 real datasets. The results compared to those of 23 other classifiers have the least error. The algorithm, has quadratic computational complexity in the size and number of parameters, provides comprehensible and explicit rules, does dimensionality selection — where the minimal set of original variables required to state the rule is found, and orders these variables so as to optimize the clarity of separation between the designated set and its complement. Finally a simple visual economic model of a real country is constructed and analyzed in order to illustrate the special strength of ∥-coords in modeling multivariate relations by means of hypersurfaces.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
T. Avidan and S. Avidan. Parallax-a Data Mining tool based on parallel coordinates. Comput. Statit., 13-1:65, 1998.
E. W. Bassett. Ibm’s ibm fix. Industrial Computing, 14(41):23–25, 1995.
V. G. Boltyanskii. Envelopes, R.B.Brown translator (original in Russian). Pergamon Press, New York, 1964.
A. Chatterjee. Visualizing Multidimensional Poly topes and Topologies for Tolerances. Ph.D. Thesis, Dept. Comp. Sci., Univ. of S. Calif., 1995.
H. S. M. Coxeter. The Real Projective Plane. Third Edition, Springer-Verlag, New York, 1992.
J. Eickemeyer. Visualizing p-flats in N-space using Parallel Coordinates. Ph.D. Thesis, Dept. Comp. Sc, UCLA, 1992.
Deterding D. H. Speaker Normalization for Automatic Speech Recognition. Ph.D. Thesis, Cambridge University, 1989.
A. Inselberg. Don’t panic … do it in parallel! Comput. Statit., 14:53–77, 1999.
A. Inselberg. Parallel Coordinates: VISUAL Multidimensional Geometry and it’s Applications, Springer-Verlag, New York, 2005.
A. Inselberg and B. Dimsdale. Multidimensional lines ii: Proximity and applications. SIAM J. of Applied Math., 54-2:578–596, 1994.
A. Inselberg, M. Reif, and T. Chomut. Convexity algorithms in parallel coordinates. J. ACM, 34:765–801, 1987.
A. Inselberg and Avidan T. The Automated Multidimensional Detective, in Proc. of IEEE Information Visualization’ 99, 112–119. IEEE Comp. Soc., Los Alamitos, CA, 1999.
Robinson A. J. and Fallside F. A Dynamic Connectionist Model of Phoneme Recognition. Proc. of 1st European Neural Network Conf. (nEURO), 1988.
D. A. Keim and H. P. Kriegel. Visualization techniques for mining large databases: A comparison. Trans. Knowl. and Data Engr., 8-6:923–938, 1996.
A. R. Martin and M. O. Ward. High dimensional brushing for interactive exploration of multivariate data, Proc. IEEE Conf. on Visualization, Atlanta, GA, 271–278. IEEE Comp. Soc., Los Alamitos, CA, 1995.
D. Michie, D.J. Spiegelhalter, and Taylor C.C. Machine Learning, Neural and Statistical Classification. Ellis Horwood series in AI, 1994.
C. Schmid and H. Hinterberger. Comparative Multivariate Visualization Across Conceptually Different Graphic Displays, in Proc. of 7th SSDBM. IEEE Comp. Soc., Los Alamitos, CA, 1994.
N. Shang and L. Breiman. Distribution based trees are more accurate. Proc. of 1st Inter. Conf. on Neural Info. Proc. (ICONIP96), 133, 1996.
D. F. Swayne, D. Cook, and A. Buja. XGobi: Interactive Dynamic Graphics in the X Window System. JCGS, 7-1, 113–130, 1998.
E. R. Tufte. The Visual Display of Quantitative Information. Graphic Press, Connecticut, 1983.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Inselberg, A. (2005). Visualization and Data Mining for High Dimensional Datasets. In: Maimon, O., Rokach, L. (eds) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA. https://doi.org/10.1007/0-387-25465-X_14
Download citation
DOI: https://doi.org/10.1007/0-387-25465-X_14
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-24435-8
Online ISBN: 978-0-387-25465-4
eBook Packages: Computer ScienceComputer Science (R0)