Abstract
We have developed a distributed data mining algorithm based on the progressive knowledge extraction principle. The knowledge factors, the data attributes that are significant statistically or based on a predefined mining function, are extracted progressively from the distributed data sets. The critical data attributes and sample data set are selected iteratively from distributed data sources. The experiments showed that the algorithm is valid and has the potentials for the large distributed data mining practices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Provost, F.: Distributed Data Mining: Scaling Up and Beyond. In: Advances in Distributed and Parallel Knowledge Discovery, pp. 3–28. The MIT Press, Cambridge (2000)
Liu, J.B., Han, J.: A Practical Knowledge Discovery Process for Distributed Data Mining. In: Proceedings of 11th International Conference on Intelligent Systems, Boston, pp. 11–16 (2002)
Yost, J.K., Liu, J.B., McConnaughay, K.D.M., Winn, W.: HPNC for Bradley University in Science and Engineering Research, NSF Grant 0125067 (2001)
Aggarwal, C.C., Yu, P.S.: A New Approach to Online Generation of Association Rules. IEEE Transaction on Knowledge and Data Engineering 13(4), 527–540 (2001)
Aggarwal, C.C., Yu, P.S.: Mining Associations with the Collective Strength Approach. IEEE Transaction on Knowledge and Data Engineering 13(6), 863–873 (2001)
Aggarwal, C.C., Yu, P.S.: Redefining Clustering for High-Dimensional Applications. IEEE Transaction on Knowledge and Data Engineering 14(2), 210–225 (2002)
Aggarwal, C.C., Sun, Z., Yu, P.S.: Fast Algorithm for Online Generation of Profile Associate Rules. IEEE Transaction on Knowledge and Data Engineering 14(5), 1017–1028 (2002)
IBM Redbooks: Intelligent Miner for Data: Enhance Your Business Intelligence. IBM Corporation (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, J.B., Thanneru, U., Cheng, D. (2004). A Distributed Knowledge Extraction Data Mining Algorithm. In: Zhang, J., He, JH., Fu, Y. (eds) Computational and Information Science. CIS 2004. Lecture Notes in Computer Science, vol 3314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30497-5_119
Download citation
DOI: https://doi.org/10.1007/978-3-540-30497-5_119
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24127-0
Online ISBN: 978-3-540-30497-5
eBook Packages: Computer ScienceComputer Science (R0)