Novelty Framework for Knowledge Discovery in Databases

Al-Hegami, Ahmed Sultan; Bhatnagar, Vasudha; Kumar, Naveen

doi:10.1007/978-3-540-30076-2_5

Ahmed Sultan Al-Hegami¹⁹,
Vasudha Bhatnagar¹⁹ &
Naveen Kumar¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3181))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

440 Accesses
7 Citations

Abstract

Knowledge Discovery in Databases (KDD) is an iterative process that aims at extracting interesting, previously unknown and hidden patterns from huge databases. Use of objective measures of interestingness in popular data mining algorithms often leads to another data mining problem, although of reduced complexity. The reduction in the volume of the discovered rules is desirable in order to improve the efficiency of the overall KDD process. Subjective measures of interestingness are required to achieve this. In this paper we study novelty of the discovered rules as a subjective measure of interestingness. We propose a framework to quantify novelty of the discovered rules in terms of their deviations from the known rules. The computations are carried out using the importance that the user gives to different deviations. The computed degree of novelty is then compared with the user given threshold to report novel rules to the user. We implement the proposed framework and experiment with some public datasets. The experimental results are quite promising.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, Menlo Park, CA (1996)
Google Scholar
Pyle, D.: Data Preparation for Data Mining. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley & Sons (Asia) PV. Ltd., Chichester (2002)
Google Scholar
Liu, B., Hsu, W., Chen, S.: Using General Impressions to Analyze Discovered Classification Rules. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, KDD 1997 (1997)
Google Scholar
Piateskey-Shapiro, G., Matheus, C.J.: The Interestingness of Deviations. In: Proceedings of AAAI Workshop on Knowledge Discovery in Databases (1994)
Google Scholar
Liu, B., Hsu, W., Mun, L., Lee, H.: Finding Interesting Patterns Using User Expectations. Technical Report:TRA7/96. Department of Information Systems and Computer Science, National University of Singapore (1996)
Google Scholar
Silberschatz, A., Tuzhilin, A.: On Subjective Measures of Interestingness in Knowledge Discovery. In: Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining (1995)
Google Scholar
Silberschatz, A., Tuzhilin, A.: What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Transactions on Knowledge and Data Engineering 5(6) (1996)
Google Scholar
Liu, B., Hsu, W.: Post Analysis of Learned Rules. In: Proceedings of the 13th National Conference on AI(AAAI 1996) (1996)
Google Scholar
Padmanabhan, B., Tuzhilin, A.: Unexpectedness as a Measure of Interestingness in Knowledge Discovery. Working paper # IS-97-6, Dept. of Information Systems, Stern School of Business, NYU (1997)
Google Scholar
Kohonen, T.: Self-Organization and Associative Memory, 3rd edn. Springer, Berlin (1993)
Google Scholar
Yairi, T., Kato, Y., Hori, K.: Fault Detection by Mining Association Rules from House-keeping Data. In: Proceedings of International Symposium on Artificial Intelligence, Robotics and Automation in Space, SAIRAS 2001 (2001)
Google Scholar
Marsland, S.: On-Line Novelty Detection Through Self-Organization, with Application to Robotics. Ph.D. Thesis, Department of Computer Science, University of Manchester (2001)
Google Scholar
Basu, S., Mooney, R.J., Pasupuleti, K.V., Ghosh, J.: Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text. In: Proceedings of the NAACL workshop and other Lexical Resources: Applications, Extensions and Customizations (2001)
Google Scholar
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, KDD 1998 (1998)
Google Scholar
Pujari, A.K.: Data Mining Techniques. 1st edn. Universities Press(India) Limited (2001)
Google Scholar
Dunham, M.H.: Data Mining: Introductory and Advanced Topics, 1st edn. Pearson Education (Singaphore) Pte. Ltd., London (2003)
Google Scholar
Williams, G.J.: Evolutionary Hot Spots Data Mining: An Architecture for Exploring for Interesting Discoveries. In: Zhong, N., Zhou, L. (eds.) PAKDD 1999. LNCS (LNAI), vol. 1574, pp. 184–193. Springer, Heidelberg (1999)
Chapter Google Scholar
Psaila, G.: Discovery of Association Rule Meta-Patterns. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676, pp. 219–228. Springer, Heidelberg (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Delhi, Delhi-07, India
Ahmed Sultan Al-Hegami, Vasudha Bhatnagar & Naveen Kumar

Authors

Ahmed Sultan Al-Hegami
View author publications
You can also search for this author in PubMed Google Scholar
Vasudha Bhatnagar
View author publications
You can also search for this author in PubMed Google Scholar
Naveen Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, 606-8501, Sakyo, Kyoto, Japan
Yahiko Kambayashi
I.B.M. India Research Lab,, India
Mukesh Mohania
Institute for Application Oriented Knowledge Processing (FAW), Johannes Kepler University Linz, Austria
Wolfram Wöß

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Al-Hegami, A.S., Bhatnagar, V., Kumar, N. (2004). Novelty Framework for Knowledge Discovery in Databases. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2004. Lecture Notes in Computer Science, vol 3181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30076-2_5

Download citation

DOI: https://doi.org/10.1007/978-3-540-30076-2_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22937-7
Online ISBN: 978-3-540-30076-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics