Skip to main content

Novelty Framework for Knowledge Discovery in Databases

  • Conference paper
Data Warehousing and Knowledge Discovery (DaWaK 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3181))

Included in the following conference series:

Abstract

Knowledge Discovery in Databases (KDD) is an iterative process that aims at extracting interesting, previously unknown and hidden patterns from huge databases. Use of objective measures of interestingness in popular data mining algorithms often leads to another data mining problem, although of reduced complexity. The reduction in the volume of the discovered rules is desirable in order to improve the efficiency of the overall KDD process. Subjective measures of interestingness are required to achieve this. In this paper we study novelty of the discovered rules as a subjective measure of interestingness. We propose a framework to quantify novelty of the discovered rules in terms of their deviations from the known rules. The computations are carried out using the importance that the user gives to different deviations. The computed degree of novelty is then compared with the user given threshold to report novel rules to the user. We implement the proposed framework and experiment with some public datasets. The experimental results are quite promising.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, Menlo Park, CA (1996)

    Google Scholar 

  2. Pyle, D.: Data Preparation for Data Mining. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  3. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley & Sons (Asia) PV. Ltd., Chichester (2002)

    Google Scholar 

  4. Liu, B., Hsu, W., Chen, S.: Using General Impressions to Analyze Discovered Classification Rules. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, KDD 1997 (1997)

    Google Scholar 

  5. Piateskey-Shapiro, G., Matheus, C.J.: The Interestingness of Deviations. In: Proceedings of AAAI Workshop on Knowledge Discovery in Databases (1994)

    Google Scholar 

  6. Liu, B., Hsu, W., Mun, L., Lee, H.: Finding Interesting Patterns Using User Expectations. Technical Report:TRA7/96. Department of Information Systems and Computer Science, National University of Singapore (1996)

    Google Scholar 

  7. Silberschatz, A., Tuzhilin, A.: On Subjective Measures of Interestingness in Knowledge Discovery. In: Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining (1995)

    Google Scholar 

  8. Silberschatz, A., Tuzhilin, A.: What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Transactions on Knowledge and Data Engineering 5(6) (1996)

    Google Scholar 

  9. Liu, B., Hsu, W.: Post Analysis of Learned Rules. In: Proceedings of the 13th National Conference on AI(AAAI 1996) (1996)

    Google Scholar 

  10. Padmanabhan, B., Tuzhilin, A.: Unexpectedness as a Measure of Interestingness in Knowledge Discovery. Working paper # IS-97-6, Dept. of Information Systems, Stern School of Business, NYU (1997)

    Google Scholar 

  11. Kohonen, T.: Self-Organization and Associative Memory, 3rd edn. Springer, Berlin (1993)

    Google Scholar 

  12. Yairi, T., Kato, Y., Hori, K.: Fault Detection by Mining Association Rules from House-keeping Data. In: Proceedings of International Symposium on Artificial Intelligence, Robotics and Automation in Space, SAIRAS 2001 (2001)

    Google Scholar 

  13. Marsland, S.: On-Line Novelty Detection Through Self-Organization, with Application to Robotics. Ph.D. Thesis, Department of Computer Science, University of Manchester (2001)

    Google Scholar 

  14. Basu, S., Mooney, R.J., Pasupuleti, K.V., Ghosh, J.: Using Lexical Knowledge to Evaluate the Novelty of Rules Mined from Text. In: Proceedings of the NAACL workshop and other Lexical Resources: Applications, Extensions and Customizations (2001)

    Google Scholar 

  15. Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, KDD 1998 (1998)

    Google Scholar 

  16. Pujari, A.K.: Data Mining Techniques. 1st edn. Universities Press(India) Limited (2001)

    Google Scholar 

  17. Dunham, M.H.: Data Mining: Introductory and Advanced Topics, 1st edn. Pearson Education (Singaphore) Pte. Ltd., London (2003)

    Google Scholar 

  18. Williams, G.J.: Evolutionary Hot Spots Data Mining: An Architecture for Exploring for Interesting Discoveries. In: Zhong, N., Zhou, L. (eds.) PAKDD 1999. LNCS (LNAI), vol. 1574, pp. 184–193. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  19. Psaila, G.: Discovery of Association Rule Meta-Patterns. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676, pp. 219–228. Springer, Heidelberg (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Al-Hegami, A.S., Bhatnagar, V., Kumar, N. (2004). Novelty Framework for Knowledge Discovery in Databases. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2004. Lecture Notes in Computer Science, vol 3181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30076-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30076-2_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22937-7

  • Online ISBN: 978-3-540-30076-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics