ABSTRACT
Association rule mining, a critical technology for decision making, faces two key challenges: (a.) performance: unacceptably high response times that are not capable of supporting interactive analysis, and (b.) usability: lack of support for sense-making of rule relationships. In this paper I describe two solutions proposed in my dissertation research that tackle these challenges. Our proposed COLARM system addresses the online mining of local association rules. Further, our proposed PARAS and FIRE systems provide solutions for interactive mining of global association rules. Overall, this research encompasses contributions at the intersection of data mining, knowledge management and visual analytics.
- C. C. Aggarwal, C. Procopiuc, and P. S. Yu. Finding localized associations in market basket data. IEEE Trans. on Knowl. and Data Eng., pages 51--62, 2002. Google ScholarDigital Library
- C. C. Aggarwal and P. S. Yu. A new approach to online generation of association rules. IEEE Trans. Knowl. Data Eng., 13(4):527--540, 2001. Google ScholarDigital Library
- R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In SIGMOD, pages 207--216, 1993. Google ScholarDigital Library
- R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In VLDB, pages 487--499, 1994. Google ScholarDigital Library
- C. Borgelt. Efficient implementations of apriori, eclat and fp-growth. http://www.borgelt.net, August 2013.Google Scholar
- S. Brin, R. Motwani, J. D. Ullman, and S. Tsur. Dynamic itemset counting and implication rules for market basket data. In SIGMOD, pages 255--264, 1997. Google ScholarDigital Library
- M. Das, D. P, P. Deshpande, and R. Kannan. Fast rule mining over multi-dimensional windows. In SDM, pages 582--593, 2011.Google ScholarCross Ref
- R. L. Graham, D. E. Knuth, and O. Patashnik. Concrete Mathematics. Addison-Wesley, 1994. Google ScholarDigital Library
- Z. Guo, M. O. Ward, and E. A. Rundensteiner. Nugget browser: Visual subgroup mining and statistical significance discovery in multivariate dataset. IV2011, pages 267--275, 2011. Google ScholarDigital Library
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: an update. SIGKDD Explor. Newsl., 11(1):10--18, 2009. Google ScholarDigital Library
- J. Han and Y. Fu. Discovery of multiple-level association rules from large databases. VLDB, pages 420--431, 1995. Google ScholarDigital Library
- J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD, pages 1--12, 2000. Google ScholarDigital Library
- C. Hidber. Online association rule mining. ACM SIGMOD Rec., 28(2):145--156, 1999. Google ScholarDigital Library
- M. Kaya and R. Alhajj. Online mining of fuzzy multidimensional weighted association rules. Applied Intelligence, 29:13--34, 2008. Google ScholarDigital Library
- M. Kubat, A. Hafez, V. V. Raghavan, J. R. Lekkala, and W. K. Chen. Itemset trees for targeted association querying. IEEE Trans. Knowl. Data Eng., 15(6):1522--1534, 2003. Google ScholarDigital Library
- X. Lin, A. Mukherji, E. A. Rundensteiner, C. Ruiz, and M. O. Ward. PARAS: A parameter space framework for online association mining. In PVLDB, volume 6 (3), pages 193--204, January 2013. Google ScholarDigital Library
- C. Lucchese, S. Orlando, R. Perego, and F. Silvestri. Webdocs: a real-life huge transac. dataset. In FIMI, 2004.Google Scholar
- A. Mukherji, X. Lin, C. R. Botaish, J. Whitehouse, E. A. Rundensteiner, M. O. Ward, and C. Ruiz. Paras: interactive parameter space exploration for association rule mining. In SIGMOD, pages 1017--1020, June 2013. Google ScholarDigital Library
- A. Mukherji, X. Lin, J. Whitehouse, C. R. Botaish, E. A. Rundensteiner, and M. O. Ward. Fire: interactive visual support for parameter space-driven rule mining. In CIKM, pages 2447--2452, 2013. Google ScholarDigital Library
- A. Mukherji, E. A. Rundensteiner, and M. O. Ward. Colarm: Cost-based optimization for localized association rule mining. In EDBT, pages 181--192, 2014.Google Scholar
- B. Nag, P. M. Deshpande, and D. J. DeWitt. Using a knowledge cache for interactive discovery of association rules. In KDD, pages 244--253, 1999. Google ScholarDigital Library
- R. Ng, L. V. S. Lakshmanan, J. Han, and T. Mah. Exploratory mining via constrained frequent set queries. SIGMOD Rec., 1999. Google ScholarDigital Library
- S. Sahar. Interestingness preprocessing. In ICDM, pages 489--496, 2001. Google ScholarDigital Library
- E. H. Simpson. The interpretation of interaction in contingency tables. Journal of the Royal Statistical Society (Series B), 13:238--241, 1951.Google ScholarCross Ref
- R. Srikant and R. Agrawal. Mining generalized association rules. In VLDB, pages 407--419, 1995. Google ScholarDigital Library
- R. Srikant and R. Agrawal. Mining quantitative association rules in large relational tables. In SIGMOD, pages 1--12, 1996. Google ScholarDigital Library
- UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html, July 2013.Google Scholar
- C.-Y. Wang, S.-S. Tseng, and T.-P. Hong. Flexible online association rule mining based on multidimensional pattern relations. Inf. Sci., pages 1752--1780, 2006. Google ScholarDigital Library
- T. Wu, Y. Chen, and J. Han. Association mining in large databases: A re-examination of its measures. In PKDD, pages 621--628, 2007. Google ScholarDigital Library
- M. J. Zaki and C.-J. Hsiao. Charm: An efficient algorithm for closed itemset mining. In SIAM SDM, 2002.Google ScholarCross Ref
- M. J. Zaki, S. Parthasarathy, M. Ogihara, and W. Li. New algorithms for fast discovery of association rules. In SIGKDD, pages 283--286, 1997.Google Scholar
Index Terms
- Facilitating Interactive Mining of Global and Local Association Rules
Recommendations
TCOM, an innovative data structure for mining association rules among infrequent items
Association rule mining is one of the most important areas in data mining, which has received a great deal of attention. The purpose of association rule mining is the discovery of association relationships or correlations among a set of items. In this ...
Reliable representations for association rules
Association rule mining has contributed to many advances in the area of knowledge discovery. However, the quality of the discovered association rules is a big concern and has drawn more and more attention recently. One problem with the quality of the ...
A Survey on Association Rule Mining
ACCT '15: Proceedings of the 2015 Fifth International Conference on Advanced Computing & Communication TechnologiesTask of extracting useful and interesting knowledge from large data is called data mining. It has many aspects like clustering, classification, association mining, outlier detection, regression etc. Among them association rule mining is one of the ...
Comments