ABSTRACT
Users of collaborative applications as well as individual users in their private environment return to previously visited Web pages for various reasons; apart from pages visited due to backtracking, they typically have a number of favorite or important pages that they monitor or tasks that reoccur on an infrequent basis. In this paper, we introduce a library of methods that facilitate revisitation through the effective prediction of the next page request. It is based on a generic framework that inherently incorporates contextual information, handling uniformly both server- and the client-side applications. Unlike other existing approaches, the methods it encompasses are real-time, since they do not rely on training data or machine learning algorithms. We evaluate them over two large, real-world datasets, with the outcomes suggesting a significant improvement over methods typically used in this context. We have also made our implementation and data publicly available, thus encouraging other researchers to use it as a benchmark and to extend it with new techniques for supporting user's navigational activity.
- E. Adar, J. Teevan, and S. T. Dumais. Large scale analysis of web revisitation patterns. In CHI, pages 1197--1206, 2008. Google ScholarDigital Library
- G. Adomavicius and A. Tuzhilin. Using data mining methods to build customer profiles. IEEE Computer, 34(2):74--82, 2001. Google ScholarDigital Library
- R. Agrawal, T. Imielinski, and A. N. Swami. Mining association rules between sets of items in large databases. In SIGMOD, pages 207--216, 1993. Google ScholarDigital Library
- R. Agrawal and R. Srikant. Mining sequential patterns. In ICDE, pages 3--14, 1995. Google ScholarDigital Library
- D. W. Albrecht, I. Zukerman, and A. E. Nicholson. Pre-sending documents on the www: A comparative study. In IJCAI, pages 1274--1279, 1999. Google ScholarDigital Library
- M. Awad, L. Khan, and B. M. Thuraisingham. Predicting www surfing using multiple evidence combination. VLDB J., 17(3):401--417, 2008. Google ScholarDigital Library
- J. Brank, N. Milic-Frayling, A. Frayling, and G. Smyth. Predictive algorithms for browser support of habitual user activities on the web. In Web Intelligence, pages 629--635, 2005. Google ScholarDigital Library
- S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1--7):107--117, 1998. Google ScholarDigital Library
- P. Brusilovsky. Adaptive hypermedia. User Modeling and User-Adapted Interaction, 11(1--2):87--110, 2001. Google ScholarDigital Library
- A. Cockburn and B. J. McKenzie. What do web users do? an empirical analysis of web use. Int. J. Hum.-Comput. Stud., 54(6):903--922, 2001. Google ScholarDigital Library
- M. Deshpande and G. Karypis. Selective markov models for predicting web page accesses. ACM Trans. Internet Techn., 4(2):163--184, 2004. Google ScholarDigital Library
- M. El-Sayed, C. Ruiz, and E. A. Rundensteiner. Fs-miner: efficient and incremental mining of frequent sequence patterns in web logs. In WIDM, pages 128--135, 2004. Google ScholarDigital Library
- X. Fu, J. Budzik, and K. J. Hammond. Mining navigation history for recommendation. In IUI, pages 106--112, 2000. Google ScholarDigital Library
- W. Gaul and L. Schmidt-Thieme. Mining generalized association rules for sequential and path data. In ICDM, pages 593--596, 2001. Google ScholarDigital Library
- M. Géry and M. H. Haddad. Evaluation of web usage mining approaches for user's next request prediction. In WIDM, pages 74--81, 2003. Google ScholarDigital Library
- D. Hawking, N. Craswell, P. Bailey, and K. Griffiths. Measuring search engine quality. Inf. Retr., 4(1):33--59, 2001. Google ScholarDigital Library
- E. Herder. Characterizations of user web revisit behavior. In LWA, pages 32--37, 2005.Google Scholar
- P. Kazienko. Mining indirect association rules for web recommendation. Applied Mathematics and Computer Science, 19(1):165--186, 2009. Google ScholarDigital Library
- I. Koychev and I. Schwab. Adaptation to drifting user's interests. In ECML Workshop: Machine Learning in New Information Age, pages 39--46, 2000.Google Scholar
- B. Mobasher, R. Cooley, and J. Srivastava. Automatic personalization based on web usage mining. Communications of the ACM, 43(8):142--151, 2000. Google ScholarDigital Library
- H. Obendorf, H. Weinreich, E. Herder, and M. Mayer. Web page revisitation revisited: implications of a long-term click-stream study of browser usage. In CHI, pages 597--606, 2007. Google ScholarDigital Library
- G. Papadakis, C. Niederee, and W. Nejdl. Decay-based ranking for social application content. In WEBIST, pages 276--282, 2010.Google Scholar
- A. G. Parameswaran, G. Koutrika, B. Bercovitz, and H. Garcia-Molina. Recsplorer: recommendation algorithms based on precedence mining. In SIGMOD, pages 87--98, 2010. Google ScholarDigital Library
- J. Pei, J. Han, and W. Wang. Mining sequential patterns with constraints in large databases. In CIKM, pages 18--25, 2002. Google ScholarDigital Library
- J. J. Sandvig, B. Mobasher, and R. Burke. Robustness of collaborative recommendation based on association rule mining. In RecSys, pages 105--112, 2007. Google ScholarDigital Library
- L. Tauscher and S. Greenberg. How people revisit web pages: empirical findings and implications for the design of history systems. Int. J. Hum.-Comput. Stud., 47(1):97--137, 1997. Google ScholarDigital Library
- Y. Yao, L. Shi, and Z. Wang. A markov prediction model based on page hierarchical clustering. Int. J. Distrib. Sen. Netw., 5(1):89--89, 2009. Google ScholarDigital Library
- I. Zukerman, D. W. Albrecht, and A. E. Nicholson. Predicting users' requests on the www. In UM, pages 275--284, 1999. Google ScholarDigital Library
Index Terms
- Client- and server-side revisitation prediction with SUPRA
Recommendations
Methods for web revisitation prediction: survey and experimentation
More than 45 % of the pages that we visit on the Web are pages that we have visited before. Browsers support revisits with various tools, including bookmarks, history views and URL auto-completion. However, these tools only support revisits to a small ...
Beyond the usual suspects: context-aware revisitation support
HT '11: Proceedings of the 22nd ACM conference on Hypertext and hypermediaA considerable amount of our activities on the Web involves revisits to pages or sites. Reasons for revisiting include active monitoring of content, verification of information, regular use of online services, and reoccurring tasks. Browsers support for ...
Large scale analysis of web revisitation patterns
CHI '08: Proceedings of the SIGCHI Conference on Human Factors in Computing SystemsOur work examines Web revisitation patterns. Everybody revisits Web pages, but their reasons for doing so can differ depending on the particular Web page, their topic of interest, and their intent. To characterize how people revisit Web content, we ...
Comments