ACM Home Page
Please provide us with feedback. Feedback
Web search clickstreams
Full text PdfPdf (481 KB)
Source Internet Measurement Conference archive
Proceedings of the 6th ACM SIGCOMM conference on Internet measurement table of contents
Rio de Janeriro, Brazil
SESSION: Traffic table of contents
Pages: 245 - 250  
Year of Publication: 2006
ISBN:1-59593-561-4
Authors
Nils Kammenhuber  Technische Universität München, Germany
Julia Luxenburger  Max-Planck Institute of Informatics, Germany
Anja Feldmann  Deutsche Telekom Laboratories, Germany
Gerhard Weikum  Max-Planck Institute of Informatics, Germany
Sponsors
SIGCOMM: ACM Special Interest Group on Data Communication
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 14,   Downloads (12 Months): 135,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1177080.1177110
What is a DOI?

ABSTRACT

Search engines are a vital part of the Web and thus the Internet infrastructure. Therefore understanding the behavior of users searching the Web gives insights into trends, and enables enhancements of future search capabilities. Possible data sources for studying Web search behavior are either server-side logs or client-side logs. Unfortunately, current server-side logs are hard to obtain as they are considered proprietary by the search engine operators. Therefore we in this paper present a methodology for extracting client-side logs from the traffic exchanged between a large user group and the Internet. The added benefit of our methodology is that we do not only extract the search terms, the query sequences, and search results of each individual user but also the full clickstream, i.e., the result pages users view and the subsequently visited hyperlinked pages. We propose a finite-state Markov model that captures the user web searching and browsing behavior and allows us to deduce users' prevalent search patterns. To our knowledge, this is the first such detailed client-side analysis of clickstreams.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
Google basic search. http://www.google.com/support/bin/static.py?page=searchguides.html&ctx=basics.
2
 
3
4
 
5
 
6
H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma. Query expansion by mining user logs. In IEEE Trans. Knowl. Data Eng. 15(4), 2003.
 
7
B. Jansen and U. Pooch. Web user studies: A review and framework for future work. In American Society of Information Science and Technology, 2001.
 
8
B. Krishnamurthy and J. Rexford. Web Protocols and Practice. Addison-Wesley, 2001.
9
 
10
J. Luxenburger and G. Weikum. Query-log based authority analysis for web information search. In WISE, 2004.
 
11
12
13
 
14
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz. Analysis of a very large altavista query log. Technical report, SRC Technical Note 014, 1998.
 
15
A. Spink, B. J. Jansen, and H. C. Ozmultu. Use of query reformulation and relevance feedback by excite users. In Internet Research: Electronic Networking Applications and Policy, 2000.
 
16
 
17
18


Collaborative Colleagues:
Nils Kammenhuber: colleagues
Julia Luxenburger: colleagues
Anja Feldmann: colleagues
Gerhard Weikum: colleagues