| Web search clickstreams |
| Full text |
Pdf
(481 KB)
|
| Source
|
Internet Measurement Conference
archive
Proceedings of the 6th ACM SIGCOMM conference on Internet measurement
table of contents
Rio de Janeriro, Brazil
SESSION: Traffic
table of contents
Pages: 245 - 250
Year of Publication: 2006
ISBN:1-59593-561-4
|
|
Authors
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 135, Citation Count: 1
|
|
|
ABSTRACT
Search engines are a vital part of the Web and thus the Internet infrastructure. Therefore understanding the behavior of users searching the Web gives insights into trends, and enables enhancements of future search capabilities. Possible data sources for studying Web search behavior are either server-side logs or client-side logs. Unfortunately, current server-side logs are hard to obtain as they are considered proprietary by the search engine operators. Therefore we in this paper present a methodology for extracting client-side logs from the traffic exchanged between a large user group and the Internet. The added benefit of our methodology is that we do not only extract the search terms, the query sequences, and search results of each individual user but also the full clickstream, i.e., the result pages users view and the subsequently visited hyperlinked pages. We propose a finite-state Markov model that captures the user web searching and browsing behavior and allows us to deduce users' prevalent search patterns. To our knowledge, this is the first such detailed client-side analysis of clickstreams.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Google basic search. http://www.google.com/support/bin/static.py?page=searchguides.html&ctx=basics.
|
 |
2
|
|
| |
3
|
|
 |
4
|
Steven M. Beitzel , Eric C. Jensen , Abdur Chowdhury , David Grossman , Ophir Frieder, Hourly analysis of a very large topically categorized web query log, Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, July 25-29, 2004, Sheffield, United Kingdom
[doi> 10.1145/1008992.1009048]
|
| |
5
|
|
| |
6
|
H. Cui, J.-R. Wen, J.-Y. Nie, and W.-Y. Ma. Query expansion by mining user logs. In IEEE Trans. Knowl. Data Eng. 15(4), 2003.
|
| |
7
|
B. Jansen and U. Pooch. Web user studies: A review and framework for future work. In American Society of Information Science and Technology, 2001.
|
| |
8
|
B. Krishnamurthy and J. Rexford. Web Protocols and Practice. Addison-Wesley, 2001.
|
 |
9
|
|
| |
10
|
J. Luxenburger and G. Weikum. Query-log based authority analysis for web information search. In WISE, 2004.
|
| |
11
|
|
 |
12
|
|
 |
13
|
|
| |
14
|
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz. Analysis of a very large altavista query log. Technical report, SRC Technical Note 014, 1998.
|
| |
15
|
A. Spink, B. J. Jansen, and H. C. Ozmultu. Use of query reformulation and relevance feedback by excite users. In Internet Research: Electronic Networking Applications and Policy, 2000.
|
| |
16
|
|
| |
17
|
|
 |
18
|
|
|