Abstract
This paper introduces the concept of a Web server access hierarchy—a three-tier hierarchy that describes the traffic to a Web server in three levels: as aggregate traffic from multiple clients, as traffic from individual clients, and as traffic within sessions of individual clients. A detailed workload characterization study was undertaken of the Web server access hierarchy of a busy commercial server using an access log of 80 million requests captured over seven days of observation. The behavioural characteristics that emerge from this study show different features at each level and suggest effective stategies for managing resources at busy Internet Web servers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
V. Almeida, A. Bestavros, M. Crovella, and A. Oliveira, “Characterizing Reference Locality in the World Wide Web,“ in Proceedings of the IEEE Conference on Parallel and Distributed Information Systems, Miami Beach, Florida, pp. 92–103, December 1996.
V. Almeida, D. Menasce, R. Riedi, F. Pelegrinelli, R. Fonseca, and W. Meira, Jr., “Analyzing Web Robots and their Impact on Caching,” in Proceedings of the Sixth Workshop on Web Caching and Content Distribution, Boston, Massachusetts, June 2001.
V. Almeida, D. Menasce, R. Riedi, F. Pelegrinelli, R. Fonseca, and W. Meira, Jr., “Analyzing Robot Behaviour in E-Business Sites,” in Proceedings of the ACM SIGMETRICS Conference, Cambridge, Massachusetts, pp. 338–339, June 2001.
M. F. Arlitt, “Characterizing Web User Sessions,” Performance Evaluation Review, Vol. 28, No. 2, pp. 50–56, September 2000.
M. F. Arlitt and C. L. Williamson, “Internet Web Servers: Workload Characterization and Performance Implications,” IEEE/ACM Transactions on Networking, Vol. 5, No. 5, pp. 631–645, October 1997.
M. Arlitt and T. Jin, “A Workload Characterization Study of the 1998 World Cup Web Site,” IEEE Networks, Vol. 14, No. 3, pp. 30–37, May/June 2000.
P. Barford, A. Bestavros, A. Bradley, and M. Crovella, “Changes in Web Client Access Patterns: Characteristics and Caching Implications,” World Wide Web, Vol. 2, No. 1, pp. 15–28, January 1999.
H. Braun and K. C. Claffy, “Web Traffic Characterization: An Assessment of the Impact of Caching Documents from NCSA’s Web Server,” Computer Networks and ISDN Systems, Vol. 28, pp. 37–51, 1996.
L. D. Catledge, and J. E. Pitkow, “Characterizing Browsing Strategies in the World Wide Web,” Computer Networks and ISDN Systems, Vol. 26, No. 6, pp. 1065–1073, 1995.
M. E. Crovella and A. Bestavros, “Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes,” in Proceedings of the ACM SIGMETRICS Conference, Philadelphia, Pennsylvania, pp. 160–169, May 1996.
C. R. Cunha and A. Bestavros, and M. E. Crovella, “Characteristic of World Wide Web Client-Based Traces,” Technical Report TR-95-010, Department of Computer Science, Boston University, Boston, Massachussets, April 1995.
J. H. Hine, C. E. Wills, A. Martel, and J. Sommers, “Combining Client Knowledge and Resource Dependencies for Improved World Wide Web Performance”, in Proceedings of the INET 1998 Conference, Geneva, Switzerland, July 1998.
S. Jin and A. Bestavros, “Sources and Characteristics of Web Temporal Locality,” in Proceedings of the Eighth International Symposium on Modeling, Analysis and Simulation of Computer and Telecomminucation Systems, San Francisco, California, August/September 2000.
M. G. Kienzle, J. A. Garay, and W. H. Tetzlaff, “Analysis of Page-Reference Strings of an Interactive System,” IBM Journal of Research and Development, Vol. 32, No. 4, pp. 523–535, July 1988.
A. Mahanti, Web Proxy Workload Characterization and Modeling, M.Sc. Thesis, Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, September 1999.
D. Menasce, V. Almeida, R. Riedi, F. Peligrinelli, R. Fonseca, W. Meira Jr., “In Search of Invariants for E-Business Workloads,” in Proceedings of the Second ACM Electronic Commerce Conference, Minneapolis, Minnesota, October 2000.
J. C. Mogul, “The Case for Persistent-Connection HTTP,” in Proceedings of the ACM SIGCOMM Conference, Cambridge, Massachussets, pp. 299–313, August 1995.
J. C. Mogul “Network Behavior of a Busy Web Server and Its Clients,“ WRL Research Report 95/4, Digital Western Research Laboratory, May 1995.
A. A. Oke, Workload Characterization for Resource Management at Web Servers, M.Sc. Thesis, Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, October 2000.
J. Spirn, “Distance String Models for Program Behaviour,” IEEE Computer, Vol. 9, No. 11, pp. 14–20, November 1976.
K. Yap, “A Technical Overview of the New HTTP/1.1 Specification,” in Proceedings of the Third Australian World Wide Web Conference, Australia, May 1997.
G. K. Zipf, Human Behaviour and the Principle of Least Effort, Addison-Wesley, Cambridge, Massachusetts, 1949.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oke, A., Bunt, R. (2002). Hierarchical Workload Characterization for a Busy Web Server. In: Field, T., Harrison, P.G., Bradley, J., Harder, U. (eds) Computer Performance Evaluation: Modelling Techniques and Tools. TOOLS 2002. Lecture Notes in Computer Science, vol 2324. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46029-2_23
Download citation
DOI: https://doi.org/10.1007/3-540-46029-2_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43539-6
Online ISBN: 978-3-540-46029-9
eBook Packages: Springer Book Archive