ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Performance Evaluation
Volume 62, Issues 1-4, October 2005, Pages 456-474
Performance 2005
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (732 K)

Article Toolbox
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.peva.2005.07.030    
How to Cite or Link Using DOI (Opens New Window)

Copyright © 2005 Elsevier B.V. All rights reserved.

Systems with multiple servers under heavy-tailed workloads

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Konstantinos Psounisa, Corresponding Author Contact Information, E-mail The Corresponding Author, Pablo Molinero-Fernándezb, 1, E-mail The Corresponding Author, Balaji Prabhakarc, E-mail The Corresponding Author and Fragkiskos Papadopoulosd,E-mail The Corresponding Author

aDepartments of Electrical Engineering and Computer Science, University of Southern California, USA

bDepartment of Electrical Engineering, Stanford University, USA

cDepartments of Electrical Engineering and Computer Science, Stanford University, USA

dDepartment of Electrical Engineering, University of Southern California, USA


Available online 24 August 2005.

Abstract

The heavy-tailed nature of Internet flow sizes, web pages and computer files can cause non-preemptive scheduling policies to have a large average response time. Since there are numerous communication and distributed processing systems where preempting jobs can be quite expensive, reducing response times under this constraint is a pressing issue. One proposal for tackling non-preemption is through the use of multiple servers: classify jobs according to size and assign a server to each class. Unfortunately, in most systems of interest, job sizes are unknown.

An alterative is to queue all jobs together in a central-queue and assign them in a FCFS fashion to the next available server. But, this has been believed to yield large response times. In this paper, we argue that this is not the case, so long as there are enough servers. The question then is: what is the right number of servers, and is this small enough to be practical?

Despite the large amount of prior work in analyzing the behavior of a central-queue system, no existing models are accurate for the case of heavy-tailed size distributions. Our main contribution is a simple yet accurate model for a central-queue with multiple servers. This model accurately predicts the right number of servers, and the average and variance of the response time of the system. Hence, it can be used to improve the performance of some real systems, such as multi-server supercomputing centers and multi-channel communication systems.

Keywords: Heavy-tailed size distribution; Multi-server computer systems; M/G/K queue; Expected delay; Practical approximation formula; Blocking probability

Article Outline

1. Introduction
2. A single queue with many servers
3. An approximate model for the dynamics of the system
3.1. Blocking probability
3.2. A more realistic size distribution
3.3. Testing the model under real traces
3.4. Predicting the variance
4. Comparison to existing models
5. On the optimal number of servers
6. Conclusions
References
Vitae










Corresponding Author Contact InformationCorresponding author.
1 Present address: NetSpira Networks, Madrid, Spain.

Performance Evaluation
Volume 62, Issues 1-4, October 2005, Pages 456-474
Performance 2005
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.