Measurement-based end to end latency performance prediction for SLA verification

doi:10.1016/j.jss.2003.12.031

Journal of Systems and Software

Volume 74, Issue 3, 1 February 2005, Pages 243-254

https://doi.org/10.1016/j.jss.2003.12.031 Get rights and content

Abstract

End-to-end delay is one of the important metrics used to define perceived quality of service. Measurement is a fundamental tool of network management to assess the performance of the network. The conventional approach to measure the performance metrics is on an intrusive basis that may cause extra-burden to the network. In contrast to this, our scheme can be considered as non-intrusive. The main idea relies on the knowledge of the queuing behaviour. The queue length is non-intrusively monitored, and then we capture the parameters of the queue state distribution of every queue along the path in order to deduce the end-to-end delay performance.

Introduction

Recently, service level agreements (SLA) (TeleManagement Forum, 2001), a contract between network operators and its customers, have been commonly employed to define the expected quality of service provided by the network operator. Failure to meet the requirements listed in the SLA incurs compensation to the customers. The customers may wish to know if the perceived services meet the requirement. Similarly, the network operators have a responsibility to give evidence of meeting the SLA requirement. Measurement is necessary to collect the statistics to provide the evidence.

End-to-end delay requirement is one of the quality of service performance requirements that appears in an SLA. An SLA may define the proportion of packets whose delay will exceed a given value. The current approach for the end-to-end delay requirement is known to be intrusive (Cole et al., 2000). It can be on either one-way or two-way basis. The sending end generates the testing packets and injects these into the network. For the two-way measurement scheme, the round-trip delay is measured first. The end-to-end delay is then approximated, by taking one-half of the round-trip delay. The merit of this scheme is that network synchronization is unnecessary, but the downside is that an asymmetric forward and return path causes poor estimation results. However, in the one-way delay measurement method (Shalunov et al., 2001), the receiving end collects the testing packets to determine the transmission delay. Regardless of whether a one-way or two-way delay measurement scheme is employed, due to their intrusive nature, there is a drawback such that the accuracy of measurement greatly depends on the sampling frequency and the sampling method. Clearly, insufficient testing packets make the measurement results unreliable. However, too many testing packets may cause extra burden on the network and change its actual load condition.

The end-to-end delay in the network domain mainly comprises of the queuing delay and the link propagation delay for the path. The propagation delay is intrinsic and fixed along the path. The variation of the end-to-end delay depends on the queue size seen by the arrival packets. On average, the greater the queue size, the longer the packet will stay in the hop. The queuing behaviour has a greater impact on the end-to-end delay performance.

Our measurement scheme is based on queue length monitoring. In (Grossglauser and Bolot, 1999), authors report different queue length distributions arising under different traffic conditions. For example, the tail of a queuing distribution can be exponential (Markovian type traffic) or hyperbolic (ON/OFF source with heavy-tailed on and off periods). In this paper, we consider both Markovian type and power-law traffic types, the local queue state distribution characteristic is estimated by measurement in order to deduce the end-to-end delay performance. Our methodology is applicable to network providers who may want to carry out the end-to-end delay performance evaluation for their customers. The scheme is based on non-intrusive queue length monitoring at each node and so it does not cause any intrusion into the network.

Our approach is to estimate each local queue length distribution by measurement. The distributions are further used to determine the end-to-end delay distribution. Non-intrusive measurement-based queue length distribution estimations are presented in various research works (Kesidis, 1999; Eun and Shroff, 2001). Generally, they share a common motive such that the main objective is to employ the queue length distribution estimate for QoS inference for different purposes like Admission Control¹ (Jamin et al., 1997; Breslau et al., 2000) or Capacity Adjustment (Kesidis, 1999). Different measurement techniques were proposed in the literatures but they can mainly be classified as (i) the incoming traffic measurement, (ii) direct queue length monitoring. In (Duffield, 1998), the incoming traffic is passively monitored and the batch of time-slotted measurement data is stored for queue length distribution estimation with analytical models like large deviation principle, LDP (Chang, 2000), maximum variance technique (Eun and Shroff, 2001). A queuing analysis based on the large deviation principle theoretically reveals that, for a wide-ranging traffic input, the tail of the queue length distribution decays exponentially in the regime of large buffers. The measured traffic statistic is used to determine the rate function (Chang, 2000). The tail of the queue length distribution can be determined by using this rate function, given knowledge of the service rate.

However, recent experimental results demonstrated that the traffic in packet networks may exhibit self-similarity that it is more bursty than the conventional Markovian traffic model. In the presence of power-law traffic, the tail of the queue length distribution will then exhibit power-law decay instead of exponential decay and the LDP will not be applicable to this scenario. Authors in (Eun and Shroff, 2001) proposed a novel technique, maximum variance technique (MVT), based on extreme value theory to estimate the queue length distribution. Similar to the technique using LDP, these works were inspired by the phenomenon of “rare event happens in the most likely way” and the fact that a single link will carry hundreds or even thousands of applications in a high-speed networks. Under this circumstance, the input process can then be assumed to be a general class of Gaussian processes which includes a large class of self-similar process. Hence, the queue length distribution is determined by finding the time scale (dominant time-scale) which maximizes the Standard Gaussian tail function. Both LDP and MVT provide the asymptotic approximation for the queue length distribution. An accurate estimation of the queue length estimation is not guaranteed. Furthermore, large measurement storage requirement may happen for better accuracy. Computational complexity is also not negligible, for instance, in MVT, it can be time-consuming to generate the statistic in various time-scales.

For the direct queue length monitoring, Kesidis (1999) proposed two-point measurement and three point queue length monitoring method for the queues with exponential tail distribution or heavy tailed distribution respectively. In this scheme, the occurrence of the queue length greater than several predefined sizes (2 or 3 points) are measured. Then, the tail of queue length distribution is obtained by extrapolating the points. Clearly, there is a trade-off in the choice of the monitoring points. When the monitoring points are very small, the measurement result is more accurate, but the extrapolation error is larger. In our scheme, accurate capture of the queue length distributions is also key to the success of our goal. We employ the latter approach (direct queue length monitoring). Only four measurement data are taken for each measurement period and so it does not cause a storage problem. In addition, the estimation result is less dependent on the measurement point compared with the scheme in (Kesidis, 1999). This means our methodology is more practical in real life scenarios. The rest of the paper is organized as follows: The first part of the paper illustrates our methodology based on Markovian traffic. The extension to the power-law traffic is presented in the second part. The simulation results are included in both sections. The final section provides the conclusion.

Section snippets

Preliminary

It is well known that the tail of the queue length distribution decays exponentially for a FIFO queue multiplexing Markovian traffic (Pitts and Schormans, 2000).

Fig. 1 shows the typical queue length distribution for a FIFO queue multiplexing Markovian traffic. It can be found that the queue length distribution can be divided into two regions often referred to the packet-scale region and the burst-scale region. The packet-scale queuing behaviour is caused by the random arrival patterns of the

Framework of measurement

Fig. 2 illustrates the framework of our measurement scheme. Queue length monitoring which will be discussed in Section 4, is carried out at each queue. After each measurement period, the measurement data is time-stamped, and is then used in the mathematical scheme.

Our main purpose is to perform measurement to estimate end-to-end performance in a path to determine if it satisfies the end-to-end delay requirement listed in the SLA. When the management unit needs the data to carry out the SLA

Measurement

Fig. 3 illustrates our queue length monitoring scheme. The queue is logically partitioned into two regions such as queue_high and queue_low regions by a partition point called q_p. The partition point should be large enough to be placed in the burst-scale region. Four measurement data are recorded during measurement period. If the current queue length seen by an incoming packet is greater than the partition point, then the data q_high and freq_high are updated, otherwise, q_low and freq_high. When the

Partition point selection

We performed simulations with NS2 (UCB/LBNL/VINT) to study the impact of the partition point on the queue state distribution estimation. The Markovian traffic is created with ON/OFF sources

Queue length monitoring and re-construction

This section illustrates how to extend our scheme to cope with power-law traffic. In the presence of power-law traffic, unlike Markovian traffic, the tail of queue length distribution will decay much more slowly, which is known as “heavy tailed”. The tail distribution originally as in Eq. (2) should now be modelled as q(x)=c_bx^η_b. The tail of the queue length distribution roughly forms a straight line in a log–log plot. c_b denotes the decay constant and η_b is the decay rate that represents the

Conclusion

In this paper we proposed a measurement methodology for the inference of the end-to-end packet delay performance. This scheme is considered as model-based. We illustrated how to perform the measurement in order to capture the delay distribution and so the end-to-end delay distribution in the presence of Markovian traffic or power-law traffic. Based on comparison with simulation results, this scheme is quite promising. Since the queue length is monitored non-intrusively, in contrast to active

References (22)

Agilent Technologies, 2000. Packet over Sonet/SDH (POS) functional testing. Product...
Aida, M., Ishibashi, K., Kanazawa, T., 2002. Compact-monitor: change-of-measure based passive/active monitoring...
Breslau, L., Jamin, S., Shenker, S., 2000. Comments on the performance of measurement based admission control...
C.S. Chang
Performance Guarantees in Communication Networks
(2000)
Cole, R.G., Kalbfleish, C., Romascanu, D., 2000. A framework for active probes for performance monitoring. Internet...
Duffield, N., 1998. Applications of large deviations to performance analysis with long-range dependent traffic....
Eun, D.Y., Shroff, N.B., 2001. A measurement-analytic framework for QoS estimation based on the dominant time scale....
I.S. Gradshteyn et al.
Table of Integrals, Series, and Products
(2000)
M. Grossglauser et al.
On the relevance of long-range dependence in network traffics
IEEE/ACM Transactions on Networking
(1999)
Jamin, S., Shenker, S.J., Danzig, P.B., 1997. Comparison of measurement-based admission control algorithm for...

Kesidis, G., 1999. Bandwidth adjustments using on-line packet-level measurements. SPIE conference on performance and...

Cited by (0)

View full text