Skip to main content
Log in

Combined performance and availability analysis of distributed resources in grid computing

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

This paper presents a mathematical model to evaluate the performance of grid resources when availability of the resources is taken into account. The proposed model uses continuous time Markov chains (CTMCs) to model the failure-repair behavior of a grid resource. In grid computing environment, a resource not only may fail during task execution, but also it can cancel its membership at any time. Hence, the proposed CTMC considers the availability of a grid resource to a grid user in both failure and membership refusal situations. After modeling the availability of the resource, the mean sojourn time of grid tasks in each of the availability states is estimated. Assigning the mean sojourn times of the tasks as performance levels to each of the CTMC’s states, a Markov reward model (MRM) representing the combined performance and availability measures is obtained. Computing the cumulative state probability of the CTMC and multiplying reward rates of the MRM’s states to each of the corresponding state probabilities, the expected accumulated sojourn time of grid tasks in each of the grid resources is achieved. An illustrative example is presented and the results obtained from the proposed model are reported in cases where various scheduling disciplines are considered inside the grid resource to simultaneously service grid and local tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Abbreviations

FTCS:

Fault tolerant computer system

CTMC:

Continuous time Markov chain

MRM:

Markov reward model

RMS:

Resource management system

GSPN:

Generalized stochastic Petri net

SAN:

Stochastic activity network

\(t\) :

Time

\(i\) :

Index of the system

\(\alpha \) :

Failure rate of the resource

\(\beta \) :

Repair rate of the resource

\(\lambda _\mathrm{l}\) :

Local tasks arrival rate

\(\mu _\mathrm{l}\) :

Resource service rate for local tasks

\(\lambda _\mathrm{g}\) :

Grid tasks arrival rate

\(\mu _\mathrm{g}\) :

Resource service rate for grid tasks

\(\mu \) :

Total service rate of the resource

\(Z(t)\) :

\(\hbox {t}\ge 0\), a random process representing the related CTMC

\(\Omega \) :

State space of CTMC

\(N\) :

Number of states in CTMC

\(Q\) :

Generator matrix

\(P(t)\) :

Transient probability vector

\(p_{0}\) :

Initial probability vector

\(\pi \) :

Steady state probability vector

\(L(t)\) :

Cumulative state probability vector during time period \(\left[ {0,t} \right) \)

\(X(t)\) :

Instantaneous reward rate of the related MRM

\(r\) :

Reward rate vector over \(Z( t)\)

\(\Phi (t)\) :

Accumulative reward over the period \(\left[ {0,t} \right) \)

\({E\left[ {X(t)}\right] }\) :

Expected instantaneous reward rate

\({E\left[ X \right] }\) :

Expected steady state reward

\({E\left[ {\Phi (t)} \right] }\) :

Expected accumulated reward rate

References

  1. Foster I, Kesselman C (2004) The Grid 2: blueprint for a new computing infrastructure, 2nd edn. Morgan Kaufmann, San Francisco

  2. Krauter K, Buyya R, Maheswaran M (2002) A taxonomy and survey of grid resource management systems for distributed computing. Softw Pract experience 32(2):135–164

    Article  MATH  Google Scholar 

  3. Levitin G, Dai Y-S (2007) Service reliability and performance in grid system with star topology. Reliab Eng System Saf 92(1):40–46

    Google Scholar 

  4. Dai Y-S, Levitin G (2006) Reliability and performance of tree-structured grid services. IEEE Trans Reliab 55(2):337–349

    Article  Google Scholar 

  5. Azgomi MA, Entezari-Maleki R (2010) Task scheduling modelling and reliability evaluation of grid services using coloured Petri nets. Futur Gener Comput Systems 26(8):1141–1150

    Article  Google Scholar 

  6. Entezari-Maleki R, Movaghar A (2011) Availability modeling of grid computing environments using SANs. In: The 19th International Conference on Software, Telecommunications and Computer Networks (SoftCOM 2011), Split, Croatia, September 15–17, 2011, pp 1–6

  7. Caron E, Garonne V, Tsaregorodtsev A (2007) Definition, modeling and simulation of a grid computing scheduling system for high throughput computing. Futur Gener Comput Systems 23(8):968–976

    Article  Google Scholar 

  8. Parsa S, Entezari-Maleki R (2009) Modeling and throughput analysis of grid task scheduling using stochastic Petri nets. In: The 2009 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA’09), Las Vegas, July 13–16, 2009, pp 458–464

  9. Berten V, Devillers R, Louchard G (2007) FIFO queuing of constant length fully synchronous jobs. In: The 4th International Conference on Grid Service Engineering and Management (GSEM ’07), Leipzig, September 25–26, 2007, pp 81–95

  10. Jayasinghe M, Tari Z, Zeephongsekul P, Zomaya AY (2011) Task assignment in multiple server farms using preemptive migration and flow control. J Parallel Distributed Comput 71(12):1608–1621

    Article  MATH  Google Scholar 

  11. Zhang L, Zhang B, Zhang Y, Zeng L (2011) Efficient scheduling strategies in high performance computing service platform for Shanghai Colleges. In: The 3rd International Conference on Computer Research and Development (ICCRD’11), Shanghai, March 11–13, 2011, pp 176–180

  12. Entezari-Maleki R, Movaghar A (2012) A probabilistic task scheduling method for grid environments. Futur Gener Comput Systems 28(3):513–524

    Article  Google Scholar 

  13. Adabi S, Movaghar A, Rahmani AM, Beigy H (2013) Market\_based grid resource allocation using new negotiation model. J Netw Comput Appl 36(1):543–565

    Article  Google Scholar 

  14. Parsa S, Entezari-Maleki R (2012) Task dispatching approach to reduce the number of waiting tasks in grid environments. J Supercomput 59(1):469–485

    Article  Google Scholar 

  15. Parsa S, Entezari-Maleki R (2012) A queuing network model for minimizing the total makespan of computational grids. Comput Electr Eng 38(4):827–839

    Article  Google Scholar 

  16. Trivedi KS, Muppala JK, Woolet SP, Haverkort BR (1992) Composite performance and dependability analysis. Perform Eval 14(2–3):197–215

    Article  MATH  Google Scholar 

  17. Reibman AL (1990) Modeling the effect of reliability on performance. IEEE Trans Reliab 39(3):314–320

    Article  Google Scholar 

  18. Dai Y-S, Xie M, Poh K-L (2008) Availability modeling and cost optimization for the grid resource management system. IEEE Trans Systems Man Cybern Part A Systems Hum 38(1):170–179

    Article  Google Scholar 

  19. Parsa S, Parand F-A (2012) Estimation of service reliability and performance in grid environment. J King Saud Univ Eng Sci 24(2):151–157

    Google Scholar 

  20. Trivedi KS, Ciardo G, Malhotra M, Sahner RA (1993) Dependability and performability analysis. NASA Contractor Report 191565, Report no. 93–85, November 1993

  21. Trivedi KS, Malhotra M (1993) Reliability and performability techniques and tools: a survey. In: The 7th ITG/GI Conference on Measurement, Modelling and Evaluation of Computer and Communication Systems (MMB), Germany, September 1993, pp 27–48

  22. Meyer JF (1980) On evaluating the performability of degradable computing systems. IEEE Trans Comput c–29(8):720–731

    Article  Google Scholar 

  23. Ma Y, Han JJ, Trivedi KS (2001) Composite performance and availability analysis of wireless communication networks. IEEE Trans Vehicular Technol 50(5):1216–1223

    Article  Google Scholar 

  24. Beaudry MD (1978) Performance-related reliability measures for computing systems. IEEE Transact Comput C–27(6):540–547

    Article  Google Scholar 

  25. Hussain H et al (2013) A survey on resource allocation in high performance distributed computing systems. Parallel Comput 39(11):709–736

    Article  MathSciNet  Google Scholar 

  26. Meyer JF (1982) Closed-form solutions of performability. IEEE Transact Comput C–31(7):648–657

    Article  Google Scholar 

  27. Balasangameshwaraa J, Rajub N (2012) A hybrid policy for fault tolerant load balancing in grid computing environments. J Netw Comput Appl 35(1):412–422

    Article  Google Scholar 

  28. Sun X-H, Wu M (2007) Quality of service of grid computing: resource sharing. In: The 6th International Conference on Grid and Cooperative Computing (GCC’07), Los Alamitos, August 16–18, 2007, pp 395–402

  29. Bolch G, Greiner S, Meer HD, Trivedi KS (2006) Queueing networks and Markov chains: modeling and performance evaluation with computer science applications, 2nd edn., Wiley, New York

  30. Allen AO (1990) Probability, statistics, and queueing theory with computer science applications, 2nd edn., Harcourt Brace Jovanovich, New York

Download references

Acknowledgments

The authors would like to thank Iran Telecommunication Research Center (ITRC) for their support. This research was also supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the CPRC (Communications Policy Research Center) support program supervised by the KCA (Korea Communications Agency)(KCA-1194100004).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Reza Entezari-Maleki.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Entezari-Maleki, R., Mohammadkhan, A., Yeom, H.Y. et al. Combined performance and availability analysis of distributed resources in grid computing. J Supercomput 69, 827–844 (2014). https://doi.org/10.1007/s11227-014-1184-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-014-1184-z

Keywords

Navigation