Abstract
This paper presents a mathematical model to evaluate the performance of grid resources when availability of the resources is taken into account. The proposed model uses continuous time Markov chains (CTMCs) to model the failure-repair behavior of a grid resource. In grid computing environment, a resource not only may fail during task execution, but also it can cancel its membership at any time. Hence, the proposed CTMC considers the availability of a grid resource to a grid user in both failure and membership refusal situations. After modeling the availability of the resource, the mean sojourn time of grid tasks in each of the availability states is estimated. Assigning the mean sojourn times of the tasks as performance levels to each of the CTMC’s states, a Markov reward model (MRM) representing the combined performance and availability measures is obtained. Computing the cumulative state probability of the CTMC and multiplying reward rates of the MRM’s states to each of the corresponding state probabilities, the expected accumulated sojourn time of grid tasks in each of the grid resources is achieved. An illustrative example is presented and the results obtained from the proposed model are reported in cases where various scheduling disciplines are considered inside the grid resource to simultaneously service grid and local tasks.
Similar content being viewed by others
Abbreviations
- FTCS:
-
Fault tolerant computer system
- CTMC:
-
Continuous time Markov chain
- MRM:
-
Markov reward model
- RMS:
-
Resource management system
- GSPN:
-
Generalized stochastic Petri net
- SAN:
-
Stochastic activity network
- \(t\) :
-
Time
- \(i\) :
-
Index of the system
- \(\alpha \) :
-
Failure rate of the resource
- \(\beta \) :
-
Repair rate of the resource
- \(\lambda _\mathrm{l}\) :
-
Local tasks arrival rate
- \(\mu _\mathrm{l}\) :
-
Resource service rate for local tasks
- \(\lambda _\mathrm{g}\) :
-
Grid tasks arrival rate
- \(\mu _\mathrm{g}\) :
-
Resource service rate for grid tasks
- \(\mu \) :
-
Total service rate of the resource
- \(Z(t)\) :
-
\(\hbox {t}\ge 0\), a random process representing the related CTMC
- \(\Omega \) :
-
State space of CTMC
- \(N\) :
-
Number of states in CTMC
- \(Q\) :
-
Generator matrix
- \(P(t)\) :
-
Transient probability vector
- \(p_{0}\) :
-
Initial probability vector
- \(\pi \) :
-
Steady state probability vector
- \(L(t)\) :
-
Cumulative state probability vector during time period \(\left[ {0,t} \right) \)
- \(X(t)\) :
-
Instantaneous reward rate of the related MRM
- \(r\) :
-
Reward rate vector over \(Z( t)\)
- \(\Phi (t)\) :
-
Accumulative reward over the period \(\left[ {0,t} \right) \)
- \({E\left[ {X(t)}\right] }\) :
-
Expected instantaneous reward rate
- \({E\left[ X \right] }\) :
-
Expected steady state reward
- \({E\left[ {\Phi (t)} \right] }\) :
-
Expected accumulated reward rate
References
Foster I, Kesselman C (2004) The Grid 2: blueprint for a new computing infrastructure, 2nd edn. Morgan Kaufmann, San Francisco
Krauter K, Buyya R, Maheswaran M (2002) A taxonomy and survey of grid resource management systems for distributed computing. Softw Pract experience 32(2):135–164
Levitin G, Dai Y-S (2007) Service reliability and performance in grid system with star topology. Reliab Eng System Saf 92(1):40–46
Dai Y-S, Levitin G (2006) Reliability and performance of tree-structured grid services. IEEE Trans Reliab 55(2):337–349
Azgomi MA, Entezari-Maleki R (2010) Task scheduling modelling and reliability evaluation of grid services using coloured Petri nets. Futur Gener Comput Systems 26(8):1141–1150
Entezari-Maleki R, Movaghar A (2011) Availability modeling of grid computing environments using SANs. In: The 19th International Conference on Software, Telecommunications and Computer Networks (SoftCOM 2011), Split, Croatia, September 15–17, 2011, pp 1–6
Caron E, Garonne V, Tsaregorodtsev A (2007) Definition, modeling and simulation of a grid computing scheduling system for high throughput computing. Futur Gener Comput Systems 23(8):968–976
Parsa S, Entezari-Maleki R (2009) Modeling and throughput analysis of grid task scheduling using stochastic Petri nets. In: The 2009 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA’09), Las Vegas, July 13–16, 2009, pp 458–464
Berten V, Devillers R, Louchard G (2007) FIFO queuing of constant length fully synchronous jobs. In: The 4th International Conference on Grid Service Engineering and Management (GSEM ’07), Leipzig, September 25–26, 2007, pp 81–95
Jayasinghe M, Tari Z, Zeephongsekul P, Zomaya AY (2011) Task assignment in multiple server farms using preemptive migration and flow control. J Parallel Distributed Comput 71(12):1608–1621
Zhang L, Zhang B, Zhang Y, Zeng L (2011) Efficient scheduling strategies in high performance computing service platform for Shanghai Colleges. In: The 3rd International Conference on Computer Research and Development (ICCRD’11), Shanghai, March 11–13, 2011, pp 176–180
Entezari-Maleki R, Movaghar A (2012) A probabilistic task scheduling method for grid environments. Futur Gener Comput Systems 28(3):513–524
Adabi S, Movaghar A, Rahmani AM, Beigy H (2013) Market\_based grid resource allocation using new negotiation model. J Netw Comput Appl 36(1):543–565
Parsa S, Entezari-Maleki R (2012) Task dispatching approach to reduce the number of waiting tasks in grid environments. J Supercomput 59(1):469–485
Parsa S, Entezari-Maleki R (2012) A queuing network model for minimizing the total makespan of computational grids. Comput Electr Eng 38(4):827–839
Trivedi KS, Muppala JK, Woolet SP, Haverkort BR (1992) Composite performance and dependability analysis. Perform Eval 14(2–3):197–215
Reibman AL (1990) Modeling the effect of reliability on performance. IEEE Trans Reliab 39(3):314–320
Dai Y-S, Xie M, Poh K-L (2008) Availability modeling and cost optimization for the grid resource management system. IEEE Trans Systems Man Cybern Part A Systems Hum 38(1):170–179
Parsa S, Parand F-A (2012) Estimation of service reliability and performance in grid environment. J King Saud Univ Eng Sci 24(2):151–157
Trivedi KS, Ciardo G, Malhotra M, Sahner RA (1993) Dependability and performability analysis. NASA Contractor Report 191565, Report no. 93–85, November 1993
Trivedi KS, Malhotra M (1993) Reliability and performability techniques and tools: a survey. In: The 7th ITG/GI Conference on Measurement, Modelling and Evaluation of Computer and Communication Systems (MMB), Germany, September 1993, pp 27–48
Meyer JF (1980) On evaluating the performability of degradable computing systems. IEEE Trans Comput c–29(8):720–731
Ma Y, Han JJ, Trivedi KS (2001) Composite performance and availability analysis of wireless communication networks. IEEE Trans Vehicular Technol 50(5):1216–1223
Beaudry MD (1978) Performance-related reliability measures for computing systems. IEEE Transact Comput C–27(6):540–547
Hussain H et al (2013) A survey on resource allocation in high performance distributed computing systems. Parallel Comput 39(11):709–736
Meyer JF (1982) Closed-form solutions of performability. IEEE Transact Comput C–31(7):648–657
Balasangameshwaraa J, Rajub N (2012) A hybrid policy for fault tolerant load balancing in grid computing environments. J Netw Comput Appl 35(1):412–422
Sun X-H, Wu M (2007) Quality of service of grid computing: resource sharing. In: The 6th International Conference on Grid and Cooperative Computing (GCC’07), Los Alamitos, August 16–18, 2007, pp 395–402
Bolch G, Greiner S, Meer HD, Trivedi KS (2006) Queueing networks and Markov chains: modeling and performance evaluation with computer science applications, 2nd edn., Wiley, New York
Allen AO (1990) Probability, statistics, and queueing theory with computer science applications, 2nd edn., Harcourt Brace Jovanovich, New York
Acknowledgments
The authors would like to thank Iran Telecommunication Research Center (ITRC) for their support. This research was also supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the CPRC (Communications Policy Research Center) support program supervised by the KCA (Korea Communications Agency)(KCA-1194100004).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Entezari-Maleki, R., Mohammadkhan, A., Yeom, H.Y. et al. Combined performance and availability analysis of distributed resources in grid computing. J Supercomput 69, 827–844 (2014). https://doi.org/10.1007/s11227-014-1184-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-014-1184-z