Combined performance and availability analysis of distributed resources in grid computing

Entezari-Maleki, Reza; Mohammadkhan, Ali; Yeom, Heon Young; Movaghar, Ali

doi:10.1007/s11227-014-1184-z

Published: 17 April 2014

Volume 69, pages 827–844, (2014)

The Journal of Supercomputing

Reza Entezari-Maleki¹, Ali Mohammadkhan¹, Heon Young Yeom² & Ali Movaghar¹

Abstract

This paper presents a mathematical model for evaluating the performance of grid resources when the availability of those resources is taken into account. The proposed model uses continuous time Markov chains (CTMCs) to model the failure-repair behavior of a grid resource. In a grid computing environment, a resource may not only fail during task execution but may also cancel its membership at any time. Hence, the proposed CTMC captures the availability of a grid resource to a grid user in both failure and membership refusal situations. After the availability of the resource is modeled, the mean sojourn time of grid tasks in each availability state is estimated. Assigning these mean sojourn times as performance levels to the corresponding CTMC states yields a Markov reward model (MRM) that represents the combined performance and availability measures. The cumulative state probabilities of the CTMC are then computed, and each is multiplied by the reward rate of the corresponding MRM state; summing these products gives the expected accumulated sojourn time of grid tasks in each grid resource. An illustrative example is presented, and results from the proposed model are reported for several scheduling disciplines used inside the grid resource to service grid and local tasks simultaneously.
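The computation outlined in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' model: the three-state chain (up, failed, membership cancelled), the rate names `GAMMA` and `DELTA`, every numeric value, and the reward vector `R` are hypothetical placeholders. The sketch integrates the transient equations of a CTMC with a classical fourth-order Runge-Kutta scheme, accumulates the cumulative state probabilities, and weights them by the per-state reward rates.

```python
import math

# Hypothetical three-state availability model of one grid resource:
# state 0 = up, state 1 = failed, state 2 = membership cancelled.
ALPHA = 0.01   # failure rate (assumed value)
BETA = 0.5     # repair rate (assumed value)
GAMMA = 0.002  # membership cancellation rate (hypothetical symbol and value)
DELTA = 0.1    # rejoin rate (hypothetical symbol and value)

# Generator matrix Q: off-diagonal entries are transition rates,
# each diagonal entry makes its row sum to zero.
Q = [
    [-(ALPHA + GAMMA), ALPHA, GAMMA],
    [BETA, -BETA, 0.0],
    [DELTA, 0.0, -DELTA],
]
P0 = [1.0, 0.0, 0.0]   # the resource starts in the "up" state
R = [0.8, 2.5, 12.0]   # per-state reward rates (placeholder values)


def vec_mat(p, q):
    """Row vector times matrix: returns p Q."""
    n = len(p)
    return [math.fsum(p[i] * q[i][j] for i in range(n)) for j in range(n)]


def axpy(p, k, c):
    """Return p + c * k, element by element."""
    return [pi + c * ki for pi, ki in zip(p, k)]


def transient_and_cumulative(p0, q, t, steps=10000):
    """Integrate dP/dt = P Q and dL/dt = P over [0, t) with RK4.

    Returns the pair (P(t), L(t)).
    """
    h = t / steps
    p = list(p0)
    cum = [0.0] * len(p0)
    for _ in range(steps):
        m1 = p
        k1 = vec_mat(m1, q)
        m2 = axpy(p, k1, 0.5 * h)
        k2 = vec_mat(m2, q)
        m3 = axpy(p, k2, 0.5 * h)
        k3 = vec_mat(m3, q)
        m4 = axpy(p, k3, h)
        k4 = vec_mat(m4, q)
        # The derivative of L is P itself, so its RK4 stages are m1..m4.
        cum = [c + h / 6.0 * (a + 2.0 * b + 2.0 * d + e)
               for c, a, b, d, e in zip(cum, m1, m2, m3, m4)]
        p = [pi + h / 6.0 * (a + 2.0 * b + 2.0 * d + e)
             for pi, a, b, d, e in zip(p, k1, k2, k3, k4)]
    return p, cum


def expected_accumulated_reward(r, cum):
    """Sum of reward rate times cumulative probability over all states."""
    return math.fsum(ri * ci for ri, ci in zip(r, cum))


T = 100.0  # length of the observation period (assumed value)
P_T, L_T = transient_and_cumulative(P0, Q, T)
print("P(T) =", P_T)
print("L(T) =", L_T)
print("E[Phi(T)] =", expected_accumulated_reward(R, L_T))
```

Because the state probabilities sum to one at every instant, the entries of `L_T` sum to `T`, which gives a quick sanity check on the integration. A matrix-exponential or uniformization solver would be a natural replacement for the Runge-Kutta loop on larger or stiffer chains.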

Abbreviations

FTCS:: Fault tolerant computer system
CTMC:: Continuous time Markov chain
MRM:: Markov reward model
RMS:: Resource management system
GSPN:: Generalized stochastic Petri net
SAN:: Stochastic activity network
\(t\) :: Time
\(i\) :: Index of the system
\(\alpha \) :: Failure rate of the resource
\(\beta \) :: Repair rate of the resource
\(\lambda _\mathrm{l}\) :: Local tasks arrival rate
\(\mu _\mathrm{l}\) :: Resource service rate for local tasks
\(\lambda _\mathrm{g}\) :: Grid tasks arrival rate
\(\mu _\mathrm{g}\) :: Resource service rate for grid tasks
\(\mu \) :: Total service rate of the resource
\(Z(t)\) :: \(t \ge 0\), a random process representing the related CTMC
\(\Omega \) :: State space of CTMC
\(N\) :: Number of states in CTMC
\(Q\) :: Generator matrix
\(P(t)\) :: Transient probability vector
\(p_{0}\) :: Initial probability vector
\(\pi \) :: Steady state probability vector
\(L(t)\) :: Cumulative state probability vector during time period \(\left[ {0,t} \right) \)
\(X(t)\) :: Instantaneous reward rate of the related MRM
\(r\) :: Reward rate vector over \(Z( t)\)
\(\Phi (t)\) :: Accumulative reward over the period \(\left[ {0,t} \right) \)
\({E\left[ {X(t)}\right] }\) :: Expected instantaneous reward rate
\({E\left[ X \right] }\) :: Expected steady state reward
\({E\left[ {\Phi (t)} \right] }\) :: Expected accumulated reward
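
For orientation, the symbols listed above are connected by the standard CTMC and Markov reward model identities. The summary below follows textbook definitions rather than the paper's own numbered equations, and it assumes that the subscript \(i\) ranges over the states in \(\Omega\).

```latex
\begin{align*}
\frac{dP(t)}{dt} &= P(t)\,Q, \qquad P(0) = p_{0} \\
\pi Q &= 0, \qquad \sum_{i \in \Omega} \pi_{i} = 1 \\
L(t) &= \int_{0}^{t} P(u)\,du \\
X(t) &= r_{Z(t)}, \qquad \Phi(t) = \int_{0}^{t} X(u)\,du \\
E\left[X(t)\right] &= \sum_{i \in \Omega} r_{i}\,P_{i}(t) \\
E\left[X\right] &= \sum_{i \in \Omega} r_{i}\,\pi_{i} \\
E\left[\Phi(t)\right] &= \sum_{i \in \Omega} r_{i}\,L_{i}(t)
\end{align*}
```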

Acknowledgments

The authors would like to thank Iran Telecommunication Research Center (ITRC) for their support. This research was also supported by the MSIP (Ministry of Science, ICT and Future Planning), Korea, under the CPRC (Communications Policy Research Center) support program supervised by the KCA (Korea Communications Agency)(KCA-1194100004).

Author information

Authors and Affiliations

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
Reza Entezari-Maleki, Ali Mohammadkhan & Ali Movaghar
School of Computer Science and Engineering, Seoul National University, Seoul, South Korea
Heon Young Yeom

Corresponding author

Correspondence to Reza Entezari-Maleki.

About this article

Cite this article

Entezari-Maleki, R., Mohammadkhan, A., Yeom, H.Y. et al. Combined performance and availability analysis of distributed resources in grid computing. J Supercomput 69, 827–844 (2014). https://doi.org/10.1007/s11227-014-1184-z

Published: 17 April 2014
Issue Date: August 2014
DOI: https://doi.org/10.1007/s11227-014-1184-z
