Copyright © 2003 Elsevier B.V. All rights reserved.
Analyzing the effectiveness of fault-management architectures in layered distributed systems
Available online 4 November 2003.
Abstract
Fault management infrastructure in distributed systems includes manager processes and agents with various kinds of interactions for monitoring and surveillance of the status of the application software and hardware. The system architecture now includes these additional components and interactions, and they affect the system availability. This paper describes an architecture model called MAMA (Model for Availability Management Architecture) with an architecture definition language MAMA-dl for the combination of the application and management parts, and its analysis. The analysis extends the Fault Tolerant Layered Queueing Model to account for propagation of knowledge of the system state in the management sub-architecture. The model is demonstrated on a problem of placement of manager tasks in a system.
Author Keywords: MAMA; Fault management; Architecture
Article Outline
- 1. Introduction
- 2. Layered systems with a fault-management architecture
- 3. Modeling fault propagation
- 4. Modeling knowledge propagation
- 5. Performability algorithm
- 6. A comparison of four fault-management architectures
- 6.1. The layered distributed application
- 6.2. Four sample fault-management architectures
- 6.2.1. Architecture 1: a centralized management architecture
- 6.2.2. Architecture 2: a distributed management architecture
- 6.2.3. Architecture 3: a hierarchical management architecture
- 6.2.4. Architecture 4: a general network management architecture
- 6.3. Results
- 7. A language for combined software and management architecture
- 8. Best placement of managers
- 9. Conclusion
- Acknowledgements
- Appendix A. The MAMA Definition Language MAMA-dl
- References
- Vitae






E-mail Article
Add to my Quick Links

Cited By in Scopus (0)







