Abstract
Since its introduction nearly 20 years ago, the Byzantine Generals Problem has been the subject of many papers having the scrutiny of the fault tolerance community. Numerous Byzantine tolerant algorithms and architectures have been proposed. However, this problem is not yet sufficiently understood by those who design, build, and maintain systems with high dependability requirements. Today, there are still many misconceptions relating to Byzantine failure, what makes a system vulnerable, and indeed the very nature and reality of Byzantine faults. This paper revisits the Byzantine problem from a practitioner’s perspective. It has the intention to provide the reader with a working appreciation of the Byzantine failure from a practical as well as a theoretical perspective. A discussion of typical failure properties and the difficulties in preventing the associated failure propagation is presented. These are illustrated with real Byzantine failure observations. Finally, various architectural solutions to the Byzantine problem are presented.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
NOAA/American Red Cross: Thunderstorms and Lightning, safety brochure (1994)
RTCA Inc.: DO-254, Design Assurance Guidance for Airborne Electronic Hardware
Constantinescu, C.: Impact of Deep Submicron Technology on Dependability of VLSI Circuits. In: Proc. Dependable Systems and Networks (2002)
Systems Standard & Technology Council: Avionics Process Management Committee, http://www.geia.org/sstc/APM/
Kelling, N., Heck, W.: The Brake Project—Centralized Versus Distributed Redundancy for Brake-By-Wire Systems. Paper No 2002-01-0266, SAE (2002)
TTTech Computertechnik AG, Specification of the TTP/C Protocol V1.0
Lamport, L., Shostak, R., Pease, M.: The Byzantine Generals Problem. ACM Transactions on Programming Languages and Systems 4(3), 382–401 (1982)
Lavo, D., Larrabee, B., Chess, T.: Beyond the Byzantine Generals: Unexpected Behavior and Bridging Fault Diagnosis. In: Proc. Int. Test Conference, pp. 611–619 (1996)
Bohr, N.: The quantum postulate and the recent development of atomic theory. Nature 121, 580–589 (1928); Reprinted in Quantum Theory and Measurement
Chaney, T.: Measured Flip-Flop Responses to Marginal Triggering. IEEE Transactions of Computers C-32(12), 1207–1209 (1983)
Kopetz, H.: Real-Time Systems. In: Design Principles for Distributed Embedded Applications. Kluwer Academic Publishers, Boston (1997)
Fault Injection for TTA. Deliverable 5.1–5.5 Combined Report IST 1999 10748
Pfeifer, H., Schwier, D., von Henke, F.W.: Formal Verification for Time Triggered Clock Synchronization. In: Proc. 7th IFIP International Working Conference on Dependable Computing for Critical Applications (January 1999)
Ademaj, A.: Slightly-Off-Specification Failures in the Time Triggered Architecture. In: 7th IEEE Int. Workshop on High Level Design Validation and Test (October 2002)
Wensly, J.H., Lamport, L., Goldberg, J., Levitt, K.N., Melliar-Smith, P.M., Shostak, R.E., Weinstock, C.B.: SIFT: Design and Analysis of fault tolerant computer control for aircraft. Proceedings of IEEE 66(10), 1240–1255 (1978)
Hopkins, A., Smith, T., Lala, J.: FTMP—A Highly Reliable Fault Tolerant Multiprocessor for Aircraft. Proceedings of IEEE 66(10), 1221–1239 (1978)
Miner, P., Malekpour, M., Torres, W.: A Conceptual Design for a Reliable Optical Bus (ROBUS). In: Proc. 21st Digital Avionics Systems Conference (2002)
Hoyme, K., Driscoll, K.: SAFEbus. In: Proc. 11th Digital Avionics Systems Conference, October 5–9 (1992)
Kopetz, H., Bauer, G., Poledna, S.: Tolerating Arbitrary Node Failure in the Time- Triggered Architecture. Doc No 2001-01-0677, SAE (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Driscoll, K., Hall, B., Sivencrona, H., Zumsteg, P. (2003). Byzantine Fault Tolerance, from Theory to Reality. In: Anderson, S., Felici, M., Littlewood, B. (eds) Computer Safety, Reliability, and Security. SAFECOMP 2003. Lecture Notes in Computer Science, vol 2788. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39878-3_19
Download citation
DOI: https://doi.org/10.1007/978-3-540-39878-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20126-7
Online ISBN: 978-3-540-39878-3
eBook Packages: Springer Book Archive