Fail-Silent Assumptions in Fault-Tolerant Systems


(*) Corresponding author


Authors' affiliations


DOI's assignment:
the author of the article can submit here a request for assignment of a DOI number to this resource!
Cost of the service: euros 10,00 (for a DOI)

Abstract


Fail-silent units are implemented by using redundancy of system components. Errors are detected by comparison of the results produced by these components and in a case of a result mismatch the result is not delivered. This paper discusses and analyzes the definition of the fail silence in the context of safety-critical systems. Furthermore, it describes which items should be provided within a fault hypothesis for safety-critical systems. Instead of the definition of fail silence units, we propose the definition of detectable erroneous units (DEU), and describe a general model of a DEU for distributed fault-tolerant systems.
Copyright © 2017 Praise Worthy Prize - All rights reserved.

Keywords


Fail Silence; Fault Tolerance; Fault Hypothesis; Dependability Evaluation; Fault Injection

Full Text:

PDF


References


D. Powell and P. Verissimo. The Delta-4 Approach to Dependability in Open Distributed Computing Systems. In Proceedings of 18th Int. Symposium on Fault-Tolerant Computing (FTCS-18), pages 102–107, Tokyo, Japan, June 1988. IEEE Press.

A. Ademaj, H. Sivencrona, G. Bauer, and J. Torin. Evaluation of Fault Handling of the Time-Triggered Architecture with Bus and Star Topology. In IEEE International Conference on Dependable Systems and Networks (DSN 2003), pages 123–132, San Francisco, USA, June 2003.

C. Basile, W.Long, Z. Kalbarczyk, and R. Iyer. Group Communication Protocols Under Errors. In IEEE International Conference on Dependable Systems and Networks (DSN 2001) Proceedings of the 22nd International Symposium on Reliable Distributed Systems (SRDS’03), pages 35– 44, Florence, Italy, Oct. 2003.

H. Medeira, R. Some, F. Moreira, D. Costa, and D. Rennels. Evaluation of a COTS System for Space Applications. In International Conference on Dependable Systems and Networks, (DSN), pages 325–330, Washington D.C., USA, June 2002.

M. Rela, H. Madeira, and J. Silva. Experimental Evaluation of the Fail-Silent Behaviour in Programs with Consistency Checks. In Proc. 22nd International Symposium on Fault-Tolerant Computing, FTCS-26, pages 394–403, Sendai, Japan, June 1996.

C. Jones, M.-O. Killijian, H. Kopetz, E. Marsden, N. Moffat, M. Paulitsch, D. Powell, B. Randell, A. Romanovsky, and R. Stroud. Revised Concepts of DSoS Conceptual Model. Project Deliverable for Dependable Systems of Systems (DSoS). Research Report 35/2001, Vienna University of Technology, Real-Time System Group, Vienna, Austria, 2001.

L. Kaufman, S. Bhide, and B. Johnson. Modelling of Common-Mode Failures in Digital Embedded Systems. In Proceedings of the Reliability and Maintainability Symposium 2000. IEEE Press, pages 350–357, Los Angeles, USA, 2000.

C. Constantinescu. Impact of Deep Submicron Technology on Dependability of VLSI Circuits. In International Conference on Dependable Systems and Networks, DSN 2002, pages 205–209, Washington DC, USA, June 2002.

H. Kopetz. Fault Containment and Error Detection in TTP/C and FlexRay. In Proceedings of The Sixth International Symposium on Autonomous Decentralized Systems (ISADS ’03), pages 139–148, Pisa, Italy, Apr. 2003.

H. Kopetz and G. Bauer. The Time-Triggered Architecture. Proceedings of the IEEE, Special Issue on Modelling and Design of Embedded Software, 91(1):112–126, Jan. 2003.

D. Black, C. Low, and S. Shrivastava. The Voltan Application Programming Environment for Fail-Silent Processes. Distributed Systems Engineering, 5:66–77, June 1998.

M. Baleani, A. Ferrari, L. Mangeruca, A. Sangiovanni- Vincentelli, M. Peri, and S. Pezzini. Fault-Tolerant Platforms for Automotive Safety-Critical Applications. Proceedings of the 2003 International Conference on Compilers, Architecture and Synthesis for Embedded Systems Pages 170–177, San Jose, California, USA, Oct. 2003.

F. Brasileiro, P. Ezhilchelvan, S. Shrivastava, N. Speirs, and S. Tao. Implementing Fail-Silent Nodes for Distributed Systems. IEEE Transactions on computers, 45(11):1226–1238, Nov. 1996.

S. Webber and J. Beirne. The Stratus Architecture. In 21th International Symposium on Fault-Tolerant Computing (FTCS-21), pages 79–85, June 1991.

P. Bernstein. Sequoia: A Fault-Tolerant Tightly Coupled Multiprocessor for Transaction Processing. IEEE Transactions on Computers, 21(2):37–45, Feb. 1988.

J. Wensley. SIFT: The Design and Analysis of a Fault-Tolerant Computer for Aircraft Control. IEEE Transactions on Computers, 66:1240–1255, 1978.

J. Karlsson, J. A. P. Folkesson, Y. Crouzet, G. Leber, and J. Reisinger. Application of Three Physical Fault Injection Techniques in the Experimental Assessment of the MARS Architecture. In 5th IFIP Working Conference on Dependable Computing for Critical Applications (DCCA-5), pages 150–161, 1995.

J. Reisinger and A. Steiniger. The Design of a Fail-Silent Processing Node for the Predictable Hard Real-Time System MARS. Distributed System Eng. Journal, 1(2):104–111, 1993.

E. Fuchs. Validating Fail-Silence in the MARS Architecture. In 6th International Working Conference on Dependable Computing for Critical Applications (DCCA-6), Grainau, Germany, Mar. 1997.

H. Kopetz and N. Suri. Compositional Design of RT Systems: A Conceptual Basis for Specification of Linking Interfaces. The 6th IEEE International Symposium on Object-Oriented Real-Time Computing (ISORC03), Hokkaido, Japan, May 2003.

D. T. Stott, N. A. Speirs, J. Xu, S. Bagchi, K. Whisnant, Z. Kalbarczyk, and R. K. Iyer. Fault Injection Based Assessment of Fail-Silence Provided by Process Duplication versus Internal Error Detection. http://citeseer.ist.psu.edu/stott00fault.html.

P. Chevochot and I. Puaut. Experimental Evaluation of the Fail-Silent Behaviour of a Distributed Real-Time Run-Time Support Built from Cots Components. In IEEE International Conference on Dependable Systems and Networks (DSN 2001), pages 304–313, Gothenburg, Sweden, July 2001.

H. Madeira and J.G.Silva. Experimental Evaluation of the Fail-Silent Behaviour in Computers without Error Masking. In Proc. 22nd International Symposium on Fault-Tolerant Computing, 1994.


Refbacks

  • There are currently no refbacks.



Please send any question about this web site to info@praiseworthyprize.com
Copyright © 2005-2024 Praise Worthy Prize