Automatically increasing the fault-tolerance of distributed algorithms

G Neiger, S Toueg - Journal of Algorithms, 1990 - Elsevier
The design of fault-tolerant distributed systems is a costly and difficult task. Its cost and
difficulty increase dramatically with the severity of failures that a system must tolerate. This
task is simplified through methods that automatically translate protocols tolerant of “benign”
failures into ones tolerant of more “severe” failures. This paper describes two new translation
mechanisms for synchronous systems: one translates protocols tolerant of crash failures into
protocols tolerant of general omission failures and the other from general omission failures…