CENG 525

Fault Tolerant Computing

Fault modeling, testing, and redundancy techniques to achieve fault tolerance in computer systems; error detection, failure recovery, and error coverage; current research in the field.

Week Topics
1 Introduction to fault tolerance
2 Digital circuits and fault modeling
3 Testing for combinational and sequential circuits
4 Testing of microprocessor-based systems
5 Error detection, self-checking modules
6 Malfunction diagnosis, redundancy
7 Midterm
8 Software reliability
9 Resilient algorithms
10 Error coverage
11 Vulnerability discovery
12 Failure recovery
13 Current research in the field
14 Current research in the field