Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
The analytical perspective of RBD differs fundamentally from both FTA and FMEA. FTA asks: “How did this system failure occur?” (top-down, deductive). FMEA asks: “If this component fails, what happens?” (bottom-up, inductive). RBD asks a more foundational question: “For the system to operate successfully, which components must function simultaneously?” This is a forward-modeling perspective—starting from component working states and deriving system success conditions through logical connections. This mindset makes RBD uniquely suited for quantitative trade-off analysis during the architecture design phase: “If I configure the system this way, what reliability can I expect?”
IEC 61078 defines a reliability block diagram as “an organized graphical representation of the functional blocks and their logical interconnections required for system success.” Each block in the diagram represents a functional unit (component, subsystem, or function), and the connecting lines represent reliability dependency relationships—not physical connections or signal flow paths. This is the most commonly misunderstood aspect of RBD: the diagram depicts reliability logic, not electrical schematics or piping and instrumentation diagrams (P&ID). Translating a physical architecture into its RBD equivalent is the first core competency every RBD practitioner must master.
In an RBD, a series configuration means that every block must function correctly for the system to succeed. This is logically equivalent to an AND gate—all components must be working for system success. Failure of any single block causes total system failure. This is the most common and simultaneously the most reliability-fragile system structure.
Series system reliability equation (IEC 61078, Equation 1):
RS = RA × RB × RC × … × RZ
Example: A system consisting of a power supply (R=0.99), controller (R=0.98), and actuator (R=0.97) connected in series yields a total reliability of merely 0.99 × 0.98 × 0.97 = 0.941 (94.1%). Despite each individual component exceeding 97% reliability, the series structure drives the system reliability below that of the worst single component. As the number of series-connected blocks increases, system reliability decays exponentially—this is the fundamental mathematical reason why complex systems must incorporate redundancy.
Parallel configuration is the core mechanism for enhancing system reliability. When two or more blocks are connected in parallel, at least one block must function for system success. This is equivalent to OR logic—the system fails only when all parallel blocks fail simultaneously. IEC 61078 uses a two-block parallel arrangement (Figure 9) as the elementary model:
Parallel system reliability equation (IEC 61078, Equation 2):
RS = RA + RB – RA × RB
Example: If each power supply module has a reliability of 0.9, a single-channel supply yields only 90% reliability (approximately 36.5 days of downtime per year). With dual-channel parallel redundancy, RS = 0.9 + 0.9 – 0.81 = 0.99 (99%). By adding one extra block, system reliability jumps from 90% to 99%—a tenfold reduction in failure probability. This is the mathematical foundation behind the engineering adage: “build one good system from two mediocre ones.”
Generalized parallel formula for n independent blocks configured in full parallel redundancy:
RS = 1 – Π(1 – Ri) (i = 1 to n)
IEC 61078 also presents mixed configurations (Figures 2-5)—duplicated series branches (parallel-of-series, i.e., first series then parallel) and series duplicated branches (series-of-parallel, i.e., first parallel then series). Although these two configurations may appear physically similar, their reliability calculation paths are distinct, reflecting fundamentally different system success condition definitions.
The m-out-of-n voting structure (also called K/N gate) sits between pure series (all must work) and pure parallel (at least one must work)—at least m out of n identical blocks must function for system success. The most prevalent configurations are 2/3 voting (e.g., triple-redundant sensor systems) and 2/4 redundancy.
M/N reliability with identical items (IEC 61078, Equation 5):
RS = Σ C(n, r) × Rn-r × (1-R)r (r = 0 to n-m)
For a 2/3 voting system (three identical items, at least two must work) with individual component reliability R=0.9:
RS = R3 + 3 × R2 × (1-R) = 0.729 + 3 × 0.81 × 0.1 = 0.729 + 0.243 = 0.972
When m=1, the m/n configuration degenerates to full parallel; when m=n, it degenerates to pure series. The engineering value of m/n structures lies in their ability to tolerate up to (n-m) simultaneous failures while avoiding the high hardware cost of full parallel redundancy. Aircraft engine thrust control systems use 2/3 voting to accept a single sensor fault; nuclear reactor protection systems commonly deploy 2/4 voting logic to tolerate a single channel spurious actuation or failure-to-trip.
| Configuration | Logic Equivalent | System Success Condition | Reliability Formula (Independent) | Typical Application |
|---|---|---|---|---|
| Series | AND gate | All blocks must work | RS = ΠRi | Power + controller + actuator chain, communication links |
| Full Parallel | OR gate | At least one block works | RS = 1 – Π(1-Ri) | 1+1 redundant power supplies, dual cooling pumps |
| 2/3 Voting (2oo3) | 2/3 gate | At least 2 of 3 work | RS = 3R2 – 2R3 | Triple-redundant sensors, flight control voters |
| 2/4 Voting (2oo4) | 2/4 gate | At least 2 of 4 work | RS = 3R4 – 8R3 + 6R2 | Nuclear reactor protection systems |
| Cold Standby | Switching logic | Primary fails; backup available and successfully switched in | RS(t) = e-λt(1+λt) (equal λ) | Power backup, communication link backup |
IEC 61078 Clauses 6 and 7 present a complete methodology for RBD development. This is not simply a “draw a few boxes and connect them” exercise—it is a systematic engineering analysis activity requiring disciplined systems thinking:
(1) Define System Success/Failure Criteria — The most critical and most frequently skipped step in RBD construction. IEC 61078 (6.1) stipulates that the system success definition must encompass: functional requirements (which functions the system must perform), time interval (over what duration), operating conditions (environment, load range), and permissible degradation levels. For instance, “the system operates normally” is an unqualified definition; “the 24 V control power supply, operating in -40 to +85 °C ambient temperature, with rated load of 0-5 A, maintains output voltage deviation within ±5% for a continuous mission duration of 8,760 hours” is a qualified definition.
(2) Identify Functional Components and Assign Blocks — Decompose the system into functionally distinguishable components or sub-functions. Each block should represent a “reliability-independent entity”—its failure probability must not depend on the states of other blocks (independence assumption, 5.1). Block granularity directly determines model accuracy: too coarse, and internal redundancy is masked; too fine, and the model becomes unmanageable.
(3) Determine Reliability Logic Relationships Between Blocks — Based on the system success condition, judge which blocks are in series (all must work), which are in parallel (at least one must work), and which form m/n voting relationships. This is the core step of translating system architecture language into reliability logic language.
(4) Draw the RBD with Standardized Symbols — Input (I) and output (O) terminals mark the connection endpoints. IEC 61078 employs standardized block symbols and connecting lines.
(5) Evaluate the Model — Apply series/parallel/m-n formulas for preliminary reliability calculation. For simple RBDs, analytical closed-form solutions are directly obtainable. For complex RBDs (containing bridge structures or common blocks), employ truth table methods, total probability theorem decomposition, or Boolean disjointing techniques (Clause 8).
Not all systems can be reduced to pure series/parallel combinations. IEC 61078 uses a bridge circuit (Figure 8) as its canonical complex case—a structure containing a central bridge arm A, with success paths: (B1 AND C1) OR (A AND C1) OR (A AND C2) OR (B2 AND C2), where C1 and C2 represent two loads (e.g., port and starboard engines in a light aircraft fuel supply analogy). This structure cannot be evaluated by direct application of series/parallel formulas because A participates in two distinct success paths simultaneously.
IEC 61078 provides two methods for handling complex RBDs:
Method 1: Total Probability Theorem (8.1.2) — Select a “critical block” (such as the central element A in the bridge structure), compute the conditional system reliability under the two mutually exclusive states of that block (“working” and “failed”), then weight by the reliability of that block:
RS = Pr(System Success | A working) × RA + Pr(System Success | A failed) × (1-RA)
When A is working, the bridge structure collapses to a simplified reliability model (Figure 12) that is easy to evaluate. When A has failed, it collapses to a different simplified model (Figure 11). This decomposition approach is well-suited for structures with a small number of bridge arms (typically 1-3). The key is selecting the critical block that maximizes the simplification of the remaining structure.
Method 2: Truth Table Method (8.1.3) — Enumerate all possible “working/failed” state combinations of all blocks (2n combinations for n blocks), determine row-by-row whether each combination satisfies the system success condition, and sum the probabilities of all success-state rows. IEC 61078 Table 1 demonstrates a truth table for a 1-out-of-3 system (23=8 rows), and Table 2 demonstrates the truth table for the bridge structure (25=32 rows). The truth table method is universally applicable to RBDs of any complexity, but when the number of blocks exceeds 15-20 (215=32,768 rows), manual computation becomes infeasible and software tools or Monte Carlo simulation are required (the standard notes that simulation is outside its scope).
IEC 61078 Clause 8.2 provides an in-depth treatment of the common blocks (repeated blocks) problem—when the same block appears at multiple positions within an RBD. This is exceedingly common in real systems: a single power supply module may feed both the controller and the communication interface, appearing in two independent success paths in the RBD even though they reference the same physical device.
Evaluating an RBD containing common blocks under the independence assumption leads to:
IEC 61078 Annex B provides the Boolean Disjointing method—a technique that uses Boolean algebra to transform reliability expressions containing repeated events into a sum of mutually exclusive terms free of repeated events. The core derivation formula (Annex B, Equation B.1):
RS = P(A) + P(¬A ∧ B) + P(¬A ∧ 가B ∧ C) + …
The essence of this method: iteratively select blocks and expand the system success probability into a mutually exclusive sequence—”Step 1: A succeeds; Step 2: A fails but B succeeds; Step 3: both A and B fail but C succeeds…”—and sum the resulting disjoint terms. Modern RBD software tools (e.g., Isograph Availability Workbench, ReliaSoft BlockSim, ITEM ToolKit) can automatically perform Boolean disjointing, but engineers must understand the underlying principle to correctly interpret the physical meaning of each term in the calculation result.
| Method | Applicable Scenario | Advantages | Limitations | Computational Complexity |
|---|---|---|---|---|
| Series/Parallel Reduction | Pure series/parallel, no bridge structures, no repeated blocks | Intuitive, fast, direct formula application | Cannot handle bridges or common blocks | O(n) |
| Total Probability Decomposition | Bridge structures with 1-3 bridge arms | Splits complex structure into two simple sub-models | Branch explosion with many arms; requires wise selection of critical block | O(2k), k=critical block count |
| Truth Table | Any complexity, blocks < 20 | Most general, logically transparent, auditable | Exponential state explosion (2n) with many blocks | O(2n) |
| Boolean Disjointing | RBDs with repeated/common blocks | Exact treatment of common blocks; results usable for sensitivity analysis | Tedious manual derivation; physical meaning of terms opaque after expansion | Depends on repeated-block structure |
| Block Diagram Reduction | Complex RBDs with known success logic | Gradually reduces complexity through grouping and equivalent transformation | Only applicable to regularly groupable structures | Depends on reduction path |
IEC 61078 Clause 9 extends the RBD methodology from reliability (probability of no failure within a defined mission time) to availability (probability of being in an operable state at a given point in time, considering repair). This extension confers significant practical value, since most industrial systems require not “zero failures ever” (reliability requirement) but rather “rapid restoration after failure” (availability requirement).
In the RBD framework, availability is computed by assigning each block two parameters: failure rate (λ) and repair rate (μ)—or equivalently, MTTF (Mean Time To Failure) and MTTR (Mean Time To Repair). For a series availability model, the system unavailability is approximately the sum of component unavailabilities:
US ≈ Σ(λi / μi) = Σ(MTTRi / MTTFi) (when λ ≪ μ)
This extension upgrades RBD from analyzing “probability of surviving a mission without failure” to analyzing “steady-state proportion of time the system is in an available state.” This has direct engineering value for data centers (targeting “five nines” availability, i.e., 99.999%, which translates to only 5.26 minutes of downtime per year), telecommunications base stations (targeting 99.9999%), and process control system design verification. The standard notes that if repair times are not negligible, Markov analysis should be used instead of RBD to handle state transitions—during repair processes, blocks are no longer independent.
RBD and FTA are frequently confused by newcomers because both employ graphical representations with blocks/gates and connecting lines. At the methodological level, however, they represent fundamentally different analytical perspectives:
RBD is a forward, bottom-up reliability synthesis method: starting from component reliability data and synthesizing system reliability through AND (series) and OR (parallel) logic. RBD answers: “Given the reliability of each component, what is the reliability of the entire system?”
FTA is a backward, top-down failure reasoning method: starting from a system-level top event and tracing downward through AND and OR logic gates to identify all combinations of basic events that could produce that outcome. FTA answers: “How did this system failure occur?”
A critical distinction lies in the semantics of the logic gates: in RBD, an AND (series) configuration reduces reliability (because all components must work), while an OR (parallel) configuration enhances reliability (because any one component working suffices). In FTA, the semantics are inverted: an OR gate reduces reliability (any single cause triggers the top event), while an AND gate enhances reliability (all causes must coincide). The AND/OR logic used by the two methods carries exactly opposite meanings—because RBD deals with “success logic” and FTA deals with “failure logic.” This semantic inversion is the single greatest source of cognitive friction when practitioners migrate between the two methodologies.
Viewing RBD alongside FMEA (IEC 60812) and FTA (IEC 61025) reveals a clear division of labor and complementary relationship:
| Dimension | RBD (Reliability Block Diagram) | FTA (Fault Tree Analysis) | FMEA (Failure Mode & Effects Analysis) |
|---|---|---|---|
| IEC Standard | IEC 61078 | IEC 61025 | IEC 60812 |
| Analysis Direction | Bottom-up (Forward synthesis) | Top-down (Deductive) | Bottom-up (Inductive) |
| Starting Point | Component reliability data | System-level top event (failure consequence) | Component failure mode list |
| Core Question | “Given component reliability, what is system reliability?” | “How did this top event occur?” | “What happens if this component fails?” |
| Object Modeled | Success paths | Failure paths | Failure effects |
| Logic Gates | Series = AND (all must work) Parallel = OR (one suffices) |
OR gate (any cause triggers) AND gate (all causes needed) |
No logic gates |
| Primary Output | System reliability/availability numeric value | Minimal cut sets, top event probability | RPN ranking, failure effects list |
| Optimal Application | Redundancy architecture trade-offs, availability prediction | Safety-critical system failure path identification | Design review, potential failure coverage |
| Key Limitation | Assumes independence; struggles with sequential events | Construction effort heavy; easy to miss common causes | Cannot handle multiple-failure combinations |
RBD delivers irreplaceable value in the following engineering scenarios: