Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
In modern electronic product development, a familiar story plays out far too often: the design team completes the product, hands it over to the reliability team for “the test,” and only then discovers fundamental weaknesses. At that point, a single design change may mean re-tooling, board re-spin, and regulatory re-certification — costs multiply, schedules collapse. IEC 61014, “Programmes for Reliability Growth,” exists precisely to prevent this scenario by embedding reliability improvement into every stage of product development through structured Test-Analyze-Fix (TAAF) cycles.
Published by IEC TC 56 (Dependability), the second edition (2003) represents a fundamental rethinking of reliability growth. The first edition (1989) focused almost exclusively on formal reliability growth testing. The second edition introduces the concept of Integrated Reliability Engineering — reliability growth activities that span the entire product life cycle from concept definition through field use. The logic is simple: fixing a weakness on a schematic costs pennies; fixing it during pilot production costs hundreds; fixing it through a product recall costs millions.
IEC 61014 builds its entire methodology on a crucial distinction between two fundamentally different types of weaknesses:
A systematic weakness can only be eliminated, or its effects reduced, by a modification of the design, manufacturing process, operational procedures, documentation, or other relevant factors. These weaknesses arise from deterministic causes such as design errors, improper component selection, or manufacturing process flaws. The critical insight: a single systematic weakness is built into every unit of the design. This means systematic weaknesses can be detected even with small sample sizes — provided the test conditions stimulate the failure mode.
Software weaknesses are always systematic, as IEC 61014 explicitly notes. A software bug does not appear “randomly” — it lurks in every copy, waiting for the right input conditions to trigger it.
Residual weaknesses are related to uncontrolled random variation and exist only in hardware. Unlike systematic weaknesses, their effects are limited to individual units. They are addressed through quality control, statistical process control, and adequate design margins rather than through reliability growth testing.
IEC 61014 makes a provocative statement: “The term random failures should be avoided.” The time at which a failure is observed may be random, but the cause of the failure is always deterministic — we simply may not yet understand the physical failure mechanism.
| Characteristic | Systematic Weaknesses | Residual Weaknesses |
|---|---|---|
| Root cause | Design/process/documentation defects | Uncontrolled random variation |
| Scope of effect | All units of the same design | Individual units only |
| Detection method | Small sample testing suffices | Large sample sizes required |
| Elimination method | Design modification (core of TAAF) | Screening, QC, derating |
| Applicable to software | Yes (all software weaknesses are systematic) | No |
| Failure recurrence | Inevitable without design change | Low recurrence probability |
The engine of reliability growth is the TAAF cycle:
IEC 61014 categorizes test failures as follows:
The decision team typically includes design, reliability, and programme management personnel. This triage mechanism ensures resources are focused on the highest-impact improvements.
IEC 61014 and its sister standard IEC 61164 describe the mathematical foundation. The core concept: as each successful corrective modification is introduced, the product’s failure intensity decreases following a power-law relationship.
The Duane model is the classic empirical approach. It observes that cumulative failure rate plotted against cumulative test time on log-log axes approximates a straight line:
λΣ(T) = kT-α
where λΣ(T) is cumulative failure rate, T is cumulative test time, k is a constant related to initial failure rate, and α is the growth rate parameter (0 < α < 1, typically 0.3 to 0.6).
The Crow-AMSAA model provides a rigorous statistical foundation using a Non-Homogeneous Poisson Process (NHPP):
N(T) = λTβ
where β is the growth parameter (β < 1 indicates reliability is improving) and λ is a scale parameter. The Crow-AMSAA model’s key advantage is that it provides statistical confidence intervals, enabling probabilistic predictions of when reliability targets will be met.
IEC 61014 maps reliability growth activities across seven product development phases:
| Phase | Key Reliability Activities | Typical Outputs |
|---|---|---|
| I. Concept & Requirements | Set product reliability goal; analyze usage profile; study field data from similar products | Reliability goal document; usage profile |
| II. Product Definition & Prelim. Design | Initial reliability estimates; reliability growth plan and model; key component reliability requirements | Growth plan; key components list |
| III. Detailed Design | FMEA/FTA; failure mode mitigation; design reviews; continuous reliability reassessment | FMEA report; mitigation action list |
| IV. Tooling & Production Prep. | Component testing; subsystem reliability testing | Component qualification reports |
| V. First Production / Pilot | Reliability growth testing; life testing; environmental stress screening | TAAF cycle records; growth curves |
| VI. Production | Continuing reliability testing; product change impact assessment | Lot reliability reports |
| VII. Field Use | Field failure tracking and analysis; input for next-generation improvements | Field performance report |
Mistake 2: Confusing reliability growth testing with reliability demonstration testing. Growth testing aims to find and fix problems. Demonstration testing aims to prove a requirement has been met. Using demonstration criteria (e.g., time-terminated, zero-failure acceptance) during the growth phase discourages aggressive failure discovery — engineers subconsciously avoid exposing issues that would “fail” the test.
Mistake 3: Short-changing the Analyze step. The most undervalued letter in TAAF is “A.” IEC 61014 requires multi-dimensional investigation including physical analysis, chemical analysis, and circumstantial analysis. If failure analysis stops at “replaced IC U12 and it works now,” systemic weaknesses remain undiagnosed and unaddressed. A well-functioning FRACAS (Failure Reporting, Analysis and Corrective Action System) is essential.
IEC 61014 sounds a strong note of caution: even seemingly successful modifications must be rigorously verified. Verification requires testing not only under the same conditions that produced the original failure, but also accounting for all stress factors previously applied. Moreover, a modification may introduce an entirely new failure mode — a well-known phenomenon in complex systems. For critical fixes, IEC 61014 recommends additional targeted testing for speculative failure modes that the modification might introduce.
At its heart, reliability growth is not about plotting an elegant growth curve for a quarterly review presentation. It is about building a product that does not wake you up at 3 AM with a customer escalation call. IEC 61014 provides a battle-tested methodology: from reliability goal setting in the concept phase, through FMEA/FTA analysis during design, through TAAF cycles during testing, and onward to continuous improvement in the field. Reliability is not something you test into a product — it is something you grow into a product, one design improvement at a time.