Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/TR 27877:2021 provides comprehensive guidance on statistical methods for evaluating proficiency testing (PT) data obtained from interlaboratory comparisons. Proficiency testing is a cornerstone of laboratory quality assurance, enabling laboratories to demonstrate technical competence and identify measurement biases through periodic comparison with peer laboratories. This Technical Report fills a critical gap by consolidating robust statistical approaches that go beyond classical assumptions, addressing real-world challenges such as small sample sizes, multiple outliers, and non-normal distributions commonly encountered in PT schemes.
The standard focuses on performance statistics that are both intuitive and statistically sound. It covers traditional z-scores alongside robust alternatives including z’-scores, zeta-scores, and En numbers, each suited to different scenarios depending on the availability and reliability of assigned values and uncertainty estimates. A key contribution is the guidance on handling censored data, extreme values, and multimodal distributions that violate normality assumptions.
The core of ISO/TR 27877 lies in its structured approach to calculating and interpreting performance metrics. The classical z-score, defined as (x − Xpt) / σpt, remains widely used, but the standard emphasizes the importance of selecting robust estimators for the assigned value Xpt and the standard deviation σpt. Algorithm A from ISO 13528 is recommended for robust analysis, providing immunity to the influence of up to 20% outliers.
| Metric | Formula | Application | Robustness |
|---|---|---|---|
| z-score | (x − Xpt) / σpt | General PT with reliable σ | Low |
| z’-score | (x − Xpt) / MAD | Small participant numbers | High |
| ζ-score (zeta) | (x − Xpt) / √(ux² + upt²) | When lab uncertainty is critical | Moderate |
| En number | (x − Xpt) / √(Ulab² + Uref²) | Calibration PT schemes | Moderate |
The standard also addresses the interpretation of combined results across multiple rounds or multiple measurands. The sum of z-scores (RSZ) and sum of squared z-scores (RSSZ) are introduced as tools for detecting persistent bias or excessive variability that individual z-scores might miss. A laboratory with |z| ≤ 2 considered satisfactory, 2 < |z| < 3 is questionable, and |z| ≥ 3 is unsatisfactory, but the standard stresses the need to consider these alongside graphical tools such as Youden plots and Mandel’s h and k statistics.
From an engineering perspective, ISO/TR 27877 offers several valuable design insights. First, the choice of robust statistical estimators directly impacts the sensitivity of the PT scheme. Using the median instead of the mean reduces the influence of extreme results but also reduces statistical efficiency. The standard recommends Algorithm A (Huber M-estimator) as a practical compromise, providing high breakdown point with acceptable efficiency.
Second, the standard provides practical workflows for handling non-normal data distributions. When the data exhibit significant skewness or kurtosis, transformation techniques (logarithmic, Box-Cox) are recommended before applying performance metrics. The standard includes worked examples demonstrating how transformation affects z-score interpretation, a rarely-covered but critically important topic.