Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/IEC TR 25219:2023 provides a comprehensive technical framework for the performance testing of face recognition systems. As facial recognition technology becomes ubiquitous in security, banking, border control, and consumer applications, the need for standardized, reproducible testing methodologies has never been more pressing. This technical report addresses the entire testing lifecycle — from test design and dataset selection to metric computation and result interpretation.
The report covers both verification (1:1 matching) and identification (1:N search) scenarios, offering distinct protocols for each. It accounts for variations in image capture conditions, population demographics, and system operational modes — providing a holistic approach to performance characterization.
TR 25219 defines three primary testing regimes: closed-set identification, open-set identification, and verification testing. Each regime requires specific dataset characteristics and evaluation protocols. The table below summarizes the core metrics and their operational significance:
| Metric | Definition | Operational Relevance |
|---|---|---|
| False Accept Rate (FAR) | Proportion of impostor attempts incorrectly accepted | Security risk — critical for access control and financial transactions |
| False Reject Rate (FRR) | Proportion of genuine attempts incorrectly rejected | User experience — impacts convenience and workflow efficiency |
| Equal Error Rate (EER) | Threshold where FAR equals FRR | Single-figure system comparison benchmark |
| Rank-1 Identification Rate | Proportion of queries where the correct identity is the top match | Primary accuracy measure for watchlist and identification scenarios |
| Detection Error Trade-off (DET) Curve | Graphical FAR vs FRR across all thresholds | Threshold selection for operational requirements |
| False Positive Identification Rate (FPIR) | In open-set, proportion of non-mated searches falsely matched | Critical for watchlist applications where false alarms must be minimized |
| True Positive Identification Rate (TPIR) | In open-set, proportion of mated searches correctly identified | Effectiveness measure for enrolled population coverage |
TR 25219 imposes stringent requirements on test datasets. Images must represent the target operational distribution in terms of pose angles (±15° for cooperative subjects, up to ±45° for non-cooperative), illumination variation (at least 5 lux to 1000+ lux), resolution (minimum 80 pixels interpupillary distance for verification), and image quality (no compression artifacts below JPEG quality 80). The report also requires that datasets include multiple samples per subject to enable statistical confidence intervals.
The report introduces a structured protocol for demographic bias analysis. Test results must be disaggregated by at least three demographic dimensions, with statistical significance testing (e.g., 95% confidence intervals) applied to compare cohort performance. If the difference in FAR or FRR between any two demographic groups exceeds 1.5x, the system is flagged for potential bias. The protocol includes guidance on remedial actions, including targeted retraining, threshold adjustment, or fusion with complementary modalities.
TR 25219 introduces the concept of “operational scenario profiles” — parameterized descriptions of deployment conditions that affect face recognition performance. These profiles include camera type (visible, NIR, thermal), capture distance (0.5m to 10m+), subject cooperation level, environmental lighting, and population characteristics. By testing against multiple scenario profiles, procurers can match system capabilities to real operational needs rather than relying on single-number accuracy claims.