Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/IEC TR 29189 provides a comprehensive framework for the evaluation of Presentation Attack Detection (PAD) mechanisms in biometric systems. Presentation attacks — attempts to fool biometric sensors using fabricated or altered biometric characteristics (e.g., silicone fingerprints, printed iris images, voice recordings) — represent one of the most significant security threats to biometric authentication systems. This technical report establishes standardized methodologies for assessing how well different PAD mechanisms perform across various attack types and conditions.
The report addresses the fundamental challenge that PAD evaluation must account for: the adversarial nature of the problem. Unlike traditional biometric performance evaluation, where genuine users cooperate with the system, PAD evaluation involves simulating an intelligent attacker who actively tries to bypass detection mechanisms. This requires carefully designed test protocols, attack species taxonomies, and performance metrics that capture the arms-race dynamic between attackers and defenders.
ISO/IEC TR 29189 defines a systematic evaluation methodology structured around several dimensions. The Attack Presentation Classification Error (APCER) measures the proportion of attack presentations incorrectly classified as genuine. The Bona Fide Presentation Classification Error (BPCER) measures the proportion of genuine presentations incorrectly classified as attacks. The Overall System Error (OSE) provides a combined metric. The report specifies how these metrics should be computed with confidence intervals and how to establish performance baselines.
The evaluation methodology requires that test datasets include multiple attack types (known as attack species) with varying levels of attack potential — from low-effort attacks using readily available materials to high-effort attacks using sophisticated fabrication techniques. Each attack species is categorized by attack potential level following the Common Criteria framework, enabling consistent security certification across different biometric modalities and applications.
| Attack Species | Attack Potential | Difficulty to Produce | Detection Difficulty |
|---|---|---|---|
| 2D Printed face/iris | Low | Easy (office printer) | Low |
| Silicone fingerprint | Medium | Moderate (molding + casting) | Medium |
| 3D Mask face | High | Difficult (scanning + printing) | High |
| Video replay face | Medium | Moderate (display screen) | Medium-High |
| Synthesized voice (deepfake) | Very High | Very difficult (AI training) | Very High |
A critical contribution of ISO/IEC TR 29189 is its guidance on test protocol design. The protocol must specify the enrollment conditions under which genuine biometric references are created, the presentation conditions for both genuine and attack attempts (including environmental factors like lighting, angle, distance), and the number of attempts required for statistically significant results. Minimum sample sizes are provided based on statistical power analysis, with recommendations for both development testing and independent evaluation.
The report also addresses the critical issue of dataset diversity. Biometric characteristics vary significantly across demographic groups, and PAD mechanisms that work well for one population may fail for another. The evaluation framework requires that test populations represent the target deployment demographics in terms of age, ethnicity, gender, and other relevant factors. Cross-dataset evaluation — where a PAD mechanism is tested on data collected under different conditions than its training data — is recommended as a measure of generalization capability.
ISO/IEC TR 29189 specifies a standardized reporting format for PAD evaluation results, including Detection Error Trade-off (DET) curves showing the full trade-off between APCER and BPCER at various operating points. The report format must specify the exact hardware and software configuration, the attack species tested, the demographic composition of the test population, and the environmental conditions during testing. This comprehensive reporting enables informed risk assessment by system integrators and certification bodies.
The report also discusses the relationship between PAD evaluation and biometric system usability. High BPCER (false rejection of genuine users) can severely degrade user experience and drive users to circumvent security measures. The evaluation framework recommends establishing application-specific performance thresholds that balance security requirements with usability constraints.
APCER measures how many attack presentations are incorrectly accepted as genuine (false acceptance of attacks). BPCER measures how many genuine presentations are incorrectly rejected as attacks (false rejection of legitimate users). The ideal PAD mechanism minimizes both simultaneously.
Select attack species based on risk assessment of your deployment environment. Consider the skills, resources, and motivation of potential attackers. Include both low-effort attacks (to cover casual threats) and high-effort attacks (to cover sophisticated adversaries) in your test plan.
No. Every PAD mechanism has a non-zero bypass rate against sufficiently skilled attackers. The goal is to raise the cost and complexity of successful attacks to a level that exceeds the value of the protected assets.
At minimum, re-evaluation should be conducted whenever the PAD algorithm is updated, the sensor hardware changes, or a new attack method is discovered. Annual independent evaluation is recommended as a baseline best practice.