Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO 25237:2017 defines principles, methods, and procedures for pseudonymization of health data. As healthcare increasingly relies on digital records, protecting patient privacy while enabling data utility has become a critical challenge. Pseudonymization replaces identifying attributes with artificial identifiers (pseudonyms), allowing data to be processed for secondary purposes such as clinical research, epidemiology, and public health surveillance without directly revealing patient identities.
The standard establishes a comprehensive framework for understanding pseudonymization in healthcare contexts. A pseudonym is an identifier that replaces a direct identifier (such as name or national ID) across one or more data records. The process involves a pseudonymization function — a cryptographic or algorithmic transformation — and a pseudonymization service that manages the mapping table, access controls, and policy enforcement.
Key entities defined include the data subject (patient), data controller (healthcare provider or researcher), and pseudonymization authority (trusted third party or internal service). The standard distinguishes between internal pseudonymization (managed within the same organization) and external pseudonymization (involving a separate trusted entity), each with different security and trust implications.
| Concept | Definition | Example |
|---|---|---|
| Direct Identifier | Information that uniquely identifies a person | Patient name, national ID number |
| Pseudonym | Artificial identifier replacing a direct identifier | “P-8F3A29” instead of “John Smith” |
| Re-identification Risk | Probability of linking pseudonymized data back to identity | ≤0.01% in controlled environments |
| Linkability | Ability to correlate records belonging to the same subject | Same pseudonym across multiple clinical trials |
| De-pseudonymization | Reverse process requiring authorized access | Court-ordered disclosure with audit trail |
ISO 25237 describes multiple pseudonymization techniques suited for different use cases. Cryptographic hashing with salt is common for one-way pseudonymization, where the original identifier is hashed using SHA-256 with a secret salt value. Encryption-based pseudonymization uses symmetric encryption (e.g., AES-256) to create reversible pseudonyms, enabling authorized re-identification when necessary for patient safety or regulatory audits.
The standard emphasizes security requirements for the pseudonymization service, including physical isolation, encryption of mapping tables, role-based access control, and comprehensive audit logging. For cross-domain data sharing, the standard recommends domain-specific pseudonyms — different pseudonyms for the same patient across different research databases — to prevent cross-correlation attacks.
Clinical research networks use pseudonymization to pool data from multiple hospitals while protecting patient privacy. A patient participating in three studies across two hospitals receives consistent pseudonyms within each domain, enabling longitudinal follow-up without exposing their identity. Pharmacovigilance systems apply pseudonymization to adverse event reports, allowing signal detection while maintaining patient confidentiality.
The standard provides guidance on evaluating re-identification risks using metrics such as k-anonymity (each record indistinguishable from at least k-1 others) and population uniqueness. It also addresses the specific challenges of pseudonymizing genetic data, imaging data, and free-text clinical notes, which may contain embedded identifiers requiring specialized de-identification pipelines.