Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/HL7 27932:2009, reaffirmed in 2012, specifies the data types used in HL7 Version 3 messages. This standard defines a comprehensive type system that ensures semantic consistency across all HL7 V3 interactions. The data types specification is a critical companion to the messaging standard (ISO/HL7 27931), providing the fundamental building blocks for all clinical data representation. It covers everything from basic text and numeric types to complex constructs such as precise healthcare timestamps, coded vocabulary references, and multi-language text blocks. The specification defines the XML Schema representation for each data type, enabling automated validation of message content against type constraints.
The data types specification draws heavily on the HL7 V3 Abstract Data Types specification (HL7 ADT) and extends it with concrete XML Schema implementations. The standard distinguishes between “data types” (the abstract semantic model) and “data type implementations” (the specific XML, and potentially other, representations). This separation allows the same clinical semantics to be encoded in different concrete syntaxes, a design principle that later influenced the FHIR standard’s approach to multiple serialization formats.
The standard organizes data types into a formal hierarchy rooted in the ANY abstract type. Fundamental types include: BL (Boolean) with three-valued logic (true, false, and null); INT (Integer) with arbitrary precision; REAL (Real number) with configurable precision; ST (String, unordered text); and CV (Coded Value) for concept representations that include both a coded identifier and human-readable display text. The CS (Coded Simple) type provides a compact representation for codes where only the code value is needed without the display name.
| Data Type | Category | Description | Example Usage |
|---|---|---|---|
| TS | Time-related | Precise healthcare timestamp with optional precision | Surgery start time, medication administration time |
| PQ | Quantities | Physical quantity with unit of measure | Body weight (75 kg), dosage (500 mg) |
| II | Identifiers | Instance identifier with root OID and extension | Patient ID, order number, specimen ID |
| CD | Coded values | Concept descriptor with code, code system, and qualifiers | Diagnosis (ICD-10), procedure (SNOMED CT) |
| ED | Encapsulated data | Binary or text data with MIME type | PDF report, JPEG image, audio recording |
| AD | Addresses | Postal address with structured parts | Patient home address, facility location |
Several data types in ISO/HL7 27932 are uniquely designed for healthcare semantics. The IVL
The standard defines collection types including SET (unordered, no duplicates), LIST (ordered, allows duplicates), BAG (unordered, allows duplicates), and DSET (ordered, no duplicates, with distinct semantics). For representing probability distributions, the standard provides BAG
The TS (Point in Time) data type supports configurable precision from year down to sub-second. A timestamp with year precision (e.g., 2024) is semantically distinct from one with full date-time precision (e.g., 20241115143015.1234+0800). Engineers must carefully manage precision expectations in message processing — comparing a year-precision date of birth with a day-precision admission date requires different handling than comparing two day-precision timestamps. The standard specifies that precision is indicated by the number of digits provided, not by explicit precision qualifiers.
ISO/HL7 27932 defines a sophisticated null flavor mechanism that goes far beyond simple null values. The standard specifies multiple null flavors including: NI (No Information), NA (Not Applicable), UNK (Unknown), ASKU (Asked But Unknown), NAV (Not Available), and MSK (Masked). Each null flavor carries distinct semantics that affect downstream processing. For example, a blood pressure reading with MSK flavor indicates that the value exists but is being withheld for privacy reasons, while NA indicates that the concept is not relevant to the patient. This granular null handling enables clinical decision support systems to reason appropriately about missing data.
The CD (Concept Descriptor) type allows qualifiers to refine the meaning of coded values, enabling post-coordination of clinical concepts. For example, a SNOMED CT code for “fracture” can be qualified with “site = femur” and “severity = comminuted” to create a compound clinical concept without requiring a pre-coordinated code for every possible combination. Managing these qualified codes in terminology servers requires careful indexing strategy, as the semantic equivalence of two qualified codes depends on the full qualification chain, not just the base code.