Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
IEC 14496-18-05 (2007), formally integrated within the ISO/IEC 14496 series on the coding of audio-visual objects (MPEG-4), defines the normative architecture for font data compression and streaming. While earlier parts of the MPEG-4 standard focused on video (Part 2, Part 10/AVC), audio (Part 3), and scene description (Part 11/BIFS), the requirement for deterministic, multi-lingual typography driven by the scene composition memory became critical for broadcast and rich media applications.
The scope of this part is unique within the MPEG-4 ecosystem. It specifies how complete font programs—specifically TrueType, OpenType (CFF and TrueType outlines), and associated compact font formats—are compressed in a lossless manner, packaged into dedicated data units, and streamed synchronously with the multimedia presentation. This ensures that content creators can guarantee a specific visual font appearance regardless of the local system fonts available on the end-user device.
The standard builds directly upon the OpenType specification, which itself is a superset of TrueType. The decoder profiles specified in IEC 14496-18-05 support the following font program structures:
IEC 14496-18-05 specifies a specific lossless compression algorithm tailored for font tables. Unlike general-purpose compression (GZIP), the normative method defined in this standard uses a predictive, table-aware algorithm.
The algorithm analyzes the structural redundancy of TrueType/OpenType tables (such as the `glyf`, `loca`, `cmap`, and `CFF ` tables) and applies context modeling specific to glyph coordinate data. This typically achieves a 40–60% reduction in size compared to the raw font file, critical for bandwidth-constrained broadcast networks.
Table 1: Font Stream Profiles and Compression Levels (IEC 14496-18-05)
| Profile Level | Font Format | Compression Standard | Target Use Case |
|---|---|---|---|
| Level 1 Baseline | TrueType | GZIP per table | Low complexity embedded devices |
| Level 2 Standard | OpenType (TT / CFF) | Predictive & Entropy Coding | Set-top boxes, mobile devices |
| Level 3 Extended | OpenType (TT / CFF + Multiple Masters) | Full Font Stream Compression | Broadcast, BD-ROM, Full Graphics |
FontDataUnit cache. If multiple scene objects reference the same font UID (Font Identifier), the decompressed font program should be shared in memory to prevent redundant decompression cycles and conserve processing resources. The most significant architectural element introduced by IEC 14496-18-05 is the Font Data Stream (FDSS). This is an Elementary Stream (ES) in the MPEG-4 Systems layer (ISO/IEC 14496-1). Fonts are not treated as static files inside a container; rather, they are managed as dynamic data objects with synchronization to the BIFS scene description.
FontData object within the scene graph. The object holds the compressed font data. In the Binary Format for Scenes (BIFS), text nodes utilize fontFamily or fontUID fields to select the typeface. IEC 14496-18-05 allows these fields to reference either platform fonts (using generic aliases like “Serif”) or streamed fonts (using a unique fontID defined in the FDSS).
The timing model ensures that text rendering blocks until the referenced font data unit is available in the decoder buffer. This prevents “flash of unstyled text” (FOUT) in the rich media terminal.
decoderSpecificInfo descriptor. A single font program can contain several thousand glyphs (common in CJK fonts). Decoders must implement a flush mechanism for unused font instances based on the scene composition memory buffer to avoid exhausting system resources. Compliance with IEC 14496-18-05 requires two distinct layers of conformance testing to ensure semantic and syntactic correctness:
An often-overlooked requirement of this standard is font licensing. IEC 14496-18-05 provides the technical framework for carriage and compression, but the content creator must ensure they own the digital rights to convert the font into the compressed stream format.
To guarantee interoperability between authoring tools and playback terminals, the standard mandates specific resolution for the Font Data Unit. Every glyph must be rendered against a reference rasterizer at a defined size to match the expected output of the reference decoder. This prevents visual discrepancies caused by different font engine implementations or hinting interpretation.
Technical reference document for IEC 14496-18-05 (2007). All specifications subject to the official ISO/IEC copyright. Reviewed 2026.