Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
IEC TS 62579, published as a Technical Specification by IEC Technical Committee 100 (Audio, Video and Multimedia Systems and Equipment), defines a standardized XML format for electronic publishing of multimedia content. As the publishing industry undergoes a fundamental transformation from print-centric to digital-first workflows, the need for a robust, extensible, and platform-independent content format has become paramount. This standard addresses the requirements for representing structured multimedia publications — combining text, images, audio, video, and interactive elements — in a single XML-based document framework that ensures consistent rendering across diverse reading platforms, from dedicated e-readers and tablets to web browsers and mobile devices.
The standard defines a hierarchical document structure based on XML Schema (XSD) that organizes multimedia publications into logical components. At the top level, the Publication element contains metadata, manifest, spine (reading order), and media collection elements. The metadata section, following Dublin Core and PRISM vocabulary standards, captures bibliographic information including title, creator, publisher, identifier (DOI, ISBN, or URI), publication date, language, subject classification, and rights management terms. The standard extends basic metadata with multimedia-specific elements: media duration, required bandwidth, supported rendering profiles, and accessibility features including closed caption tracks, audio descriptions, and alternative text for images.
The spine element defines the linear reading order of the content, supporting both mandatory and optional reading sequences for adaptive content delivery. Content documents within the publication use a subset of XHTML 1.1 with additional multimedia elements defined in the IEC TS 62579 namespace. The media collection element aggregates all multimedia resources — referenced by unique identifiers — with their associated metadata, including codec requirements, resolution variants (for responsive rendering), language tracks, and content protection flags. The container structure uses OCF (OEBPS Container Format) conventions based on ZIP archive packaging, with a MIME type descriptor file and a container.xml file that specifies the publication root file location.
| Element | Description | Required | Cardinality |
|---|---|---|---|
| publication | Root element of the multimedia publication | Yes | 1 |
| metadata | Bibliographic and technical metadata container | Yes | 1 |
| manifest | List of all resources in the publication | Yes | 1 |
| spine | Linear reading order of content documents | Yes | 1 |
| media-collection | Aggregated multimedia resource registry | No | 0..1 |
| nav-map | Table of contents and navigation hierarchy | Yes | 1 |
| content-document | Individual content document (XHTML + multimedia) | No | 0..n |
| media-resource | Reference to an external or embedded media asset | No | 0..n |
A key innovation of IEC TS 62579 is its multimedia synchronization model, which enables precise temporal coordination between text, audio, video, and interactive elements. The synchronization is achieved through a SMIL-based (Synchronized Multimedia Integration Language) timeline model embedded within content documents. Paragraph-level timing markers allow text highlighting synchronized with audio narration (karaoke-style), while video segments can be linked to specific sections of text for illustrated technical documentation. The standard defines three synchronization modes: sequential (media elements play in order), parallel (simultaneous playback with synchronisation points), and interactive (user-driven progression with media triggered by reader actions).
Content rendering requirements specify minimum display capabilities for compliant reading systems. Text must reflow across different screen sizes using CSS-based layout with named page templates defined in the publication manifest. The standard mandates support for a minimum of six font families (serif, sans-serif, monospace, cursive, fantasy, and a Chinese/Japanese/Korean font), with font embedding permitted through the OpenType format with subsetting to reduce file size. For mathematical and scientific content, MathML rendering support is required at the presentation level, with content-level MathML recommended for semantic interoperability with learning management systems.
Implementing a publishing system compliant with IEC TS 62579 requires careful engineering of the content creation pipeline. The standard does not require authors to write raw XML; instead, it establishes the target format that authoring tools must generate. For engineering teams building publishing workflows, the recommended approach is to maintain content in a semantic XML source format — such as DocBook or DITA — and transform it to IEC TS 62579 using XSLT stylesheets. This separation of content from presentation enables single-source publishing with automated generation of the multimedia publication format. The transformation pipeline must handle media resource referencing, resolution variant selection, and metadata mapping from the authoring schema to the IEC TS 62579 metadata vocabulary.
Digital rights management integration is addressed through a pluggable rights management interface. The standard defines a rights management information (RMI) element in the metadata section that can reference external DRM system parameters without mandating a specific DRM technology. Publications can be encrypted at the container level using AES-128 in CBC mode, with the encryption key delivered through a rights issuer service. The standard recommends that reading systems implement at minimum the OMA DRM 2.0 profile for mobile compatibility and the Marlin DRM system for broadband-connected devices, though implementers may support additional DRM systems through the extensible rights management framework.
Performance optimization for multimedia publications requires attention to several engineering parameters. The standard recommends that the total publication size should be declared in the metadata to allow reading systems to check available storage before download. For publications exceeding 50 MB, progressive download with chapter-level granularity is recommended, using HTTP range requests to fetch content on demand. The manifest must declare the byte offsets of each content document within the container to support random access without full decompression. Caching policies at the reading system level should prioritize recently accessed chapters and preload the next chapter in the reading order when bandwidth permits, providing a seamless reading experience without perceptible loading delays between sections.
| Media Type | Baseline Profile | Enhanced Profile | Container Format |
|---|---|---|---|
| Text | XHTML 1.1 + CSS 2.1 | XHTML 5 + CSS 3 | application/xhtml+xml |
| Still images | JPEG, PNG, GIF | JPEG 2000, WebP, SVG | image/* |
| Audio | MP3 (192 kbps), AAC-LC | AAC-HE, Opus, FLAC | audio/mpeg, audio/mp4 |
| Video | H.264 AVC (3 Mbps, 720p) | H.265 HEVC (8 Mbps, 1080p) | video/mp4, video/webm |
| Interactive | SMIL 3.0 timeline | JavaScript + Canvas API | application/smil+xml |
| Fonts | OpenType (subset, CFF) | WOFF 2.0 with variable fonts | font/otf, font/woff2 |