IEC TS 62579: Multimedia — Electronic Publishing XML Format

Standardized XML framework for cross-platform multimedia electronic publishing and content distribution

IEC TS 62579, published as a Technical Specification by IEC Technical Committee 100 (Audio, Video and Multimedia Systems and Equipment), defines a standardized XML format for electronic publishing of multimedia content. As the publishing industry undergoes a fundamental transformation from print-centric to digital-first workflows, the need for a robust, extensible, and platform-independent content format has become paramount. This standard addresses the requirements for representing structured multimedia publications — combining text, images, audio, video, and interactive elements — in a single XML-based document framework that ensures consistent rendering across diverse reading platforms, from dedicated e-readers and tablets to web browsers and mobile devices.

IEC TS 62579 is designed to complement existing electronic publishing formats such as EPUB (ISO 24521) by providing additional XML structures specifically optimized for multimedia-rich content. While EPUB offers a general-purpose container format for digital publications, IEC TS 62579 focuses on the semantic markup and multimedia synchronization aspects that are critical for educational, technical, and entertainment content where precise timing between media elements is essential.

Document Structure and Metadata Framework

The standard defines a hierarchical document structure based on XML Schema (XSD) that organizes multimedia publications into logical components. At the top level, the Publication element contains metadata, manifest, spine (reading order), and media collection elements. The metadata section, following Dublin Core and PRISM vocabulary standards, captures bibliographic information including title, creator, publisher, identifier (DOI, ISBN, or URI), publication date, language, subject classification, and rights management terms. The standard extends basic metadata with multimedia-specific elements: media duration, required bandwidth, supported rendering profiles, and accessibility features including closed caption tracks, audio descriptions, and alternative text for images.

The spine element defines the linear reading order of the content, supporting both mandatory and optional reading sequences for adaptive content delivery. Content documents within the publication use a subset of XHTML 1.1 with additional multimedia elements defined in the IEC TS 62579 namespace. The media collection element aggregates all multimedia resources — referenced by unique identifiers — with their associated metadata, including codec requirements, resolution variants (for responsive rendering), language tracks, and content protection flags. The container structure uses OCF (OEBPS Container Format) conventions based on ZIP archive packaging, with a MIME type descriptor file and a container.xml file that specifies the publication root file location.

IEC TS 62579 Document Structure Elements
Element Description Required Cardinality
publication Root element of the multimedia publication Yes 1
metadata Bibliographic and technical metadata container Yes 1
manifest List of all resources in the publication Yes 1
spine Linear reading order of content documents Yes 1
media-collection Aggregated multimedia resource registry No 0..1
nav-map Table of contents and navigation hierarchy Yes 1
content-document Individual content document (XHTML + multimedia) No 0..n
media-resource Reference to an external or embedded media asset No 0..n
When designing multimedia publications for the IEC TS 62579 format, engineers must pay careful attention to the declared media profiles and the target device capabilities. A publication that embeds 4K video at 60 fps with multichannel audio will not render on a basic e-reader with limited processing power and storage. The standard recommends providing multiple media variants — for example, 4K, 1080p, and 720p video alternatives with corresponding audio bitrates — and using the device profile declaration at runtime to select the most appropriate variant, similar to MPEG-DASH adaptive streaming principles.

Multimedia Synchronization and Content Rendering

A key innovation of IEC TS 62579 is its multimedia synchronization model, which enables precise temporal coordination between text, audio, video, and interactive elements. The synchronization is achieved through a SMIL-based (Synchronized Multimedia Integration Language) timeline model embedded within content documents. Paragraph-level timing markers allow text highlighting synchronized with audio narration (karaoke-style), while video segments can be linked to specific sections of text for illustrated technical documentation. The standard defines three synchronization modes: sequential (media elements play in order), parallel (simultaneous playback with synchronisation points), and interactive (user-driven progression with media triggered by reader actions).

Content rendering requirements specify minimum display capabilities for compliant reading systems. Text must reflow across different screen sizes using CSS-based layout with named page templates defined in the publication manifest. The standard mandates support for a minimum of six font families (serif, sans-serif, monospace, cursive, fantasy, and a Chinese/Japanese/Korean font), with font embedding permitted through the OpenType format with subsetting to reduce file size. For mathematical and scientific content, MathML rendering support is required at the presentation level, with content-level MathML recommended for semantic interoperability with learning management systems.

The adaptive content model in IEC TS 62579 enables a single publication source to serve multiple output formats: a full multimedia version for tablet and desktop reading, an audio-only version for hands-free consumption, a text-only version for screen readers and accessibility devices, and a print-ready PDF. This multi-format publishing capability reduces production costs by up to 60% compared to creating separate editions for each distribution channel, while ensuring content consistency across all formats.

Engineering Design Insights for Cross-Platform Publishing

Implementing a publishing system compliant with IEC TS 62579 requires careful engineering of the content creation pipeline. The standard does not require authors to write raw XML; instead, it establishes the target format that authoring tools must generate. For engineering teams building publishing workflows, the recommended approach is to maintain content in a semantic XML source format — such as DocBook or DITA — and transform it to IEC TS 62579 using XSLT stylesheets. This separation of content from presentation enables single-source publishing with automated generation of the multimedia publication format. The transformation pipeline must handle media resource referencing, resolution variant selection, and metadata mapping from the authoring schema to the IEC TS 62579 metadata vocabulary.

Digital rights management integration is addressed through a pluggable rights management interface. The standard defines a rights management information (RMI) element in the metadata section that can reference external DRM system parameters without mandating a specific DRM technology. Publications can be encrypted at the container level using AES-128 in CBC mode, with the encryption key delivered through a rights issuer service. The standard recommends that reading systems implement at minimum the OMA DRM 2.0 profile for mobile compatibility and the Marlin DRM system for broadband-connected devices, though implementers may support additional DRM systems through the extensible rights management framework.

Performance optimization for multimedia publications requires attention to several engineering parameters. The standard recommends that the total publication size should be declared in the metadata to allow reading systems to check available storage before download. For publications exceeding 50 MB, progressive download with chapter-level granularity is recommended, using HTTP range requests to fetch content on demand. The manifest must declare the byte offsets of each content document within the container to support random access without full decompression. Caching policies at the reading system level should prioritize recently accessed chapters and preload the next chapter in the reading order when bandwidth permits, providing a seamless reading experience without perceptible loading delays between sections.

Recommended Media Encoding Profiles for IEC TS 62579
Media Type Baseline Profile Enhanced Profile Container Format
Text XHTML 1.1 + CSS 2.1 XHTML 5 + CSS 3 application/xhtml+xml
Still images JPEG, PNG, GIF JPEG 2000, WebP, SVG image/*
Audio MP3 (192 kbps), AAC-LC AAC-HE, Opus, FLAC audio/mpeg, audio/mp4
Video H.264 AVC (3 Mbps, 720p) H.265 HEVC (8 Mbps, 1080p) video/mp4, video/webm
Interactive SMIL 3.0 timeline JavaScript + Canvas API application/smil+xml
Fonts OpenType (subset, CFF) WOFF 2.0 with variable fonts font/otf, font/woff2
Q1: How does IEC TS 62579 differ from EPUB (ISO 24521)?
A: While both are XML-based electronic publishing formats, IEC TS 62579 provides enhanced multimedia synchronization capabilities through SMIL timeline integration, richer metadata for multimedia content, adaptive media variant support, and a more granular rights management interface. EPUB is a general-purpose format focused on reflowable text with basic image support, whereas IEC TS 62579 is designed for publications where precise temporal coordination between text, audio, video, and interactive elements is essential — such as interactive textbooks, multimedia technical manuals, and enriched entertainment content.
Q2: What are the minimum reading system requirements for IEC TS 62579 compliance?
A: A compliant reading system must support XHTML 1.1 rendering with CSS 2.1, MathML presentation level, SMIL 3.0 basic profile for multimedia synchronization, at least the baseline media encoding profiles (H.264 video, AAC audio, JPEG images), OCF container format handling, and metadata-driven content navigation. Recommended hardware specifications include a 1 GHz processor, 512 MB RAM, and 1024×768 minimum display resolution for adequate multimedia rendering performance.
Q3: Can IEC TS 62579 publications include interactive assessments or quizzes?
A: Yes, the standard supports interactive content through embedded JavaScript, XForms, and SMIL event triggers. Assessment content can be encoded as interactive content documents that respond to user input and provide feedback through event-driven navigation rules. The standard defines a simple question-and-answer schema for self-assessment that can be integrated into the content flow, with scoring data optionally transmitted back to a learning management system through the standard’s web service interface.
Q4: How are accessibility requirements addressed in IEC TS 62579?
A: The standard includes comprehensive accessibility provisions: alt-text for all non-text content, extended image descriptions through the aria-describedby attribute, synchronized captions for audio and video content (using TTML or WebVTT formats), audio descriptions for visual content, semantic structural markup for screen reader navigation, and support for text-to-speech rendering with SSML pronunciation hints. The metadata section includes a mandatory accessibility summary that declares which accessibility features the publication provides, enabling users to identify suitable content before download.

Leave a Reply

Your email address will not be published. Required fields are marked *