ISO 26428-19: Digital Cinema — Part 19: Additional Audio for Large-Format Venues

Extended immersive audio framework for PLF cinemas — enhanced metadata, haptic channels, and binaural rendering

1. ISO 26428-19: Extending the Immersive Audio Framework

ISO 26428-19 builds upon the baseline established in Part 11 by specifying additional audio channel configurations and enhanced metadata structures for large-format cinema venues, premium large-format (PLF) screens, and outdoor cinematic installations. The standard addresses channel counts beyond 64, including 9.1.6, 11.1.8, and custom configurations with up to 256 individually addressable transducer channels for haptic and motion-cue integration.

For PLF installations with more than 32 height channels, configure audio processing in a hierarchical buss structure: group height channels into quadrants (front-left, front-right, rear-left, rear-right) at the DSP level, then apply per-speaker EQ and delay within each group. This reduces CPU load by approximately 40 % compared to flat processing of all channels.
Configuration Bed Channels Height Layers Subwoofers Total Channels
9.1.6 9 6 (1 layer) 1 16
11.1.8 11 8 (2 layers) 1 20
27.3.16 27 16 (3 layers) 3 46
Custom (haptic-enabled) 16 16 + 128 haptic 4 164

2. Advanced Metadata and Binaural Rendering

The enhanced metadata format defined in ISO 26428-19 includes dynamic acoustic room compensation filters, per-object Doppler shift parameters for moving sound sources, and distance-based high-frequency air absorption coefficients. These metadata elements enable the rendering engine to model physics-based acoustic propagation rather than relying on static gain-based panning. The standard also defines a mandatory binaural rendering profile for headphone-based quality control monitoring during mixing and mastering, using HRTF (Head-Related Transfer Function) data sets with minimum 128 measurement points per ear.

When implementing binaural monitoring for QC, the HRTF data set must be measured at ≤ 5° angular resolution across all three axes (azimuth, elevation, roll). Generic HRTF data sets with coarser resolution produce localization errors exceeding 10° in elevation perception, resulting in inaccurate mix approval decisions.

Audio compression for the enhanced formats is provided through the ISO 26428-19 lossless audio frame (LLAF) codec, which uses a combination of linear prediction (FLAC-derived) and entropy coding to achieve compression ratios of 1.8:1 to 2.5:1 on cinematic content with zero sample loss. This is essential for managing storage and bandwidth in multi-reel Digital Cinema Packages (DCPs), where a single two-hour feature with full immersive audio can exceed 400 GB.

3. Engineering Design for Large-Venue Systems

Deploying ISO 26428-19 compliant systems in venues seating 500+ patrons presents unique acoustic and electrical challenges. The inverse-square law attenuation across a 40 m deep auditorium requires a dynamic range management strategy that applies different gain structures to near-field and far-field speaker arrays while maintaining coherent wavefront arrival times.

  • Delay tapering: Far-field speaker arrays must receive progressively increasing delay (by 1–2 samples relative to the nearest array) to compensate for the natural time-of-flight difference across the listening area. This is computed automatically from the venue’s 3D acoustic model stored in the Sound System Configuration File (SSCF).
  • Amplifier topology: For PLF installations, distributed amplification (one amplifier module per speaker driver) provides superior channel separation and fault tolerance compared to centralized multi-channel amplifiers.
Using ISO 26428-19’s automated acoustic alignment routine with a calibrated measurement microphone array reduces venue commissioning time from 5–7 days (manual alignment) to 12–18 hours, while achieving ±0.5 dB SPL uniformity across all seats.
When installing overhead height channels in venues with steel roof structures, account for structural resonance modes between 20 and 80 Hz. Uncompensated structural coupling can introduce > 15 dB SPL peaks at specific auditorium locations, potentially causing hearing discomfort at moderate playback levels.

4. Frequently Asked Questions

Q: What is the difference between ISO 26428-11 and ISO 26428-19?
A: Part 11 defines the baseline immersive audio channel specifications (up to 64 channels with object audio). Part 19 extends this framework for large-format venues with channel counts exceeding 64, enhanced metadata for acoustic room compensation, haptic channel integration, and mandatory binaural QC rendering profiles.
Q: Can existing DCI-compliant projectors and servers handle the additional audio bandwidth?
A: Yes, provided the DCP is packaged correctly. The additional audio data streams are multiplexed within the existing MXF container structure using extended essence descriptors. Servers must be running software that supports ISO 26428-19 according to the manufacturer’s compliance matrix.
Q: Is the LLAF codec mandatory for all ISO 26428-19 DCPs?
A: The lossless audio frame codec is mandatory for DCPs using more than 24 audio channels to ensure manageable file sizes. For configurations with 24 channels or fewer, standard uncompressed PCM is acceptable per the standard.
Q: How are haptic channels different from conventional audio channels?
A: Haptic channels carry low-frequency (< 200 Hz) vibrational signals synchronized with on-screen action. They are routed to specialised actuators (tactile transducers) mounted in seats, floors, or armrests rather than to acoustic speakers, and are encoded with different metadata parameters focused on acceleration and displacement rather than SPL.

Leave a Reply

Your email address will not be published. Required fields are marked *