Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO 26429-10 defines the format for stereoscopic (3D) picture content wrapping within a single MXF file for digital cinema distribution. This standard was developed to address the growing demand for 3D cinema content following the success of stereoscopic films in the late 2000s. The standard introduces a frame-interleaved structure where left-eye and right-eye frames are sequentially interleaved within a single essence stream. The left frame is always the first in each left/right pair, establishing an unambiguous convention that all equipment can rely upon for correct stereoscopic presentation.
The single-file approach represents a significant engineering advantage over dual-file methods, where separate left and right MXF files would need to be synchronized. By combining both views into a single container, the standard eliminates the risk of synchronization drift between the two eyes, which is one of the most critical issues in stereoscopic playback. Even momentary loss of synchronization can cause viewer discomfort, headaches, and nausea, making robust synchronization essential for commercial 3D cinema.
| Parameter | Value | Engineering Significance |
|---|---|---|
| Frame Interleaving | Left/Right sequential pairs | Left frame first, then right frame; always paired |
| Sample Rate (gross picture rate) | 48 fps (example) | Defines the underlying single-eye picture rate in descriptors |
| Edit Rate | 24 fps (example) | Half the sample rate; one edit unit = one L/R stereoscopic pair |
| KLV Fill Prohibition | Not allowed between L/R frames | Ensures contiguous, inseparable stereoscopic frame pairs |
| Identification | Stereoscopic Picture Sub-Descriptor | Strong reference from Picture Essence Descriptor |
The relationship between Sample Rate and Edit Rate is a crucial engineering concept in this standard. The Sample Rate defines the gross picture rate of the essence stream, which for stereoscopic content is twice the frame rate since each frame pair contains two pictures. The Edit Rate, however, defines the rate at which edit units (complete stereoscopic pairs) are presented. With a gross picture rate of 48 fps and an Edit Rate of 24 fps, each edit unit indexes exactly one left/right pair, allowing decoders to always start playback at the beginning of a stereoscopic pair.
The standard defines a MainStereoscopicPicture element as an extension to the standard Composition Playlist (CPL) Reel element defined in SMPTE 429-7. When present, this element indicates stereoscopic content, and the optional MainPicture element must be omitted. This design prevents accidental 2D playback of stereoscopic content, which would appear as a confusing double-image error to the audience. The XML namespace provides both version identification and semantic context for the element.
The standard provides a complete XML schema for the MainStereoscopicPicture element, which uses the same PictureTrackFileAssetType as defined in SMPTE 429-7. This design choice minimizes the implementation effort for equipment manufacturers, as the same asset type handling code can be reused for both stereoscopic and monoscopic content. The sample CPL in the standard shows a complete implementation example including UUIDs for asset identification, Edit Rate and Frame Rate specifications, and Screen Aspect Ratio settings.
The Stereoscopic Picture Sub-Descriptor (Annex A) provides a lightweight identification mechanism that is elegantly simple. Any Picture Essence Descriptor describing stereoscopic content must include a strong reference to this sub-descriptor. The sub-descriptor itself requires no specific properties beyond its key value its mere presence identifies the track file as stereoscopic. This design avoids complex parameter negotiations while maintaining unambiguous identification, leveraging the existing MXF sub-descriptor mechanism defined in SMPTE 422M.