Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
The transition from analog to digital video recording in professional broadcasting demanded standardized formats that guaranteed interchangeability, high quality, and long-term archiving stability. IEC 62356, published in four parts in 2017, defines the 12.65 mm type D-11 digital video recording format—a professional digital tape format designed for broadcast-quality video acquisition, post-production, and archiving. This standard was developed under IEC Technical Committee 100 (Audio, video and multimedia systems and equipment).
Unlike consumer formats that prioritized low cost and compact size, the D-11 format was engineered for the rigorous demands of professional broadcasting: predictable playback across different machines, robust error correction, and consistent bitstream behavior over decades of storage. The standard is organized into four parts, each addressing a critical layer of the recording chain.
IEC 62356 follows a layered architecture that separates the physical recording format from the compression scheme and the file structure. This modular approach allows each layer to evolve independently.
| Part | Title | Scope | Key Technical Parameters |
|---|---|---|---|
| 62356-1 | Format Specification | Physical tape format, track pattern, and recording parameters | 12.65 mm tape width, helical scan, track pitch, azimuth angles |
| 62356-2 | Compression Specification | Video compression algorithm and bitstream syntax | MPEG-4-based intra-frame coding, 50/100 Mbps modes |
| 62356-3 | Data Structure and Stream Format | Multiplexing of video, audio, and metadata into a single stream | Packetized Elementary Stream (PES) structure, timecode embedding |
| 62356-4 | File Format | File-level wrapping for IT-based workflows and non-linear editing | MXF (Material eXchange Format) wrapper, KLV encoding |
Part 1 of IEC 62356 specifies the physical recording format in exacting detail. The 12.65 mm (nominal 1/2 inch) tape is written using a helical scan technique, where the rotating drum carries the recording heads at a precise angle relative to the tape path.
The standard defines a track pattern consisting of video sectors, audio sectors, and subcode areas arranged across the tape width. The helical tracks are written at an angle of approximately 4.9 degrees relative to the tape edge, with adjacent heads using opposing azimuth angles (±15 degrees) to eliminate crosstalk between neighboring tracks—a technique known as azimuth recording.
| Parameter | Value | Notes |
|---|---|---|
| Tape width | 12.65 mm ± 0.01 mm | ½-inch format |
| Track pitch | 18 µm | High-density recording |
| Drum diameter | 62 mm | Rotating head drum |
| Drum rotation speed | 9000 rpm (150 Hz) | Synchronous with video field rate |
| Head-to-tape speed | ~25.4 m/s | Relative velocity |
| Recording wavelength | 0.33 µm | Minimum bit cell |
| Azimuth angle | ±15° | Adjacent tracks |
Part 2 defines the video compression system, which is based on MPEG-4 Part 2 (Advanced Simple Profile) with enhancements tailored for professional post-production. Critically, the D-11 format uses intra-frame coding only (I-frames), meaning each frame is compressed independently without reference to preceding or following frames.
This design choice has profound implications for editing: because each frame is a complete entity, cuts can be made at any frame boundary without the need to decode a Group of Pictures (GOP) structure. The standard supports two primary bitrate modes:
Parts 3 and 4 of IEC 62356 address the logical organization of audio, video, and metadata. Part 3 defines the Packetized Elementary Stream (PES) structure used to multiplex up to eight channels of uncompressed PCM audio (24-bit, 48 kHz) with the compressed video stream and timecode metadata.
Part 4 specifies the MXF (Material eXchange Format) file wrapper, enabling D-11 content to be transferred over IT networks and ingested directly into non-linear editing systems without real-time tape playback. The MXF wrapper uses KLV (Key-Length-Value) encoding for metadata, ensuring that critical production metadata—including timecode, reel name, and shot logging information—travels with the essence data.
D-11 uses MPEG-4 intra-frame compression at 100 Mbps, while HDCAM SR uses MPEG-4 Studio Profile at up to 880 Mbps with some inter-frame compression. D-5 HD is uncompressed (approx. 1.2 Gbps). D-11 occupies the efficiency sweet spot: it offers excellent quality for most broadcast applications while requiring significantly less tape and bandwidth than uncompressed alternatives. However, for high-end cinematic production requiring absolute transparency, D-5 HD or HDCAM SR remain preferred.
No. D-11 requires a dedicated broadcast VTR deck that implements the full recording and playback chain specified in IEC 62356. These decks connect to editing systems via SDI (Serial Digital Interface) or AES/EBU digital audio interfaces, not through standard computer interfaces like USB or Thunderbolt.
While most modern acquisition uses solid-state media (SxS, P2, CFast cards), D-11 remains relevant for archiving and interchange. Many broadcasters maintain D-11 libraries containing decades of news and program content. Furthermore, the MXF file format specified in Part 4 ensures D-11 content integrates seamlessly into modern IT-based workflows. The format’s detailed specification also makes it a reference for understanding the engineering principles of professional digital recording.
D-11 employs a two-dimensional Reed-Solomon error correction scheme (C1/C2 product code), similar to that used in DVD and Blu-ray. The C1 code corrects errors within a single sync block, while the C2 code provides cross-interleaved correction across multiple blocks. Additionally, a powerful concealment algorithm interpolates missing data from neighboring pixels when error correction is exhausted. The result is a correctable error rate of approximately 1×10⁻¹⁵ after full decoding.