Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
IEC 61121 (Ed. 3.0, 2013) defines the international standard for the Compact Disc Digital Audio System (CD-DA). Originating from the joint Red Book specification published by Philips and Sony in 1980, it stands as one of the most influential optical storage formats in consumer electronics history. As the definitive reference for the physical format and signal encoding of CD audio, IEC 61121 laid the technical foundation for a generation of derived standards: CD-ROM (Yellow Book), CD-R (Orange Book), CD-RW, and beyond.
The CD-DA system replaced both vinyl records and compact cassettes, establishing a new paradigm for digital audio distribution. IEC 61121 specifies every aspect of the system: physical disc dimensions (120 mm diameter, 1.2 mm thickness), optical readout parameters, channel encoding (EFM), error correction (CIRC), subcode structure, and playback compatibility requirements.
The CD-DA disc uses a polycarbonate substrate measuring 120 mm in diameter (80 mm mini-discs are also specified) with a 15 mm central hole. Data is recorded as a spiral track of pits (depressions) and lands (flat areas), read from the inner radius outward under Constant Linear Velocity (CLV) control at a scanning speed of 1.2–1.4 m/s. The pits are approximately 0.11 μm deep (one-quarter of the 780 nm laser wavelength in the polycarbonate medium, giving an optical path difference of half a wavelength for maximum signal contrast), roughly 0.5 μm wide, and vary in length from 0.83 μm (3T) to 3.05 μm (11T). The track pitch is 1.6 μm.
| Parameter | Value | Description |
|---|---|---|
| Sampling Rate | 44.1 kHz | Satisfies 20 kHz bandwidth per Nyquist criterion |
| Quantization | 16-bit linear PCM | Theoretical dynamic range: 96 dB; practical: 90–95 dB |
| Channels | 2 (stereo) | Independent left/right sampling and quantization |
| Audio Data Rate | 176.4 kB/s | 44,100 x 16 x 2 / 8 = 176,400 B/s |
| Channel Bit Rate | 4.3218 Mbps | After EFM modulation and merging bits |
| Error Correction | CIRC (Cross-Interleaved Reed-Solomon Code) | Two-stage C1/C2 decoding; burst correction up to 4,000 bits (~2.5 mm) |
| Modulation | EFM (Eight-to-Fourteen Modulation) | 8-bit data -> 14-bit channel symbol + 3 merge bits; RLL(2,10) constraint |
IEC 61121 specifies the Cross-Interleaved Reed-Solomon Code (CIRC) as the error correction strategy. CIRC is a two-stage, cross-interleaved block code that processes audio data through an outer C2 encoder RS(28,24) and an inner C1 encoder RS(32,28) during recording. Decoding proceeds in the reverse order: the C1 decoder first corrects random errors and short burst errors, and the C2 decoder handles longer burst errors using erasure correction when C1 flags unreliable symbols.
The genius of CIRC lies in its interleaving strategy: delay lines of increasing length (from 1 to 4 frames in D1 through D4 stages) distribute adjacent audio samples across a span of 108 frames (approximately 4,536 audio samples, representing about 6,800 channel bits or 2.5 mm of track length). A scratch on the disc surface that obliterates a contiguous block of pits is thus spread across many codewords, each seeing only a small number of corrupted symbols. The C1 decoder (inherently capable of correcting up to 2 symbol errors per 32-symbol block) handles most of the fallout; remaining errors are passed to the C2 decoder which applies stronger RS(28,24) decoding. For errors exceeding correction capability, the system falls back to concealment — either sample-hold (repeating the previous good value) or linear interpolation between surrounding valid samples.
Eight-to-Fourteen Modulation (EFM) is the channel code used in the CD-DA system. Every 8-bit data byte (audio sample or subcode) is mapped via lookup table to a 14-bit channel symbol, to which 3 merge bits are appended for run-length constraint compliance. The EFM code enforces a (2,10) Run-Length Limited (RLL) constraint: the number of channel clock periods between successive transitions (pit-to-land or land-to-pit) must be between 3 and 11 inclusive.
With a channel bit period T = 231.4 ns (at 4.3218 Mbps), the shortest pit corresponds to 3T = 0.83 μm and the longest to 11T = 3.05 μm. The RLL(2,10) constraint serves three critical purposes:
After CIRC encoding, each frame (588 channel bits, 24 audio bytes) includes one subcode byte, divided into eight subcode channels designated P, Q, R, S, T, U, V, and W — each carrying one bit per frame at a rate of 7.35 kb/s per channel. The P channel provides a simple track-start flag (pre-emphasis and track number markers). The Q channel carries substantially richer information: track number (TNO, 1–99), index point (INDEX, 01–99), running time in minutes/seconds/frames (75 frames/s), and optionally the catalog number (UPC/EAN) and ISRC identification.
The Q-channel time code is encoded in two forms: absolute time measured from the lead-in start (A-time) and relative time measured from the current track start (T-time). CD players use this data for random track access, elapsed/remaining time display, and A–B repeat functions. Later extensions such as CD-Text exploit the R–W subcode channels for storing metadata and graphic information.
The canonical 74-minute maximum playback time specified in IEC 61121 arises from straightforward geometry: the program area spans radii from 25 mm to 58 mm, yielding a usable recording area of approximately 86 cm2. At a track pitch of 1.6 μm, the spiral track measures approximately 5.4 km in total length. With linear density of about 0.6 μm per channel bit (1.2–1.4 m/s at 4.3218 Mbps), the raw channel bit capacity is roughly 4.4 billion bits, from which CIRC framing, EFM overhead, and subcode consume approximately 30%, leaving about 783 MB for raw audio — enough for 74 minutes at 176.4 kB/s.
Notably, the 74-minute specification was itself a compromise between competing design proposals: Philips originally favored a 60-minute capacity using a 115 mm disc, while Sony insisted on 74 minutes to accommodate Beethoven’s Ninth Symphony (as conducted by Wilhelm Furtwängler at the 1951 Bayreuth Festival). The compromise resulted in the 120 mm disc format. Through subsequent refinements in track pitch reduction (to 1.5 μm) and improved mastering techniques, practical capacity has been extended to 80 minutes (approximately 700 MiB), though such extended discs push the physical margins of the standard’s compatibility envelope.
Jitter in the CD-DA context refers to timing deviations in channel-bit transitions caused by mechanical speed instability, disc eccentricity, laser pickup vibration, and threshold-crossing uncertainty in EFM signal detection. Excessive jitter pushes the data toward the limits of CIRC correction capability; when the jitter budget is exhausted, uncorrectable C2 errors produce audible artifacts — clicks, pops, or momentary dropouts.
CD player designers employ a layered approach to jitter mitigation. First, the PLL-based clock data recovery (CDR) extracts a stable bit clock from the EFM signal’s transition edges, with the loop filter bandwidth carefully chosen to track low-frequency wow-and-flutter while rejecting high-frequency noise. Second, a FIFO buffer (typically 4–16 frames, or 0.8–3.3 ms of audio) decouples the variable-rate read channel from the fixed-rate audio output. Third, the CIRC interleaving itself provides inherent resilience against short-duration timing errors. In professional ripping scenarios, multiple reads with confidence checking, C2 error flag interrogation, and offset correction are all required to achieve the bit-accurate results demanded by archival-quality digital audio preservation.