Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
IEC 61104 specifies a 120 mm diameter, 1.2 mm thick single-sided disc constructed from injection-molded polycarbonate substrate, coated with a reflective aluminum layer and overcoated with UV-cured acrylic lacquer. The signal is read from the inner radius outward in a continuous spiral at a constant linear velocity (CLV) of approximately 1.2โ1.4 m/s. The track pitch is 1.6 ฮผm, with pit widths of approximately 0.5 ฮผm and pit depths of roughly 0.125 ฮผm โ corresponding to one-quarter of the 780 nm laser wavelength, producing maximum destructive interference modulation of the reflected signal.
| Parameter | Value | Engineering Significance |
|---|---|---|
| Disc diameter | 120 mm | Matched to 74-minute capacity target |
| Substrate thickness | 1.2 mm | Mechanical rigidity and warp resistance |
| Track pitch | 1.6 ฮผm | Balances density against crosstalk |
| Pit depth | ~0.125 ฮผm (ฮป/4) | Maximizes reflected signal modulation depth |
| Laser wavelength | 780 nm (infrared) | Low-cost GaAlAs semiconductor laser |
| Numerical aperture (NA) | 0.45 | Spot diameter ~1.6 ฮผm, matched to track pitch |
| CLV speed | 1.2โ1.4 m/s | Sustains constant data rate 4.3218 Mb/s |
| Initial max. playback | 74 minutes | Determined by Beethoven’s 9th Symphony length |
The optical pickup employs a three-beam configuration: a central main beam reads the high-frequency (RF) signal, while two satellite beams generate the radial tracking error signal (TES) via the push-pull method. Focus servo uses the astigmatic method โ a cylindrical lens projects reflected light onto a quadrant photodetector, and the focus error is computed as (A+C) โ (B+D). This servo architecture endured for nearly three decades across CD, DVD, and Blu-ray generations.
IEC 61104 mandates a sampling rate of 44.1 kHz with 16-bit linear PCM quantization โ a combination that remains a professional audio reference standard today. The choice of 44.1 kHz is deeply historical: it originated from early PCM digital audio recorders that used NTSC and PAL video formats as storage media. For NTSC, 245 usable lines per frame ร 3 samples per line ร 2 channels ร 59.94 fps = approximately 44,055 samples/s. For PAL, 294 lines ร 3 samples ร 2 channels ร 50 fps = 44,100 samples/s. The IEC ultimately adopted 44.1 kHz as the universal standard.
Sixteen-bit linear quantization provides a theoretical dynamic range of 96.33 dB (6.02 ร 16 + 1.76). Practical CD players achieve 90โ95 dB SNR at the analog output stage, limited by DAC linearity errors and output amplifier noise floor. Notably, early CD players (e.g., those using the Philips TDA1540) employed 14-bit DACs with 4ร oversampling digital filters to relax analog low-pass filter requirements โ a pragmatic cost-performance trade-off.
Each audio frame contains 6 audio samples (3 per channel). Ninety-eight frames form one sector, and 75 sectors are read per second. The raw audio data rate is calculated as:
44,100 samples/s ร 16 bits ร 2 channels = 1,411,200 bps โ 172.3 KB/s
CIRC is the cornerstone of CD-DA data reliability. The encoder comprises two cascaded Reed-Solomon encoders (C2 then C1) with a cross-interleaver between them. Encoding flow: every 24 bytes of audio data โ 4-byte C2 parity appended (RS(28,24)) โ convolutional interleaving with delays of 1โ4 frames โ 4-byte C1 parity appended (RS(32,28)) โ written to disc.
CIRC can correct burst errors up to approximately 4,000 bits (equivalent to a 2.5 mm track-length defect) or detect 12,000-bit burst errors. The two-stage decoding strategy operates as follows: the C1 decoder corrects random errors and flags uncorrectable data as erasures; the C2 decoder then uses the interleaved redundancy to interpolate over missing samples. If C2 also fails, the player performs muting (output zero) or sample-and-hold (repeat last valid sample).
EFM is the channel coding layer: each 8-bit data byte is mapped into a 14-bit channel symbol, with 3 merging bits inserted between symbols to ensure that the number of channel-bit zeros between consecutive ones falls between 2 and 10 (i.e., a run-length constraint of 3Tโ11T). This constraint is essential for both PLL clock recovery and servo stability: too-short runs introduce high-frequency components that interfere with focus/tracking servos; too-long runs increase the risk of PLL drift.
| Parameter | EFM (CD-DA) | EFM+ (DVD) for comparison |
|---|---|---|
| Channel bits / data bit | 17:8 (incl. merge bits) | 16:8 |
| Minimum run-length (Tmin) | 3T (~0.9 ฮผm) | 3T |
| Maximum run-length (Tmax) | 11T (~3.3 ฮผm) | 14T |
| Channel data rate | 4.3218 Mb/s | 26.16 Mb/s |
| Coding efficiency | 47% (8/17) | 50% (8/16) |
EFM also provides excellent low-frequency suppression. Through DSV (Digital Sum Value) control of the merging bits, the EFM signal spectrum approaches zero in the low-frequency range โ critically, this avoids interference with the focus and tracking servo control bands (DC to several tens of kHz). This is a textbook example of system-level frequency-domain engineering in optical storage.
IEC 61104 reserves 8 subcode bits per frame, labeled P through W. The P channel is a simple track-gap flag; the Q channel carries essential navigation data โ absolute time (in minutes:seconds:frames format from the TOC start), track number, index number, and the Table of Contents (TOC). Channels R through W are available for extensions such as CD-Graphics (CD-G) and CD-TEXT.
The Q-subcode TOC is recorded in the Lead-in Area at the innermost radius, storing the start address and track count for all program tracks. Before data playback begins, every CD player reads the TOC to build an in-memory track map โ a architectural pattern that influenced the directory structures of all subsequent optical media formats.
It was intentional. According to Sony executive Norio Ohga (who later became Sony’s president and chairman), 74 minutes was chosen because it could accommodate Beethoven’s 9th Symphony in its entirety โ specifically the 1951 Bayreuth Festival recording conducted by Wilhelm Furtwangler, which runs exactly 74 minutes. Philips had initially proposed a 60-minute, 115 mm disc; the final 120 mm / 74 minute specification was a compromise reached after vigorous debate.
IEC 61104 was superseded by IEC 60908 in 1999. The newer standard consolidated technical corrigenda, incorporated CD-TEXT extensions, added references to CD-R recordable media specifications, and unified the terminology across the CD family (CD-DA, CD-ROM, CD-R, CD-RW) under a single umbrella document.
Not at all. The Nyquist-Shannon sampling theorem dictates that 44.1 kHz can perfectly reconstruct any signal up to 22.05 kHz โ safely covering the human hearing range of roughly 20 Hzโ20 kHz. Higher sampling rates (96 kHz, 192 kHz) preserve ultrasonic content that may be relevant in Hi-Res Audio production and certain perceptual coding contexts, but for final consumer playback, 44.1 kHz / 16-bit remains a transparent, efficient, and fully adequate delivery format. The bottleneck in perceived audio quality is almost always in mastering quality and transducer performance, not the sampling rate.
CLV (Constant Linear Velocity) keeps the data rate constant by varying spindle RPM โ faster at the inner radius, slower at the outer. This is ideal for streaming audio. CAV (Constant Angular Velocity) keeps the spindle speed constant, sacrificing sequential throughput for reduced random-access latency โ better suited to data applications like CD-ROM and DVD-ROM. Modern optical drives often operate in Z-CLV (Zoned CLV) mode, dividing the disc into zones and using CLV within each zone while varying speed between zones, offering a practical compromise between throughput and access time.
IEC 61104 is far more than a technical specification โ it is an engineering landmark. Born from the unprecedented collaboration between Philips and Sony, the standard masterfully balanced storage density, read reliability, manufacturing cost, and playback duration. Every major design decision โ the 44.1 kHz / 16-bit PCM audio coding, the CIRC/EFM tandem for error correction and channel modulation, the three-beam servo architecture, the extensible subcode framework โ reflects the prudence and foresight of engineers navigating the transition from analog to digital.
Although the physical Compact Disc has receded from everyday use in the streaming era, the digital audio foundation laid down by IEC 61104 continues to underpin modern audio production. Without it, CD-ROM, DVD, and Blu-ray might never have existed in their familiar forms. Every audio engineer who reaches for 44.1 kHz today is, knowingly or not, working within the framework that this landmark standard established over four decades ago.