Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/IEC 13818-3:2002, often cited as MPEG‑2 Audio Part 3, is an international standard developed by the Moving Picture Experts Group (MPEG) and published jointly by ISO and IEC. It specifies the audio coding portion of the MPEG‑2 multimedia compression system, extending the audio capabilities defined in MPEG‑1 (ISO/IEC 11172‑3). The standard defines coding methods for high‑quality digital audio at low bitrates, supporting up to 5.1 multichannel surround sound while maintaining backward compatibility with stereo MPEG‑1 decoders.
The standard is formally titled “Information technology — Generic coding of moving pictures and associated audio information — Part 3: Audio”. It addresses perceptual audio coding using subband and transform techniques, and it introduces new features such as low sampling frequencies for enhanced speech quality, multichannel coding, and improved compression efficiency through advanced psychoacoustic models.
ISO/IEC 13818-3:2002 defines several audio profiles that tailor the codec complexity and features for different applications. The primary profiles are:
The standard specifies sampling frequencies of 16, 22.05, 24, 32, 44.1, and 48 kHz. Bitrates range from 8 kbps to 1024 kbps, depending on the layer and number of channels.
| Layer | Typical Bitrate Range (kbps/channel) | Common Applications | Quality |
|---|---|---|---|
| Layer I | 64–448 | Digital Audio Tape (DAT), early CD‑ROM | Good at medium bitrates |
| Layer II | 56–384 | DAB, DVD, broadcast, MPEG‑2 Multichannel | High efficiency, widely used |
| Layer III | 32–320 | Internet streaming, portable audio (MP3) | Excellent at low bitrates |
A key requirement of ISO/IEC 13818-3:2002 is backward compatibility with MPEG‑1 audio decoders. This is achieved through a matrixing technique: the center and surround channels are folded into the left and right channels in a compatible manner. An MPEG‑1 decoder reproduces a stereo downmix, while an MPEG‑2 decoder extracts the full multichannel signal. The standard also supports up to 7.1 channels using the Low Frequency Enhancement (LFE) channel.
The standard relies on a perceptual coding model that masks quantization noise according to human auditory thresholds. The encoder analyzes the input signal in blocks of up to 1152 samples, applies a filter bank (polyphase or hybrid), and allocates bits to subbands where the signal is most perceptually significant. The MPEG‑2 extension improves this by providing updated psychoacoustic tables for the new sampling rates and multichannel configurations.
ISO/IEC 13818-3:2002 includes optional error protection through a cyclic redundancy check (CRC) in the bitstream header. For scalable transmission, the standard defines a layered bitstream architecture: the base layer contains a backward‑compatible stereo downmix, and enhancement layers carry the additional channel data. This scalability allows graceful degradation in poor network conditions.
Conformance to ISO/IEC 13818-3:2002 is verified through a suite of test criteria defined in ISO/IEC 13818-4 (MPEG‑2 Conformance Testing). The conformance testing covers:
Effective implementation and compliance with ISO/IEC 13818-3:2002 remain important for systems that rely on backward‑compatible multichannel audio and for understanding the evolution of perceptual audio coding standards in digital broadcasting and storage applications. © 2026