IEC 62365: Digital Audio – Transmission of Digital Audio over ATM Networks

💡 Standard Snapshot: IEC 62365 (Edition 2.0, 2009) specifies the method for transmitting digital audio signals over Asynchronous Transfer Mode (ATM) networks. It defines how to pack digital audio samples into ATM cells, establish switched virtual circuits for audio connections, and code audio format information for professional audio applications.

1. Scope and Field of Application

IEC 62365 defines the digital input-output interfacing for transmission of digital audio over ATM networks. It covers the mapping of digital audio signals conforming to the AES3 (AES/EBU) interface standard into ATM Adaptation Layer 1 (AAL1) cells for transport over ATM networks. The standard addresses both point-to-point and point-to-multipoint audio connections in professional audio production, broadcasting, and distribution environments.

The standard supports a wide range of audio sampling frequencies including 32 kHz, 44.1 kHz, and 48 kHz, with provisions for higher sampling rates through multi-channel grouping. It also covers the setup and teardown of switched virtual circuits (SVCs) for dynamic audio connection management.

Parameter Specification per IEC 62365
Supported Sampling Rates 32 kHz, 44.1 kHz, 48 kHz (and multiples)
Audio Resolution Up to 24 bits per sample
Channel Capacity per ATM VC Up to 8 audio channels (AES3 pairs)
ATM Adaptation Layer AAL1
Cell Assembly Delay < 1 ms at 48 kHz sampling
Latency Determined by ATM network configuration
Jitter Tolerance Per ATM network QoS Class 1

2. Audio Data Format in ATM Cells

2.1 Audio Sample Format

The standard specifies that audio samples are formatted according to the AES3 digital audio interface standard. Each audio sample consists of up to 24 bits of audio data plus ancillary data bits for channel status, user data, and parity. The standard defines how these subframes are packed into the payload of ATM AAL1 cells.

⚠️ Engineering Insight: The choice of packing scheme has significant implications for end-to-end latency. The standard offers two packing schemes: a low-latency scheme that prioritizes minimizing delay (suitable for live sound reinforcement) and an efficient scheme that maximizes bandwidth utilization (suitable for studio recording and archiving). Engineers should select the scheme based on the specific application requirements.

2.2 Packing Schemes

IEC 62365 defines two primary packing schemes:

  • Scheme A (Low Latency): Audio samples are packed sequentially with minimal buffering. Each ATM cell contains samples from a single time instant across multiple channels, reducing the accumulation delay to one sample period. This scheme is ideal for live performance and interactive audio applications.
  • Scheme B (Efficient): Audio samples are grouped temporally, with multiple samples from the same channel packed into consecutive cells. This scheme achieves higher payload efficiency but introduces additional latency proportional to the group size.

3. Connection Management

3.1 Switched Virtual Circuit Setup

The standard defines the signaling procedures for establishing, maintaining, and releasing ATM switched virtual circuits for audio transport. Key aspects include:

  • Audio call SETUP messages containing audio-specific information elements
  • Bandwidth negotiation based on the number of audio channels and sampling rate
  • Quality of Service (QoS) parameters appropriate for real-time audio (CBR or rt-VBR)
  • Destination response handling, including connection acceptance and rejection

3.2 Audio Format Coding

IEC 62365 specifies a comprehensive coding system for audio format information transmitted during connection setup. This includes:

  • Audio sample frequency coding (32 kHz, 44.1 kHz, 48 kHz, 96 kHz)
  • Audio sample size (16, 20, or 24 bits per sample)
  • Number of audio channels and their assignment
  • Emphasis and pre-emphasis status
  • Channel status data from the AES3 stream
Design Recommendation: When implementing IEC 62365 interfaces, engineers should ensure that the ATM network supports Class 1 (CBR) QoS with tight delay variation (CDV) limits. Even brief periods of network congestion can cause cell loss that manifests as audible clicks or dropouts in the reconstructed audio signal. A jitter buffer of at least 2 ms at the receiving end is recommended to absorb network-induced timing variations.

4. Practical Applications and Migration Considerations

IEC 62365 was widely used in professional audio infrastructure during the era when ATM networks provided the only practical means for high-quality, low-latency digital audio transport over wide-area networks. Applications included:

  • Remote broadcasting (live events, sports commentary)
  • Inter-studio audio distribution for broadcasting networks
  • Post-production facility interconnection
  • Live sound reinforcement for large venues
🚨 Note on Technology Evolution: While ATM-based audio transport as specified by IEC 62365 has largely been superseded by IP-based solutions using AES67, ST 2110-30, and Dante protocols, the standard remains important for understanding the foundational principles of packetized audio transport. Many of the timing, synchronization, and QoS concepts defined in IEC 62365 directly influenced the development of modern audio-over-IP standards.

Frequently Asked Questions (FAQ)

Q1: Can IEC 62365 transport multichannel audio formats like 5.1 surround sound?
Yes. IEC 62365 supports up to 8 audio channels per ATM virtual circuit. For 5.1 surround sound (6 channels), a single VC is sufficient. For larger channel counts, multiple VCs can be used with synchronization maintained through the ATM network’s clock recovery mechanisms.
Q2: What is the typical latency of audio transmitted over ATM per IEC 62365?
With Scheme A (low-latency packing), the end-to-end latency can be as low as 1-3 ms excluding network propagation delay. With Scheme B, latency increases to 5-10 ms depending on the group size. Network-specific delays (switching, queuing) add additional latency based on the ATM network configuration.
Q3: How does IEC 62365 handle clock synchronization between transmitter and receiver?
The standard leverages the AAL1 structured data transfer mechanism, which includes a synchronous residual time stamp (SRTS) method for clock recovery. This allows the receiver to reconstruct the audio sampling clock from the ATM cell stream with high accuracy, maintaining the timing integrity required for professional audio applications.
Q4: Is IEC 62365 compatible with modern IP-based audio networks?
IEC 62365 is specific to ATM networks and is not directly compatible with IP-based audio transport. However, the audio format coding and channel mapping conventions established in IEC 62365 influenced subsequent standards like AES47 (audio over IP) and AES67 (audio-over-IP interoperability). Gateways exist to bridge ATM audio networks with IP-based systems in hybrid environments.

Leave a Reply

Your email address will not be published. Required fields are marked *