Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
IEC 62731-2018, prepared by IEC TC 100 (Audio, video and multimedia systems and equipment), specifies the text-to-speech (TTS) functionality for digital television receivers. This second edition supersedes the 2013 edition and adds significant improvements, including network-based updates to TTS pronunciation dictionaries and enhanced announcement quality levels. The standard applies to stationary and semi-stationary digital TV receivers such as set-top boxes, integrated digital TVs, and recorders whose primary function is TV reception. It does NOT apply to products where TV is a secondary function, such as PCs or game consoles with TV tuners.
The standard identifies five key areas of user requirements based on research with visually impaired television viewers:
A key innovation in the standard is the concept of context-aware TTS. The TV system recognizes which context the user is in (watching TV, browsing EPG, adjusting settings) and provides appropriate audio feedback. For example, when changing channels, the TTS should announce the channel name, program title, and start/end time, while in the EPG context, it should announce program descriptions and scheduling information.
IEC 62731 defines three distinct TTS profiles with increasing levels of functionality:
| Profile | Level | Features | Target Use Case |
|---|---|---|---|
| Basic | Entry | Channel change announcement, volume indication, simple menu navigation | Low-cost receivers, basic accessibility |
| Main | Standard | Full EPG reading, program information, subtitle rendering, input switching | Mainstream TVs with accessibility focus |
| Enhanced | Advanced | All Main features plus interactive services, smart TV apps, advanced navigation | Premium smart TVs, comprehensive accessibility |
The standard specifies that the TTS engine must:
The standard defines a comprehensive mapping between TV events and the corresponding TTS announcement data. The event state machine includes: channel change, program start/end, EPG navigation, menu selection, pop-up messages, input source switching, and context switches. For each event, the standard specifies exactly which information elements must be included in the TTS output.
A: The standard applies to receivers whose primary function is to receive TV content. While the focus is on broadcast digital TV, the TTS principles and profiles are applicable to streaming and hybrid devices. However, PCs and game consoles with secondary TV capability are explicitly excluded.
A: The standard requires support for the language of the user interface and broadcast content but does not mandate specific languages. In practice, manufacturers implement TTS for the languages used in their target markets. The network-updateable pronunciation feature introduced in the 2018 edition helps address multilingual content.
A: Subtitles (closed captions) are treated as TTS data that must be read when the subtitle display is active. The TTS can either read subtitles as they appear or on demand. The standard includes provisions for handling subtitle format transitions, multiple subtitle tracks, and synchronization with program audio.
A: TTS audio is the normal speech output generated from on-screen text. Priority audio information relates to emergency alerts and critical system messages that must interrupt normal TTS output. The standard mandates that priority audio takes precedence over all other TTS output and cannot be suppressed by user settings.