Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
“content”: “
CSA Z243.32-M1983 (2004), reaffirmed in 2004, establishes a unified 7-bit coded character set for information processing and data interchange across Canadian systems. Developed under the Z243 series by the Canadian Standards Association, this standard provides a common encoding for text data that supports both English and French languages, addressing the federal requirement for bilingual data representation. It specifies 128 code positions (0016 to 7F16), assigning control functions and graphic characters—including accented letters, punctuation, and symbols—necessary for Canadian communication.
The standard aligns with international standards ISO 646 and ISO 2022 but introduces specific substitutions to accommodate the French language, such as the inclusion of é, è, ê, à, ù, ç, and œ (ligature) at code positions that typically house different characters in other national variants. CSA Z243.32-M1983 (2004) is applicable to data storage, transmission, and software systems where bilingual text interchange is critical, including government databases, telecommunication networks, and document processing.
Although originally published in 1983, the 2004 reaffirmation confirms its continued relevance in legacy systems and as a reference for modern Unicode mappings. It serves as a foundation for understanding Canada’s approach to character encoding before widespread adoption of multi-byte standards.
The standard defines a 7-bit code with 128 positions (0x00–0x7F). Positions 0x00–0x1F and 0x7F are reserved for C0 control characters. Positions 0x20–0x7E contain graphic characters. Unlike ISO 646 IRV (International Reference Version), CSA Z243.32 introduces nine substitutions to enable French diacritics and ligatures while preserving ASCII compatibility where possible.
The following table summarizes the key differences between ISO 646 IRV and the Canadian variant defined by CSA Z243.32-M1983 (2004).
| Code (Hex) | ISO 646 IRV Character | CSA Z243.32 Character | Name |
|---|---|---|---|
| 0x23 | # | # | Number sign (shared) |
| 0x24 | $ | $ | Dollar sign (shared) |
| 0x40 | @ | à | Grave accent a (small) |
| 0x5B | [ | â | Circumflex a (small) |
| 0x5C | \ | ç | Cedilla c (small) |
| 0x5D | ] | ê | Circumflex e (small) |
| 0x5E | ^ | ^ | Circumflex accent (shared) |
| 0x60 | ` | ù | Grave accent u (small) |
| 0x7B | { | é | Acute accent e (small) |
| 0x7C | | | œ | Ligature oe (small) |
| 0x7D | } | è | Grave accent e (small) |
| 0x7E | ~ | “ | Diaeresis umlaut |
Control characters follow the ISO 646 assignment. The standard includes additional design notes for escape sequences to switch between national variants according to ISO 2022, but the default designation for Canada is defined using the escape sequence ESC 2/8 4/0 (0x1B 0x28 0x40).
Implementing CSA Z243.32-M1983 (2004) requires understanding its role as both a terminal encoding and a data interchange code. Key implementation aspects include:
ESC 0x28 0x40 for the G0 graphic set.charset=CSA_Z243.32-1983.[ for â), direct interpretation of data as US-ASCII without proper identification will corrupt French text. Always verify encoding metadata.Application protocols that commonly reference CSA Z243.32 include electronic mail (pre-MIME), teletext services, and Canadian government forms processing. In modern environments, Unicode Transformation Formats (UTF-8) have superseded this standard, but understanding Z243.32 remains essential for preserving historical documents and ensuring backward compatibility.
Compliance with CSA Z243.32-M1983 (2004) involves conformance to the character assignment, control function definitions, and escape sequence procedures. The standard itself is reaffirmed (R2004), meaning it is still recognized but may not have active maintenance. Key compliance criteria:
Organizations upgrading from Z243.32 to Unicode should document translation errors or character loss. The standard permits the use of escape sequences to extend the character set for additional accents (e.g., uppercase accented letters not included in the 7-bit table) via supplementary registered sets.
é as U+00E9). Avoid simply stripping diacritics, as it changes the meaning of French words.