1. INTRODUCTION
With the growing internationalization of the Internet, support for many coded character sets is required. It is the intention of this memo to document precisely the mapping between all characters and their corresponding coded representations in various coded character sets, and give names to these coded character sets, so they can be referenced unambiguously in Internet standards. This memo does not indicate anything about the validity of using these specifications in any Internet standard, so you should consult each individual Internet standard to see which coded character sets and names are allowed there. Unambiguous character mnemonics are specified, which provide a practical way of identifying a character, without reference to a coded character set and its code in this coded character set. The mnemonics are written in a minimal set of characters, namely the invariant 83 graphical characters of ISO 646, which is a kind of greatest common subset to be found between the majority of coded character sets, including ASCII, national variants of the ISO 646 7- bit character set and various EBCDICs. In addition, the numeric value of the coded representations of all these characters are the same in all coded character sets compatible with ISO standards. All of them except two, EXCLAMATION MARK and QUOTATION MARK, have the same coded representation in all variants of EBCDIC. This minimal set of characters is called the reference character set in this memo. The mnemonics can be used in Internet standards for easy and unambiguous reference, and they can also serve as a fallback representation in various Internet specifications. The coded character sets covered include all parts of ISO 8859, ISO 6937-2 and all ISO 646 conforming coded character sets in the ISO character set registry managed by ECMA according to ISO 2375. Almost all graphic coded character sets in the ECMA registry [1] are covered. The graphic coded character sets not included are registry numbers 31, 38, 39, 53, 59, 68, 71, 72, 129 and 137. In addition many vendor defined character sets are covered, including PC codepages [4], [7], [8], many EBCDIC character sets [4], [5], [6] and HP, DEC and Apple character sets [8], [9], [10], [13], [14]. The East-Asian 16-bit character sets from the ECMA registry is also included in this memo.
