Interchange converters—8-bit

This converter provides conversions between internal code and 8-bit standard interchange formats (fold8).

The fold8 name identifies encodings that can be used to pass text data through 8-bit mail protocols. The encodings are based on ISO2022.

The fold8 converters convert characters from a specific code set encoding to a canonical 8-bit encoding that identifies each character. This type of conversion is useful in networks where clients communicate with different code sets but use the same character sets. For example:

Code set conversion Description
IBM-850 <—> ISO8859-1 Common Latin characters
IBM-943 <—> IBM-eucJP Common Japanese characters

The following escape sequences designate standard code sets.

Escape Sequence Standard Code Set
01/11 02/04 02/09 04/01 GR right half of GB2312.1980-0.
01/11 02/13 04/01 GR right half of ISO8859-1.
01/11 02/13 04/02 GR right half of ISO8859-2.
01/11 02/13 04/03 GR right half of ISO8859-3.
01/11 02/13 04/04 GR right half of ISO8859-4.
01/11 02/13 04/06 GR right half of ISO8859-7.
01/11 02/13 04/07 GR right half of ISO8859-6.
01/11 02/13 04/08 GR right half of ISO8859-8.
01/11 02/13 04/12 GR right half of ISO8859-5.
01/11 02/13 04/13 GR right half of ISO8859-9.
01/11 02/09 04/09 GR right half of JIS X0201.1976-1.
01/11 02/04 02/09 04/02 GR JIS X0208.1983-1.
01/11 02/04 02/09 04/00 GR JISX0208.1978-1.
01/11 02/09 04/02 GR 7-bit ASCII or left half of ISO8859-1.
01/11 02/05 02/15 03/01 M L 04/09 04/02 04/13 02/13 03/08 03/05 03/00 00/02 GR right half of IBM-850 unique characters. Characters common to ISO8859-1 should not use this escape sequence.
01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/05 06/04 06/03 04/10 05/00 00/02 GR right half of Japanese user-definable characters.
01/11 02/08 04/02 GL 7-bit ASCII or left half of ISO8859-1.
01/11 02/14 04/01 GL right half of ISO8859-1.
01/11 02/14 04/02 GL right half of ISO8859-2.
01/11 02/14 04/03 GL right half of ISO8859-3.
01/11 02/14 04/04 GL right half of ISO8859-4.
01/11 02/14 04/06 GL right half of ISO8859-7.
01/11 02/14 04/07 GL right half of ISO8859-6.
01/11 02/14 04/08 GL right half of ISO8859-8.
01/11 02/14 04/12 GL right half of ISO8859-5.
01/11 02/14 04/13 GL right half of ISO8859-9.
01/11 02/08 04/09 GL right half of JIS X0201.1976-0.
01/11 02/08 04/10 GL left half of JIS X0201.1976.
01/11 02/04 02/08 04/02 GL JIS X0208.1983-0.
01/11 02/04 04/02 GL JIS X0208.1983-0.
01/11 02/04 04/00 GL JIS X0208.1978-0.
01/11 02/05 02/15 03/01 M L 06/09 06/02 06/13 02/13 03/08 03/05 03/00 00/02 GL right half of IBM-850 unique characters. Characters common to ISO8859-1 do not use this escape sequence.
01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/05 06/04 06/03 04/10 05/00 00/02 GL Japanese (IBM-udcJP) user-definable characters.
01/11 02/04 02/09 04/03 GR KSC5601-1987.
01/11 02/04 02/09 03/00 GR CNS11643-1986-1.
01/11 02/04 02/10 03/01 GR CNS11643-1986-2.
01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/05 06/04 06/03 05/05 05/08 00/02 GR right half of Traditional Chinese user-definable characters.
01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/03 06/02 06/04 05/05 05/08 00/02 GR right half of IBM-850 unique symbols.
01/11 02/04 02/08 04/03 GL KSC5601-1987.
01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/05 06/04 06/03 05/05 05/08 00/02 GL Traditional Chinese (IBM-udcTW) user-definable characters.
01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/03 06/02 06/04 05/05 05/08 00/02 GL Traditional Chinese IBM-850 unique symbols (IBM-shdTW) user-definable characters.
01/11 02/05 02/15 03/00 M L 05/05 05/04 04/06 02/13 03/08 00/02 UCS-2 encoded as UTF-8; used only for those characters not encoded by any of the above escape sequences listed above.

When converting from a code set to fold8, the escape sequence used to designate the code set is chosen according to the order listed. For example, the JISX0208.1983-0 characters use 01/11 02/04 02/08 04/02 as the designation.