Interchange converters—8-bit
This converter provides conversions between internal code and 8-bit standard interchange formats (fold8).
The fold8 name identifies encodings that can be used to pass text data through 8-bit mail protocols. The encodings are based on ISO2022.
The fold8 converters convert characters from a specific code set encoding to a canonical 8-bit encoding that identifies each character. This type of conversion is useful in networks where clients communicate with different code sets but use the same character sets. For example:
| Code set conversion | Description |
|---|---|
| IBM-850 <—> ISO8859-1 | Common Latin characters |
| IBM-943 <—> IBM-eucJP | Common Japanese characters |
The following escape sequences designate standard code sets.
| Escape Sequence | Standard Code Set |
|---|---|
| 01/11 02/04 02/09 04/01 | GR right half of GB2312.1980-0. |
| 01/11 02/13 04/01 | GR right half of ISO8859-1. |
| 01/11 02/13 04/02 | GR right half of ISO8859-2. |
| 01/11 02/13 04/03 | GR right half of ISO8859-3. |
| 01/11 02/13 04/04 | GR right half of ISO8859-4. |
| 01/11 02/13 04/06 | GR right half of ISO8859-7. |
| 01/11 02/13 04/07 | GR right half of ISO8859-6. |
| 01/11 02/13 04/08 | GR right half of ISO8859-8. |
| 01/11 02/13 04/12 | GR right half of ISO8859-5. |
| 01/11 02/13 04/13 | GR right half of ISO8859-9. |
| 01/11 02/09 04/09 | GR right half of JIS X0201.1976-1. |
| 01/11 02/04 02/09 04/02 | GR JIS X0208.1983-1. |
| 01/11 02/04 02/09 04/00 | GR JISX0208.1978-1. |
| 01/11 02/09 04/02 | GR 7-bit ASCII or left half of ISO8859-1. |
| 01/11 02/05 02/15 03/01 M L 04/09 04/02 04/13 02/13 03/08 03/05 03/00 00/02 | GR right half of IBM-850 unique characters. Characters common to ISO8859-1 should not use this escape sequence. |
| 01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/05 06/04 06/03 04/10 05/00 00/02 | GR right half of Japanese user-definable characters. |
| 01/11 02/08 04/02 | GL 7-bit ASCII or left half of ISO8859-1. |
| 01/11 02/14 04/01 | GL right half of ISO8859-1. |
| 01/11 02/14 04/02 | GL right half of ISO8859-2. |
| 01/11 02/14 04/03 | GL right half of ISO8859-3. |
| 01/11 02/14 04/04 | GL right half of ISO8859-4. |
| 01/11 02/14 04/06 | GL right half of ISO8859-7. |
| 01/11 02/14 04/07 | GL right half of ISO8859-6. |
| 01/11 02/14 04/08 | GL right half of ISO8859-8. |
| 01/11 02/14 04/12 | GL right half of ISO8859-5. |
| 01/11 02/14 04/13 | GL right half of ISO8859-9. |
| 01/11 02/08 04/09 | GL right half of JIS X0201.1976-0. |
| 01/11 02/08 04/10 | GL left half of JIS X0201.1976. |
| 01/11 02/04 02/08 04/02 | GL JIS X0208.1983-0. |
| 01/11 02/04 04/02 | GL JIS X0208.1983-0. |
| 01/11 02/04 04/00 | GL JIS X0208.1978-0. |
| 01/11 02/05 02/15 03/01 M L 06/09 06/02 06/13 02/13 03/08 03/05 03/00 00/02 | GL right half of IBM-850 unique characters. Characters common to ISO8859-1 do not use this escape sequence. |
| 01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/05 06/04 06/03 04/10 05/00 00/02 | GL Japanese (IBM-udcJP) user-definable characters. |
| 01/11 02/04 02/09 04/03 | GR KSC5601-1987. |
| 01/11 02/04 02/09 03/00 | GR CNS11643-1986-1. |
| 01/11 02/04 02/10 03/01 | GR CNS11643-1986-2. |
| 01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/05 06/04 06/03 05/05 05/08 00/02 | GR right half of Traditional Chinese user-definable characters. |
| 01/11 02/05 02/15 03/02 M L 04/09 04/02 04/13 02/13 07/03 06/02 06/04 05/05 05/08 00/02 | GR right half of IBM-850 unique symbols. |
| 01/11 02/04 02/08 04/03 | GL KSC5601-1987. |
| 01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/05 06/04 06/03 05/05 05/08 00/02 | GL Traditional Chinese (IBM-udcTW) user-definable characters. |
| 01/11 02/05 02/15 03/02 M L 06/09 06/02 06/13 02/13 07/03 06/02 06/04 05/05 05/08 00/02 | GL Traditional Chinese IBM-850 unique symbols (IBM-shdTW) user-definable characters. |
| 01/11 02/05 02/15 03/00 M L 05/05 05/04 04/06 02/13 03/08 00/02 | UCS-2 encoded as UTF-8; used only for those characters not encoded by any of the above escape sequences listed above. |
When converting from a code set to fold8, the escape sequence used to designate the code set is chosen according to the order listed. For example, the JISX0208.1983-0 characters use 01/11 02/04 02/08 04/02 as the designation.