HTML coded character sets
This reference lists the supported IANA-registered character
set names (specified as charset=
values in HTTP headers),
and the IBM® CCSID equivalents.
All of these values are valid for code page conversion
options on the following commands:
Language | Coded character set | IANA charset | IBM CCSID |
---|---|---|---|
Albanian | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Arabic | ISO/IEC 8859-6 | iso-8859-6 | 1089 |
Bulgarian | Windows 1251 | windows-1251 | 1251 |
Byelorussian | Windows 1251 | windows-1251 | 1251 |
Catalan | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Chinese (simplified) | GB | gb2312 | 1381 or 5477 |
Chinese (traditional) | Big 5 | big5 | 950 |
Croatian | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Czech | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Danish | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Dutch | ISO/IEC 8859-1 | iso-8859-1 | 819 |
English | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Estonian | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Finnish | ISO/IEC 8859-1 | iso-8859-1 | 819 |
French | ISO/IEC 8859-1 | iso-8859-1 | 819 |
German | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Greek | ISO/IEC 8859-7 | iso-8859-7 | 813 |
Hebrew | ISO/IEC 8859-8 | iso-8859-8 | 916 |
Hungarian | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Italian | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Japanese | Shift JIS | x-sjis or shift-jis | 943 (932, a subset of 943, is also valid) |
Japanese | EUC Japanese | euc-jp | 5050 (EUC) |
Korean | EUC Korean | euc-kr | 970 (for AIX® or Unix) |
Latvian | Windows 1257 | windows-1257 | 1257 |
Lithuanian | Windows 1257 | windows-1257 | 1257 |
Macedonian | Windows 1257 | windows-1257 | 1251 |
Norwegian | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Polish | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Portuguese | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Romanian | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Russian | Windows 1251 | windows-1251 | 1251 |
Serbian (Cyrillic) | Windows 1251 | windows-1251 | 1251 |
Serbian (Latin 2) | Windows 1250 | windows-1250 | 1250 |
Slovakian | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Slovenian | ISO/IEC 8859-2 | iso-8859-2 | 912 |
Spanish | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Spanish | ISO/IEC 8859-15 | iso-8859-15 | 923 |
Swedish | ISO/IEC 8859-1 | iso-8859-1 | 819 |
Turkish | ISO/IEC 8859-9 | iso-8859-9 | 920 |
Ukrainian | Windows 1251 | windows-1251 | 1251 |
Unicode | UCS-2 | iso-10646-ucs-2 | 1200 (growing) or 13488 (fixed) |
Unicode | UTF-16 | utf-16 | 1200 |
Unicode | UTF-16 big-endian | utf-16be | 1201 |
Unicode | UTF-16 little-endian | utf-16le | 1202 |
Unicode | UTF-8 | utf-8 | 1208 |