SetInputCharacterSet
SetInputCharacterSet function lets you specify the character set used in a TurboIntegrator data source.
This function is valid in TM1® TurboIntegrator processes only.
When a TurboIntegrator process reads an external file as input, it needs to know the character set in which that external file was written. If the file contains a valid byte-order-mark, TM1 functions will correctly convert the file to UTF-8 if required.
For formats lacking a valid byte-order-mark, the characters must be converted from some other encoding to UTF-8. If the proper converters are present on the machine hosting the server, the input file will be converted to the Unicode character set required by TM1.
Syntax
SetInputCharacterSet (CharacterSet);
Argument |
Description |
---|---|
CharacterSet |
The character encoding in the input file to be used by the TurboIntegrator process. If the CharacterSet argument is not a known character type, the type defaults to the system locale. |
These are the valid values for CharacterSet.
Character Encoding |
System Locale |
---|---|
TM1CS_ISO_8859_1 |
ISO-8859-1 Latin-1, Western Europe |
TM1CS_ISO_8859_2 |
ISO-8859-2 Latin-2, Central Europe |
TM1CS_ISO_8859_3 |
ISO-8859-3 Latin-3, South Europe |
TM1CS_ISO_8859_4 |
ISO-8859-4 Latin-4, North Europe |
TM1CS_ISO_8859_5 |
ISO-8859-5 Latin/Cyrillic |
TM1CS_ISO_8859_6 |
ISO-8859-6 Latin/Arabic |
TM1CS_ISO_8859_7 |
ISO-8859-7 Latin/Greek |
TM1CS_ISO_8859_8 |
ISO-8859-8 Latin/Hebrew |
TM1CS_ISO_8859_9 |
ISO-8859-9 Latin-5, Turkish |
TM1CS_ISO_8859_10 |
ISO-8859-10 Latin-6, Nordic, |
TM1CS_ISO_8859_11 |
ISO-8859-11 Latin/Thai |
TM1CS_ISO_8859_13 |
ISO-8859-13 Latin-7, Baltic Rim |
TM1CS_ISO_8859_14 |
ISO-8859-14 Latin-8, Celtic |
TM1CS_ISO_8859_15 |
ISO-8859-15 Latin-9, replaces ISO-8859-1 |
TM1CS_ISO_8859_16 |
ISO-8859-16 Latin-10, South-Eastern Europe |
TM1CS_WCP1250 |
Microsoft Windows Central Europe |
TM1CS_WCP1251 |
Windows Cyrillic |
TM1CS_WCP1252 |
Windows Latin-1 multilingual |
TM1CS_WCP1253 |
Windows Greek |
TM1CS_WCP1254 |
Windows Turkish |
TM1CS_WCP1255 |
Windows Hebrew |
TM1CS_WCP1256 |
Windows Arabic |
TM1CS_WCP1257 |
Windows Baltic |
TM1CS_WCP1258 |
Windows Vietnam |
TM1CS_WCP874 |
Windows Thai |
TM1CS_WCP932 |
Windows Japanese |
TM1CS_WCP936 |
Windows Simplified Chinese |
TM1CS_WCP949 |
Windows Korean |
TM1CS_WCP950 |
Windows Traditional Chinese |
TM1CS_KOI8R |
Russian and Cyrillic (KOI8-R) |
TM1CS_GB18030 |
PRC version UNICODE |
TM1CS_BIG5 |
Traditional Chinese |
TM1CS_SHIFTJIS |
JIS 0201 + JIS 0208, slightly different from CP932 |
TM1CS_SJIS0213 |
JIS 0213-2004, non-BMP required. |
TM1CS_EUC_JP |
EUC Japanese |
TM1CS_EUC_CN |
EUC Simplified Chinese |
TM1CS_EUC_KR |
EUC Korean |
TM1CS_UTF8 |
UTF-8 |
TM1CS_UTF16 |
UTF-16 Little Endian |
TM1CS_UTF16ESC |
UNICODE notation |
TM1CS_UTF32 |
UTF-32 Little Endian |
TM1CS_OS_DEFAULT |
operating system default |
TM1CS_LOCALPATH |
local encoding but UNICODE notation on non-native. |
Example
SetInputCharacterSet ('TM1CS_ISO_8859_11');
This example specifies that the input character set for the TurboIntegrator data source is ISO-8859-11 Latin/Thai.