SetInputCharacterSet

The SetInputCharacterSet function lets you specify the character set used in a TurboIntegrator data source.

This function is valid in TM1® TurboIntegrator processes only.

When a TurboIntegrator process reads an external file as input, it needs to know the character set in which that external file was written. If the file contains a valid byte-order-mark, TM1 functions will correctly convert the file to UTF-8 if required.

For formats lacking a valid byte-order-mark, the characters must be converted from some other encoding to UTF-8. If the proper converters are present on the machine hosting the server, the input file will be converted to the Unicode character set required by TM1.

Syntax

SetInputCharacterSet (CharacterSet);

Argument	Description
CharacterSet	The character encoding in the input file to be used by the TurboIntegrator process. If the CharacterSet argument is not a known character type, the type defaults to the system locale.

These are the valid values for CharacterSet.

Character Encoding	Description
TM1CS_ISO_8859_1	ISO-8859-1 Latin-1, Western Europe
TM1CS_ISO_8859_2	ISO-8859-2 Latin-2, Central Europe
TM1CS_ISO_8859_3	ISO-8859-3 Latin-3, South Europe
TM1CS_ISO_8859_4	ISO-8859-4 Latin-4, North Europe
TM1CS_ISO_8859_5	ISO-8859-5 Latin/Cyrillic
TM1CS_ISO_8859_6	ISO-8859-6 Latin/Arabic
TM1CS_ISO_8859_7	ISO-8859-7 Latin/Greek
TM1CS_ISO_8859_8	ISO-8859-8 Latin/Hebrew
TM1CS_ISO_8859_9	ISO-8859-9 Latin-5, Turkish
TM1CS_ISO_8859_10	ISO-8859-10 Latin-6, Nordic
TM1CS_ISO_8859_11	ISO-8859-11 Latin/Thai
TM1CS_ISO_8859_13	ISO-8859-13 Latin-7, Baltic Rim
TM1CS_ISO_8859_14	ISO-8859-14 Latin-8, Celtic
TM1CS_ISO_8859_15	ISO-8859-15 Latin-9, replaces ISO-8859-1
TM1CS_ISO_8859_16	ISO-8859-16 Latin-10, South-Eastern Europe
TM1CS_WCP1250	Microsoft Windows Central Europe
TM1CS_WCP1251	Windows Cyrillic
TM1CS_WCP1252	Windows Latin-1 multilingual
TM1CS_WCP1253	Windows Greek
TM1CS_WCP1254	Windows Turkish
TM1CS_WCP1255	Windows Hebrew
TM1CS_WCP1256	Windows Arabic
TM1CS_WCP1257	Windows Baltic
TM1CS_WCP1258	Windows Vietnamese
TM1CS_WCP874	Windows Thai
TM1CS_WCP932	Windows Japanese
TM1CS_WCP936	Windows Simplified Chinese
TM1CS_WCP949	Windows Korean
TM1CS_WCP950	Windows Traditional Chinese
TM1CS_KOI8R	Russian and Cyrillic (KOI8-R)
TM1CS_GB18030	PRC version UNICODE
TM1CS_BIG5	Traditional Chinese
TM1CS_SHIFTJIS	JIS 0201 + JIS 0208, slightly different from CP932
TM1CS_SJIS0213	JIS 0213-2004, non-BMP required
TM1CS_EUC_JP	EUC Japanese
TM1CS_EUC_CN	EUC Simplified Chinese
TM1CS_EUC_KR	EUC Korean
TM1CS_UTF8	UTF-8
TM1CS_UTF16	UTF-16 Little Endian
TM1CS_UTF16ESC	UNICODE notation
TM1CS_UTF32	UTF-32 Little Endian
TM1CS_OS_DEFAULT	operating system default
TM1CS_LOCALPATH	local encoding but UNICODE notation on non-native

Example

SetInputCharacterSet ('TM1CS_ISO_8859_11');

This example specifies that the input character set for the TurboIntegrator data source is ISO-8859-11 Latin/Thai.
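
As a rough sketch of where the call might sit in a process, the lines below assume a text-file data source that has no byte-order mark and was saved in Windows Latin-1. The file path is hypothetical, and placing the call on the Prolog tab is an assumption based on common practice rather than something this page states.

```
# Prolog tab (sketch only; the path below is hypothetical)
DatasourceNameForServer = 'C:\TM1Data\import\customers.csv';

# Declare the encoding of the input file so that TM1 can convert it to UTF-8.
SetInputCharacterSet ('TM1CS_WCP1252');
```

If the file already carries a valid byte-order mark, the call should not be needed, because TM1 detects the encoding from the mark.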