SetInputCharacterSet

SetInputCharacterSet function lets you specify the character set used in a TurboIntegrator data source.

This function is valid in TM1® TurboIntegrator processes only.

When a TurboIntegrator process reads an external file as input, it needs to know the character set in which that external file was written. If the file contains a valid byte-order-mark, TM1 functions will correctly convert the file to UTF-8 if required.

For formats lacking a valid byte-order-mark, the characters must be converted from some other encoding to UTF-8. If the proper converters are present on the machine hosting the server, the input file will be converted to the Unicode character set required by TM1.

Syntax

SetInputCharacterSet (CharacterSet);

Argument

Description

CharacterSet

The character encoding in the input file to be used by the TurboIntegrator process.

If the CharacterSet argument is not a known character type, the type defaults to the system locale.

These are the valid values for CharacterSet.

Character Encoding

System Locale

TM1CS_ISO_8859_1

ISO-8859-1 Latin-1, Western Europe

TM1CS_ISO_8859_2

ISO-8859-2 Latin-2, Central Europe

TM1CS_ISO_8859_3

ISO-8859-3 Latin-3, South Europe

TM1CS_ISO_8859_4

ISO-8859-4 Latin-4, North Europe

TM1CS_ISO_8859_5

ISO-8859-5 Latin/Cyrillic

TM1CS_ISO_8859_6

ISO-8859-6 Latin/Arabic

TM1CS_ISO_8859_7

ISO-8859-7 Latin/Greek

TM1CS_ISO_8859_8

ISO-8859-8 Latin/Hebrew

TM1CS_ISO_8859_9

ISO-8859-9 Latin-5, Turkish

TM1CS_ISO_8859_10

ISO-8859-10 Latin-6, Nordic,

TM1CS_ISO_8859_11

ISO-8859-11 Latin/Thai

TM1CS_ISO_8859_13

ISO-8859-13 Latin-7, Baltic Rim

TM1CS_ISO_8859_14

ISO-8859-14 Latin-8, Celtic

TM1CS_ISO_8859_15

ISO-8859-15 Latin-9, replaces ISO-8859-1

TM1CS_ISO_8859_16

ISO-8859-16 Latin-10, South-Eastern Europe

TM1CS_WCP1250

Microsoft Windows Central Europe

TM1CS_WCP1251

Windows Cyrillic

TM1CS_WCP1252

Windows Latin-1 multilingual

TM1CS_WCP1253

Windows Greek

TM1CS_WCP1254

Windows Turkish

TM1CS_WCP1255

Windows Hebrew

TM1CS_WCP1256

Windows Arabic

TM1CS_WCP1257

Windows Baltic

TM1CS_WCP1258

Windows Vietnam

TM1CS_WCP874

Windows Thai

TM1CS_WCP932

Windows Japanese

TM1CS_WCP936

Windows Simplified Chinese

TM1CS_WCP949

Windows Korean

TM1CS_WCP950

Windows Traditional Chinese

TM1CS_KOI8R

Russian and Cyrillic (KOI8-R)

TM1CS_GB18030

PRC version UNICODE

TM1CS_BIG5

Traditional Chinese

TM1CS_SHIFTJIS

JIS 0201 + JIS 0208, slightly different from CP932

TM1CS_SJIS0213

JIS 0213-2004, non-BMP required.

TM1CS_EUC_JP

EUC Japanese

TM1CS_EUC_CN

EUC Simplified Chinese

TM1CS_EUC_KR

EUC Korean

TM1CS_UTF8

UTF-8

TM1CS_UTF16

UTF-16 Little Endian

TM1CS_UTF16ESC

UNICODE notation

TM1CS_UTF32

UTF-32 Little Endian

TM1CS_OS_DEFAULT

operating system default

TM1CS_LOCALPATH

local encoding but UNICODE notation on non-native.

Example

SetInputCharacterSet ('TM1CS_ISO_8859_11');

This example specifies that the input character set for the TurboIntegrator data source is ISO-8859-11 Latin/Thai.