XML and HTML encodings

The HTML and XML editors support the following encodings. The IANA name is used in the encoding statement of an XML file, or charset directive in an HTML file.

XML Encoding (IANA Name)

Description

BIG5

Big5, Traditional Chinese

EUC-JP

EUC encoding, Japanese

EUC-KR

EUC encoding, Korean

GB2312

GBK, Simplified Chinese

GB18030 National Standard, Chinese
IBM864 PC Arabic

ISO-2022-JP

ISO 2022, Japanese

ISO-2022-KR

ISO 2022, Korean

ISO-8859-1

ISO Latin-1

ISO-8859-2 Central/East European (Slavic)
ISO-8859-3 Southern European
ISO-8859-4 ISO 8859-4, Cyrillic
ISO-8859-5 ISO 8859-5, Cyrillic

ISO-8859-6

Arabic (Logical)
ISO-8859-7 Greek

ISO-8859-8-I

Hebrew (Logical)

ISO-8859-8

Hebrew (Visual)

ISO-8859-9 Turkish

SHIFT_JIS

Shift-JIS, Japanese

TIS-620 TISI, Thai

US-ASCII

US ASCII

UTF-8

ISO 10646/Unicode, one-byte encoding

UTF-16

ISO 10646/Unicode, two-byte encoding

UTF-16BE

Unicode BigEndian

UTF-16LE

Unicode LittleEndian

WINDOWS-874 Thai, Microsoft®

WINDOWS-1252

ISO Latin-1

WINDOWS-1255

Hebrew

WINDOWS-1256

Arabic

X-EUC-JP EUC encoding, Japanese (alias for EUC-JP)
X-SJIS Shift-JIS, Japanese (alias for SHIFT_JIS)

(C) Copyright IBM Corporation 2000, 2005. All Rights Reserved.