Basic Encoding Set (English Java Runtime Version)

Canonical Name          Alias                               Description

ASCII                   us-ascii                            American Standard Code for Information Interchange
Cp1252                  windows-1252                        Windows Latin-1
ISO8859_1               iso-8859-1                          ISO 8859-1, Latin alphabet No. 1
UnicodeBig              unicodeFFFE                         Sixteen-bit Unicode Transformation Format, big-endian byte order, with byte-order mark
UnicodeBigUnmarked                                          Sixteen-bit Unicode Transformation Format, big-endian byte order
UnicodeLittle                                               Sixteen-bit Unicode Transformation Format, little-endian byte order, with byte-order mark
UnicodeLittleUnmarked                                       Sixteen-bit Unicode Transformation Format, little-endian byte order
UTF8                    utf-8                               Eight-bit Unicode Transformation Format
UTF-16                  utf-16                              Sixteen-bit Unicode Transformation Format, byte order specified by a mandatory initial byte-order mark
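The byte-order-mark distinctions in the descriptions above can be observed directly by encoding a single character and printing the bytes. The following is a minimal sketch, assuming a Java 7 or later runtime. It uses the java.nio.charset.StandardCharsets constants rather than the canonical names listed above, on the assumption that UTF_16BE and UTF_16LE correspond to UnicodeBigUnmarked and UnicodeLittleUnmarked. The class name BasicEncodingDemo and the method hex are hypothetical and exist only for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class BasicEncodingDemo {

    // Encodes the text with the given charset and returns the bytes as
    // space-separated uppercase hex pairs, e.g. "FE FF 00 41".
    static String hex(String text, Charset charset) {
        StringBuilder sb = new StringBuilder();
        for (byte b : text.getBytes(charset)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // UTF-16 writes a big-endian byte-order mark (FE FF) when encoding.
        System.out.println(hex("A", StandardCharsets.UTF_16));          // FE FF 00 41
        // The explicit-endian variants write no byte-order mark.
        System.out.println(hex("A", StandardCharsets.UTF_16BE));        // 00 41
        System.out.println(hex("A", StandardCharsets.UTF_16LE));        // 41 00
        // U+00E9 takes one byte in ISO 8859-1 and two bytes in UTF-8.
        System.out.println(hex("\u00E9", StandardCharsets.ISO_8859_1)); // E9
        System.out.println(hex("\u00E9", StandardCharsets.UTF_8));      // C3 A9
        // An alias resolves to the runtime's own canonical name.
        System.out.println(Charset.forName("us-ascii").name());         // US-ASCII
    }
}
```

Note that the name reported by Charset.name() is the java.nio canonical name, which may differ in spelling from the canonical names in the table.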

 

Extended Encoding Set (International Java Runtime Version)

Canonical Name          Alias                               Description

Big5                                                        Big5, Traditional Chinese
Big5_HKSCS                                                  Big5 with Hong Kong extensions, Traditional Chinese
Big5_Solaris                                                Big5 with seven additional Hanzi ideograph character mappings for the Solaris zh_TW.BIG5 locale
Cp037                   ebcdic-cp-us                        USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
Cp273                   x-EBCDIC-Germany                    IBM Austria, Germany
Cp277                   x-EBCDIC-DenmarkNorway              IBM Denmark, Norway
Cp278                   x-EBCDIC-FinlandSweden              IBM Finland, Sweden
Cp280                   x-EBCDIC-Italy                      IBM Italy
Cp284                   X-EBCDIC-Spain                      IBM Catalan/Spain, Spanish Latin America
Cp285                   x-EBCDIC-UK                         IBM United Kingdom, Ireland
Cp297                   x-ebcdic-france                     IBM France
Cp420                   x-EBCDIC-Arabic                     IBM Arabic
Cp424                   x-EBCDIC-Hebrew                     IBM Hebrew
Cp437                   Cp437                               MS-DOS United States, Australia, New Zealand, South Africa
Cp500                                                       EBCDIC 500V1
Cp737                   ibm737                              PC Greek
Cp775                   ibm775                              PC Baltic
Cp838                                                       IBM Thailand extended SBCS
Cp850                   Cp850                               MS-DOS Latin-1
Cp852                   Cp852                               MS-DOS Latin-2
Cp855                   x-EBCDIC-cyrillic                   IBM Cyrillic
Cp856                   x-EBCDIC-Hebrew                     IBM Hebrew
Cp857                   x-EBCDIC-Turkish                    IBM Turkish
Cp858                                                       Variant of Cp850 with Euro character
Cp860                   Cp860                               MS-DOS Portuguese
Cp861                   Cp861                               MS-DOS Icelandic
Cp862                   ibm862                              PC Hebrew
Cp863                   Cp863                               MS-DOS Canadian French
Cp864                   ibm864                              PC Arabic
Cp865                   Cp865                               MS-DOS Nordic
Cp866                   cp866                               MS-DOS Russian
Cp868                   Cp868                               MS-DOS Pakistan
Cp869                   ibm869                              IBM Modern Greek
Cp870                                                       IBM Multilingual Latin-2
Cp871                   x-ebcdic-icelandic                  IBM Iceland
Cp874                   x-EBCDIC-Thai                       IBM Thai
Cp875                   x-EBCDIC-Greek                      IBM Greek
Cp918                                                       IBM Pakistan (Urdu)
Cp921                                                       IBM Latvia, Lithuania (AIX, DOS)
Cp922                                                       IBM Estonia (AIX, DOS)
Cp930                                                       Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
Cp933                                                       Korean Mixed with 1880 UDC, superset of 5029
Cp935                                                       Simplified Chinese Host mixed with 1880 UDC, superset of 5031
Cp937                                                       Traditional Chinese Host mixed with 6204 UDC, superset of 5033
Cp939                                                       Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
Cp942                                                       IBM OS/2 Japanese, superset of Cp932
Cp942C                                                      Variant of Cp942
Cp943                                                       IBM OS/2 Japanese, superset of Cp932 and Shift-JIS
Cp943C                                                      Variant of Cp943
Cp948                                                       OS/2 Chinese (Taiwan) superset of 938
Cp949                   cp949                               PC Korean
Cp949C                  cp949c                              Variant of Cp949
Cp950                   Cp950                               PC Chinese (Hong Kong, Taiwan)
Cp964                                                       AIX Chinese (Taiwan)
Cp970                                                       AIX Korean
Cp1006                                                      IBM AIX Pakistan (Urdu)
Cp1025                  x-EBCDIC-CyrillicSerbianBulgarian   IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovina, Macedonia (FYR)
Cp1026                                                      IBM Latin-5, Turkey
Cp1046                                                      IBM Arabic - Windows
Cp1097                                                      IBM Iran (Farsi)/Persian
Cp1098                                                      IBM Iran (Farsi)/Persian (PC)
Cp1112                                                      IBM Latvia, Lithuania
Cp1122                                                      IBM Estonia
Cp1123                                                      IBM Ukraine
Cp1124                                                      IBM AIX Ukraine
Cp1140                  x-ebcdic-cp-us-euro                 Variant of Cp037 with Euro character
Cp1141                  x-EBCDIC-Germany-euro               Variant of Cp273 with Euro character
Cp1142                  x-EBCDIC-DenmarkNorway-euro         Variant of Cp277 with Euro character
Cp1143                  x-ebcdic-finlandsweden-euro         Variant of Cp278 with Euro character
Cp1144                  x-ebcdic-italy-euro                 Variant of Cp280 with Euro character
Cp1145                  X-EBCDIC-Spain-euro                 Variant of Cp284 with Euro character
Cp1146                  x-EBCDIC-UK-euro                    Variant of Cp285 with Euro character
Cp1147                  x-ebcdic-france-euro                Variant of Cp297 with Euro character
Cp1148                                                      Variant of Cp500 with Euro character
Cp1149                  x-ebcdic-icelandic-euro             Variant of Cp871 with Euro character
Cp1250                  windows-1250                        Windows Eastern European
Cp1251                  windows-1251                        Windows Cyrillic
Cp1253                  windows-1253                        Windows Greek
Cp1254                  windows-1254                        Windows Turkish
Cp1255                  windows-1255                        Windows Hebrew
Cp1256                  windows-1256                        Windows Arabic
Cp1257                  windows-1257                        Windows Baltic
Cp1258                  windows-1258                        Windows Vietnamese
Cp1381                                                      IBM OS/2, DOS People's Republic of China (PRC)
Cp1383                                                      IBM AIX People's Republic of China (PRC)
Cp33722                                                     IBM-eucJP - Japanese (superset of 5050)
EUC_CN                                                      GB2312, EUC encoding, Simplified Chinese
EUC_JP                                                      JIS X 0201, 0208, 0212, EUC encoding, Japanese
EUC_KR                                                      KS C 5601, EUC encoding, Korean
EUC_TW                                                      CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
GB18030                                                     Simplified Chinese, PRC standard
GBK                                                         GBK, Simplified Chinese
ISCII91                                                     ISCII91 encoding of Indic scripts
ISO2022CN                                                   ISO 2022 CN, Chinese (conversion to Unicode only)
ISO2022CN_CNS                                               CNS 11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only)
ISO2022CN_GB                                                GB 2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only)
ISO2022JP               iso-2022-jp                         JIS X 0201, 0208 in ISO 2022 form, Japanese
ISO2022KR               iso-2022-kr                         ISO 2022 KR, Korean
ISO8859_2               iso-8859-2                          ISO 8859-2, Latin alphabet No. 2
ISO8859_3               iso-8859-3                          ISO 8859-3, Latin alphabet No. 3
ISO8859_4               iso-8859-4                          ISO 8859-4, Latin alphabet No. 4
ISO8859_5               iso-8859-5                          ISO 8859-5, Latin/Cyrillic alphabet
ISO8859_6               iso-8859-6                          ISO 8859-6, Latin/Arabic alphabet
ISO8859_7               iso-8859-7                          ISO 8859-7, Latin/Greek alphabet
ISO8859_8               iso-8859-8                          ISO 8859-8, Latin/Hebrew alphabet
ISO8859_9               iso-8859-9                          ISO 8859-9, Latin alphabet No. 5
ISO8859_13              iso-8859-13                         ISO 8859-13, Latin alphabet No. 7
ISO8859_15_FDIS         iso-8859-15-FDIS                    ISO 8859-15, Latin alphabet No. 9
JIS0201                                                     JIS X 0201, Japanese
JIS0208                                                     JIS X 0208, Japanese
JIS0212                                                     JIS X 0212, Japanese
JISAutoDetect                                               Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)
Johab                                                       Johab, Korean
KOI8_R                  koi8-r                              KOI8-R, Russian
MS874                   windows-874                         Windows Thai
MS932                   windows-932                         Windows Japanese
MS936                   windows-936                         Windows Simplified Chinese
MS949                   windows-949                         Windows Korean
MS950                   windows-950                         Windows Traditional Chinese
MacArabic               x-mac-arabic                        Macintosh Arabic
MacCentralEurope                                            Macintosh Latin-2
MacCroatian             x-mac-croatian                      Macintosh Croatian
MacCyrillic             x-mac-cyrillic                      Macintosh Cyrillic
MacDingbat              x-mac-dingbat                       Macintosh Dingbat
MacGreek                x-mac-greek                         Macintosh Greek
MacHebrew               x-mac-hebrew                        Macintosh Hebrew
MacIceland              x-mac-icelandic                     Macintosh Iceland
MacRoman                x-mac-roman                         Macintosh Roman
MacRomania              x-mac-romanian                      Macintosh Romania
MacSymbol               x-mac-symbol                        Macintosh Symbol
MacThai                 x-mac-thai                          Macintosh Thai
MacTurkish              x-mac-turkish                       Macintosh Turkish
MacUkraine              x-mac-ukrainian                     Macintosh Ukraine
SJIS                                                        Shift-JIS, Japanese
TIS620                                                      TIS620, Thai
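
Because the extended set is only present in the international version of the Java runtime, it can be useful to probe for an encoding before relying on it. The following is a minimal sketch using the standard java.nio.charset.Charset API. The class name ExtendedEncodingLookup and the method charsetOrFallback are hypothetical, and the names passed in main are simply taken from the table; whether a given runtime recognizes each spelling is not guaranteed. Charset.canEncode() is printed as well, since some entries are described as converting in one direction only.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only.
class ExtendedEncodingLookup {

    // Returns the named charset if this runtime provides it; otherwise
    // (unsupported or syntactically illegal name) returns the fallback.
    static Charset charsetOrFallback(String name, Charset fallback) {
        try {
            return Charset.isSupported(name) ? Charset.forName(name) : fallback;
        } catch (IllegalCharsetNameException e) {
            return fallback;
        }
    }

    public static void main(String[] args) {
        // A few names from the extended set; availability depends on the runtime.
        String[] names = {"Big5", "Cp037", "EUC_JP", "ISO2022CN", "JISAutoDetect", "SJIS"};
        for (String name : names) {
            if (Charset.isSupported(name)) {
                Charset cs = Charset.forName(name);
                System.out.println(name + " -> " + cs.name()
                        + ", can encode: " + cs.canEncode());
            } else {
                System.out.println(name + " is not available in this runtime");
            }
        }
        // Fall back to UTF-8 when the requested encoding is missing.
        Charset chosen = charsetOrFallback("Cp1140", StandardCharsets.UTF_8);
        System.out.println("Using " + chosen.name());
    }
}
```

Falling back silently is only one possible policy; an application that must read legacy data in a specific code page may prefer to report the missing encoding instead.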


Copyright 2002-2006 Sferyx Srl. All rights reserved. Sferyx and the Sferyx logo are registered trademarks of Sferyx Srl. http://www.sferyx.com