Encoding Scheme Identifiers

The following table contains the basic definitions for the Encoding Scheme identifiers (ESID) referenced in the CCSID documents. For a more complete explanation of encoding schemes refer to the material on encoding schemes found in the CDRA Reference document.


ESID (hex) Interpretation Number of States
1100 EBCDIC, SBCS. No code extension is allowed 1
2100 IBM-PC Data, SBCS. No code extension is allowed 1
3100 IBM-PC Display, SBCS. No code extension is allowed 1
4100 ISO 8, SBCS. No code extension is allowed 1
4105 ISO 8 (ASCII code), SBCS, Graphics in C1. Note that graphic characters may be present in the area normally reserved for the C1 control codes. (ie X'80' to X'9F') 1
4155 ISO 8 Presentation (ASCII code), SBCS, Graphics in C0 and C1 1
5100 ISO 7 (ASCII code), SBCS, No code extension is allowed 1
5150 ISO 7 Presentation (ASCII code), SBCS, Graphics in C0 1
6100 EBCDIC Presentation, SBCS. No code extension is allowed 1
8100 8 bit, SBCS, used with a 7-bit code page, characters are shifted into the right hand side of the encoding space, used only for single-byte EUC G-sets when each G-set is treated as a standalone code. 1
1200 EBCDIC, DBCS, No code extension is allowed 1
2200 IBM-PC Data, DBCS, No code extension is allowed 1
3200 IBM-PC Display, DBCS, No code extension is allowed 1
5200 ISO 7 (ASCII code), DBCS, No code extension is allowed 1
6200 EBCDIC DBCS presentation. 1
7200 Unicode, UCS-2, including UTF-16 to allow for support of surrogates, Big Endian order. No code extension is allowed 1
7209 Unicode pure double-byte. Used for any standalone, individual Unicode plane 1
720B Unicode, UCS-2, including UTF-16 to allow for support of surrogates, Little Endian order. No code extension is allowed 1
720F Unicode, UCS-2, including UTF-16 to allow for support of surrogates, endianness is determined by byte order mark (BOM), assumed to be big endian in the absence of BOM. No code extension is allowed 1
8200 Unicode Display 1
9200 8 bit, DBCS, used with a 7-bit code page, characters are shifted into the right hand side of the encoding space, used only for double-byte EUC G-sets when each G-set is treated as a standalone code. 1
1301 EBCDIC, Mixed single-byte and double-byte, using SO/SI code extension method 1
2300 IBM-PC Data, Mixed single-byte and double-byte, with implicit code extension 1
2305 PC Data, Mixed single-byte and double-byte, implicit code extension, single-byte is Windows encoding 1
3300 IBM-PC Display, Mixed single-byte and double-byte, with implicit code extension 1
4403 IBM EUC 2-4
5404 ISO 2022 TCP/IP using ESC sequences to designate codesets to G0 2-4
5409 ISO 2022 TCP/IP using SO/SI 2
540A ISO 2022 TCP/IP using SO, SI, SS2, SS3 2-3
7500 Unicode UTF-32, Big Endian order. No code extension is allowed 1
750B Unicode UTF-32, Little Endian order. No code extension is allowed 1
750F Unicode UTF-32, endianness is determined by byte order mark (BOM), assumed to be big endian in the absence of BOM. No code extension is allowed 1
5700 ISO 7 Triple-Byte Code Set, No code extension is allowed 1
1808 UTF-EBCDIC, UCS-2 transform, as defined in Unicode Technical Report 16. 1
7807 UTF-8, UCS-2 transform, No code extension is allowed 1
780D Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8), as defined in Unicode Technical Report #26 1
2900 PC Data, fixed 4-byte 1
2A00 PC Data, mixed single-, double- and four-byte (Note: IBM PC or Windows code pages may be used as the single-byte component of a CCSID using this ESID.) 3
7B0C Standard Compression Scheme for Unicode (SCSU) as defined in Unicode Technical Standard 6.
7B0E Binary Ordered Compression for Unicode (BOCU-1) as defined in Unicode Technical Note #6
Fxxx Private use. User-defined encoding scheme
xFxx Private use. User-defined encoding scheme
xxFE Private use. User-defined encoding scheme

Contact IBM

Need assistance with your globalization questions?