Skip to main content

Software  > Globalization > CDRA > 

Globalize your On Demand Business

Appendix C. CCSID Repository

The CCSID Repository contains the information on all of the CCSIDs which are currently registered by IBM.

This appendix contains supporting information for the CCSID Repository. Each CCSID record contains the following fields:

Field Explanation
CCSID

The value of the coded character set identifier (CCSID) assigned by CDRA. Both decimal and hexadecimal forms are shown.

Note: Wherever feasible, CCSID is assigned the same value as its CPGID element, to facilitate coexistence and migration; however, the one-to-one relationship between the two values cannot be maintained in all situations.
Registration Date This field contains the date on which the CCSID became officially registered. Note that this value is not available for some CCSIDs as originally registration dates were not recorded.
ES The value of the encoding scheme identifier (ES), in hexadecimal form. The number of other identifiers associated with ES depends on this value.
MCCSID The value of the Maximal CCSID. This is the CCSID with the largest character repretoire for a particular Code Page. The Maximal CCSID is always equal to or a superset of the CCSID.
CS, CP and CCSID The registered coded graphic character set global identifier (CGCSGID): the GCSGID is in the CS column and the CPGID is in the CP column. Decimal forms are shown. Depending on the ES value, more than one pair of CS and CP may be listed.

When the character set associated with a code page is maximal, the CS value associated with a CP may be entered as X'FFFF' indicating a growing character set. The contents of the associated code page resource definition in that installation implicitly defines the associated maximal character set.

The CCSID column contains the CCSID associated with the corresponding CS, CP pair. This is especially useful when dealing with CCSIDs that contain more than one CS, CP pair.
F/M/S (Size) This indicates whether the character set (CS) is full, maximal, growing or subset.
Full (F)
Uses all the allocated graphic character space in the ES
Maximal (M)
Does not use all the allocated graphic character space, but is the largest (maximal) set for the associated ES at the time of registering the value of CCSID by CDRA
Growing (G)
This indicates that the CCSID has a growing CS. Use the current maximal CS as the character repertoire.
Subset (S)
Does not use all the allocated graphic character space, and is a subset of another full or maximal registered character set. The number of individual graphic characters (excluding the character SPACE, whose GCGID is SP010000) in the character set is shown in parentheses as (Size ).
I/C This indicates whether the CCSID is designated for interoperable (I) use, or is required for coexistence and migration purposes (C).
ACRI This indicates the type of Additional Coding-related Required Information. The format is T-nn, where T is the type of ACRI information, and nn references an entry in the Definition Table for that ACRI Type. See "Additional Coding-Related Required Information" for definitions of the formats for these various ACRI types. The specific ACRI value definitions are found in the CCSID Repository.
  • (P) identifies the ACRI information as PCMB (PC Mixed Byte).
  • (EUC) identifies the ACRI information as EUC
  • (TCP) identifies the ACRI information as TCP.

Control Function Definition The default control function definitions associated with the CCSID. The possible values for each of the Control Function Definitions are found in the CCSID Repository.
  • SP (Space) references an entry in the Space Character Definition Table consisting of the code point value (hex), the width in bytes of the code point, and the state in which this code point of SPACE definition is used.
  • SUB (Substitute) references an entry in the SUB Character Definition Table consisting of the code point value (hex), the width in bytes of the code point, and the state in which this code point of SUB (Substitute) definition is used.
  • NL (New Line) references an entry in the New Line Character Definition Table consisting of the code point value (hex), the width in bytes of the code point, and the state in which this code point of NL (New Line) definition is used.
  • LF (Line Feed) references an entry in the Line Feed Character Definition Table consisting of the code point value (hex), the width in bytes of the code point, and the state in which this code point of LF (Line Feed) definition is used.
  • CR (Carriage Return) references an entry in the Carriage Return Character Definition Table consisting of the code point value (hex), the width in bytes of the code point, and the state in which this code point of CR (Carriage Return) definition is used.
  • EOF (End of File) references an entry in the End of File Character Definition Table consisting of the code point value (hex), value, the width in bytes of the code point, and the state in which this code point of EOF (End of File) definition is used.
Description A description of the CCSID (may contain the language, country or standard that it is used to support).
Notes The Notes field will contain additional information about a CCSID such as CCSIDs it has superceded or if it has been replaced by a newer CCSID.

We're here to help
Easy ways to get the answers you need.
E-mail IBM

Stay informed

Globalization News
Stay informed

Locate IBM globalized products

Product languages
Locate IBM globalized products