Skip to main content

Software  > Globalization > CDRA > 

Globalize your On Demand Business

Glossary
spacer image
spacer image

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z


spacer image
A
ACRI (Ref #7.)
Additional Coding-related Required Information. A CDRA term referring to the information, in addition to encoding scheme identifier, code page and character set global identifiers, that is required to complete the definition associated with using particular encoding schemes. An example is the ranges of valid first bytes of double-byte code points in a PC Mixed single-byte and double-byte code.
ACRI-PCMB (Ref #8.)
Additional Coding-Related Required Information - PC Mixed Byte A CDRA identifier that defines the ranges of valid first bytes of double byte code points in a PC Mixed SB/DB encoding scheme.
AM (Ref #9.)
SAA Application Manager Interface. This interface, together with the SAA Presentation Manager Interface, provides the facilities that allow the end-user to interface with applications.
API
See application programming interface.
APL
See A Programming Language. programming
APL (Ref #10.)
A Programming Language. One of the many programming languages supported on IBM systems in many countries. The APL character set consists of several mathematical symbols as part of its syntactical set, and makes it a special set from coded character set support point of view.
application programming interface (Ref #11.)
A formally defined programming language interface between an IBM system control program or a licensed program, and the user of the program.
Arabic shapes for numerals (Ref #12.)
The shapes 1, 2, 3, 4, 5, 6, 7, 8, 9, and 0. See also :1 id=glhsfn.Hindi shapes for numerals.
ASCII (Ref #13.)
American Standard Code for Information Interchange. The standard code, using a coded set consisting of 7-bit coded characters (8 bits including parity check), used for information interchange among data processing systems, data communication systems, and associated equipment. The ASCII set consists of control characters and graphic characters.

B
beyond 190 support (Ref #14.)
Support of more than 190 graphic characters in a single-byte EBCDIC code. The term encompasses the support of more than a single code page in a single-byte encoding scheme. An example is the support of a symbol set in addition to the base alphabet of a country in text processing applications.
bit combination (Ref #15.)
An ordered set of bits that represents a coded character or is used as part of the representation of coded character. Synonymous with code point. (ISO/IEC 6429)
blue layer (Ref #16.)
A collection of software components that together provide the traditional operating system services in a System/370.

C
CCITT (Ref #17.)
Consultative Committee for International Telephone and Telegraph.
CCS
See Common Communications Support.
CCSID Resource (Ref #18.)
A representation of the various elements associated with a CCSID in a system in a machine readable form.
CCSID Resource Repository (Ref #19.)
An organized collection of CCSID resources that are maintained by a service provider in a system.
CDRA
See Character Data Representation Architecture.
CDRA identifiers (Ref #20.)
Identifiers defined in CDRA for uniquely identifying graphic character representation. CCSID is the short form. The set of ESID, CS, CP, and ACRI is the long form.
CECP (Ref #21.)
Country Extended Code Pages. A number of single-byte EBCDIC coded character sets in the IBM corporate registry. Each code page contains the same character set, with an identifier of 00697 (also known as Latin Alphabet Number 1), but with different code point allocations.
Character Data Representation Architecture (Ref #22.)
An IBM architecture that defines a set of identifiers, services, and conventions to achieve a consistent representation, processing, and interchange of graphic character data in SAA environments.
code (Ref #23.)
A system of bit patterns to which a specific graphic or a control meaning has been assigned.
code extension method (Ref #24.)
A method prescribed in an encoding scheme for representing characters that cannot be accommodated within the limits of the basic structure of the code. It prescribes a method to alter the interpretation of one or more code points that follow a prescribed single control character or a control sequence. See also :1 id=glcdpgs.code page switching.
Coded Graphic Character Set Global Identifier (Ref #25.)
A concatenation of GCSGID and CPGID. It identifies the code point assignments in a code page of a specific graphic character set, from among all the graphic characters that may be assigned in a code page. Depending on the representation used for GCSGID and CPGID, the CGCSGID can be expressed either as a ten-digit decimal number, an eight-digit hexadecimal number, or a four-byte binary number.
code page (Ref #26.)
A specification of code points from a defined encoding structure for each graphic character in a set or in a collection of graphic character sets. Within a code page, a code point can have only one specific meaning.
Code Page Global Identifier (Ref #27.)
A number between 00001 and 65534 that is assigned to identify a code page. It may be expressed as a five-digit decimal number, a four-digit hexadecimal number, or a two-byte binary number.
code page switching (Ref #28.)
Changing the code page being used to another one. The switching can be static (code page selection, on a per session or per application basis), or dynamic (each character in a character string can be from a different code page). An example of static switching is the selection of code page 00437 or 00850 in OS/2. An example of dynamic switching is the switching to code page 00259 to pick up a scientific symbol in a Document Content Architecture (DCA) text data stream. ISO/IEC 2022 specifies standardized code-page-switching designation and invocation sequences.
code point (Ref #29.)
A unique bit pattern defined in a code. Depending on the code, a code point can be 7-bits, 8-bits, 16-bits, or other. Code points are assigned graphic characters in a code page. See also :1 id=glbtcmbbit combination.
coded character set (Ref #30.)
A set of unambiguous rules that establish a character set and the one-to-one relationships between the characters of the set and their coded representations. (ISO)
Coded Character Set Identifier (Ref #31.)
A 16-bit number identifying a specific set of encoding scheme identifier, character set identifier(s), code page identifier(s), and additional coding-related required information, that uniquely identifies the coded graphic character representation used.
coded graphic character set (Ref #32.)
A set of graphic characters with their assigned code points.
command (Ref #33.)
End-user command (MRI).
common application code page (Ref #34.)
The concept of applications using a single or a limited number of coded character sets, whereas the devices and other system elements may use different specific code pages. One or more service modules are expected to bridge the differences.
component (Ref #35.)
A hardware or software entity forming part of a system, or a piece of logic that controls the operation of a device. modifies, or stops a control function.
control function (Ref #36.)
An element of a character set that affects the recording, processing, transmission, or interpretation of data, and that has a coded representation of one or more bit combinations. (ISO/IEC 6429)
CP (Ref #37.)
A synonym (used in this document) for CPGID. See :1 id=glcdpgiCode Page Global Identifier.
CP Resource (Ref #38.)
A representation of the various elements associated with a CPGID in a machine readable form in a system.
conversion (Ref #39.)
Pertains to replacing a code point assigned to a character in one code with its corresponding code point assigned in another code. The term conversion does not include "translation" from one national language to another, or the "transformation" between different types of data such as between floating point and integer numbers.
conversion method (Ref #40.)
An algorithm used during conversion. It includes the necessary logic to separate the input code point string into appropriate substrings, converting the substrings and assembling the resultant substrings, for a particular set of criteria to be used during conversion. A conversion method may use associated conversion tables as resources during the conversion.
conversion table (Ref #41.)
A resource used with a conversion method to perform conversion. Typically, a conversion table contains a set of input code point values corresponding to a given set of input code point values. Its structure and contents are designed to suit the conversion algorithm with which it is to be used.
CPI
See Common Programming Interface.
CS (Ref #42.)
A synonym for GCSGID. See Graphic Character Set Global Identifier.
CUA
See Common User Access.

D
database (Ref #43.)
A collection of data fundamental to a system. An organized collection of user information that can be methodically created, updated, or retrieved. The database organization is usually defined and tailored to meet the specific needs of the user.
Data Description and Conversion (Ref #44.)
An architecture that facilitates description of the logical structure and type of data as stored in a data store or as viewed by an application.
data integrity boundaries (Ref #45.)
System boundaries within which different levels of graphic-character-data integrity can be assured. For example, the Syntactic Character Set, being part of most coded character sets, can be assured across most systems. Latin Alphabet Number 1 can be assured within and between systems fully supporting character set 00697. The data integrity of a fixed set of Kanji (and other) characters can be assured within and between systems supporting the DBCS character set 00370.
data stream (Ref #46.)
A continuous stream of data elements being transmitted, or intended for transmission, in character or binary-digit form, using a defined format.
data type (Ref #47.)
A classification of data into different types having some bearing on proper handling of that data. Floating point, Integer, Binary, and Character string are some examples.
DCE (Ref #48.)
Data Conversion Engine.
DCF (Ref #49.)
Document Composition Facility. An IBM program product for composing documents using SCRIPT mark-up language.
DCS (Ref #50.)
Data Conversion Services
DD&C
See Data Description and Conversion.
DDM
See Distributed Data Management.
Distributed Data Management (Ref #51.)
An SAA Common Communications Support architecture for distributed data supporting both distributed file management products and distributed relational database products following DRDA.
DFM (Ref #52.)
Distributed File Management
Distributed Relational Database Architecture (Ref #53.)
A protocol that allows applications to access data from remote databases.
double-byte (coded) character set (Ref #54.)
A coded character set in which each character is represented by a double-byte code point. Some character sets, such as Kanji, which is used in Japan, are too rich in symbols to be able to represent all the characters using single-byte codes. A double-byte coded character set is used to represent the symbols that make up such large character sets.
double-wide character (Ref #55.)
A character, such as a Kanji ideogram, that requires twice the nominal width of other characters, such as the letter A, for the character to be legible on a display screen or a printer.
DRDA
See Distributed Relational Database Architecture.

E
ECW
See Element Control Word.
element control word (Ref #56.)
a 32-bit structure that is used in the data structure of CCSID element definitions to specify the element identifier, the maximum length of the structure, the actual length used and a few relevant control flags.
electronic font printer (Ref #57.)
A printer using a technique in which the character image is formed by a matrix of pixels.
encoding boundary (Ref #58.)
A set of system components within which the same encoding scheme for character representation is used. For example, a number of interconnected Personal Computers all using IBM-PC Data code, pure SBCS, no code extensions, character set 00697, and code page 00850. However, an interchange between the PC and a System/370 computer would involve crossing the encoding boundary.
end-user (Ref #59.)
A person, process, program, device, or system that employs a user application network for the purpose of data processing and information exchange.
ES
A synonym for ESID. See Encoding Scheme Identifier.
even parity bit (Ref #60.)
A check bit that is usually generated or included in a parity-checking algorithm to make the total number of bits in a bit pattern an even number. See also odd parity bit.
extended coded graphic character set (Ref #61.)
A coded graphic character set that is used as an addition or extension to another coded graphic character set. This character set is accessed using a code extension method defined by the encoding scheme with which it is used. For example, a set of coded chemical symbols used as a supplement to the coded alphabets in text processing.

F
folding (Ref #62.)
The substitution of one character for another, generally to map a larger character set into a subset. Folding may result in the loss of information. For example, it allows the printing of uppercase graphic characters when lowercase characters are not available in the character array on the printer chain or train. See also mono-casing.
full character set (Ref #63.)
The maximal character set of a code page such that there are no more unassigned graphic code points remaining in the associated encoding scheme. No other larger character set can be represented in that code page. For example, CS 697 (the maximal character set of CP 500 in encoding scheme ES 1100), contains 190 graphic characters and is assigned all the 190 available graphic code points in ES 1100. See also maximal character set and subset character set.

G
GCCASN
See Graphic Character Conversion Alternative Selection Number.
GCCST
See Graphic Character Conversion Selection Table.
GCID (Ref #64.)
Another name for CGCSGID. See Coded Graphic Character Set Global Identifier.
GML
See Generic Markup Language.
Generic Markup Language (Ref #65.)
An IBM-markup language used with the Document Composition Facility for creating publishing quality documents.
graphic character (Ref #66.)
(1) (ISO 646-1983) A character other than a control function that has a visual representation normally handwritten, printed, or displayed.
(2) A character that can be displayed or printed.
(3) A graphic symbol such as a numeric, alphabetic, or special character, or ideogram.
Graphic Character Conversion Alternative Selection Number (Ref #67.)
A parameter of a function call to a graphic character data conversion process that facilitates selecting a specific conversion method and associated conversion tables from different alternatives.
Graphic Character Conversion Selection Table (Ref #68.)
A table used in the graphic character data conversion process to manage the access to the various conversion methods and associated conversion tables under its sphere of control.
graphic character set (Ref #69.)
A defined set of graphic characters. No coded representation is assumed.
Graphic Character Set Global Identifier (Ref #70.)
A number between 00001 and 65534 that is assigned to identify a graphic character set. It may be expressed as a five-digit decimal number, a four-digit hexadecimal number, or a two-byte binary number.

H
hard-coded (Ref #71.)
A value coded in a program as a constant (as opposed to a variable).
Hindi shapes for numerals (Ref #72.)
A set of shapes (for numeric digits 0 to 9) used in many Arabic countries instead of the more familiar "Arabic" ones. See also :1 id=glarbsh.Arabic shapes for numerals.
HLL (Ref #73.)
High-Level Language.

I
identity map (Ref #74.)
A special case of code point conversion in which all input code points are equal to the output code points, thus eliminating the need for a conversion. When converting data from one CCSID to another using the round trip criterion, if the CCSIDs share the same CPGID, an identity mapping condition exists.
IEC
See International Electrotechnical Commission
invariant character set (Ref #75.)
(1) A character set such as the Syntactic Character Set, whose code point assignments do not change from code page to code page.
(2) A minimum set of characters that is available as part of all character sets.
International Electrotechnical Commission (Ref #76.)
An international standards writing body.
International Organization for Standardization (Ref #77.)
An international standards writing body.
ISO (Ref #78.)
International Organization for Standardization; an international standards writing body
ISO environment (Ref #79.)
A coding structure defined in ISO 2022 that uses single (or multiple) septet(s) (7-bit) or octet(s) (8-bit) per code point, with or without code extension controls.
ISO/IEC (Ref #80.)
The prefix used for standards that are produced jointly by ISO and IEC.
ITA-2 (Ref #81.)
International Telegraphic Alphabet Number 2. A CCITT-defined coded character set used in the international Telex communication services, worldwide.
IWS (Ref #82.)
Intelligent Work Station.

K
Katakana (Ref #83.)
A set of phonetic symbols (syllabary) used in Japanese. It is often used to express foreign words phonetically.

L
Latin alphabet (Ref #84.)
An alphabet comprising the letters a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, v, w, x, y, and z in uppercase and lowercase, in some cases with accents on certain characters.
Latin Alphabet Number 1 (Ref #85.)
A set of 190 graphic characters and SPACE defined in ISO 8859-1, containing the alphabets of Western European languages. This character set is identical to the character set 00697 (revised) in the IBM registry. Many corresponding national standards also specify this set. Latin Alphabet Numbers 2, 3, and 4 are defined in other parts of ISO 8859.
lowercase (Ref #86.)
Pertaining to the small alphabetic characters (accented or not) as distinguished from the capital alphabetic characters; for example a, b, and c, in the Latin alphabet. The concept of case also applies to other alphabets such as Cyrillic and Greek, but not to Arabic, Hebrew, Thai, and others. See also uppercase.

M
machine readable information (Ref #87.)
All textual information contained in a program such as a system control program, an application program, or microcode. MRI includes all information that is presented to or received from a user interacting with a system. This includes menus, prompts, messages, report headings, commands, and responses. MRI may appear on printers or on display panels. Contrast with machine readable material (MRM), which includes MRI, executable code, and constants.
maximal character set (Ref #88.)
The largest registered character set that is assigned to a registered code page following a particular encoding scheme. If this character set is not also a full character set, a future larger character set can become the new maximal character set, making the current one a subset character set. For example, CS 1172 (Katakana Extended, for Japan) contains only 162 graphic characters, and is the maximal registered character set for CP 1172 using ES 1100 (potential maximum being 190 graphic code points). See also :1 id=glsubcs.subset character set and :1 id=glfulcs.full character set.
Message Handling Services
A set of CCITT recommendations in the series X.400 that define the elements of services to provide generation, interchange, and reception of "messages", using public telecommunication facilities.
MFI (Ref #89.)
Main Frame Interactive (workstation).
MHS
See Message Handling Services.
mono-casing (Ref #90.)
The translation of alphabetic characters from one case (usually the lowercase) to their equivalents in another case (usually the uppercase). See also folding.
MRI
See machine readable information.

N
national language support (Ref #91.)
The ability for users to communicate with products in the language most familiar to them.
national use graphics (Ref #92.)
Graphic characters on a code page not forming part of the invariant character set.
nibble (Ref #93.)
A bit-pattern consisting of four bits.
NLIDG (Ref #94.)
National Language Information and Design Guide.
NLS
See National language support
non-Latin-based alphabet (Ref #95.)
An alphabet comprising letters other than the Latin-based ones, such as those used in Greek and Arabic.
Normalization Support CCSID Table (Ref #96.)
A table containing a default CCSID value associated with a pair of CCSIDs, which will be used to normalize two strings (that are coded in two different CCSIDs), before a string operation such as concatenation, comparison, or others is performed with the two strings.
NPT (Ref #97.)
Non-Programmable Terminal.
NSCT
See Normalization Support CCSID Table.

O
octet (Ref #98.)
An 8-bit byte.
odd parity bit (Ref #99.)
A check bit that is usually generated or included in a parity-checking algorithm to make the total number of bits in a bit pattern an odd number. See also :1 id=glevpbt.even parity bit.
operator (Ref #100.)
The system operator.
OSI (Ref #101.)
Open System Interconnection. An ISO model for interconnection of heterogeneous systems participating in an open environment.

P
PM (Ref #102.)
Presentation Manager.
presentation (Ref #103.)
Printing or displaying.
product (Ref #104.)
Hardware or software or any combination marketed as an individual unit, which can be separately ordered by a customer.
PWS (Ref #105.)
Programmable Work Station.

Q
QM (Ref #106.)
Query Manager.

R
RDCT
See Related Default CCSID Table.
Related Default CCSID Table (Ref #107.)
A table containing a default CCSID associated with another CCSID and an ESID; this default CCSID is considered to be the nearest equivalent of its associated CCSID based on some relationship between the two.
RFTDCA (Ref #108.)
Revisable Form Text - Document Content Architecture. An IBM architecture used to represent documents for interchanging between text-editing products such as displaywriter.

S
section (Ref #109.)
A synonym for a ward. Also known as a DBCS section.
septet (Ref #110.)
A 7-bit byte.
session (Ref #111.)
The period of time between logon and logoff without re-IPL.
SGML
See Standardized Generic Markup Language.
SIRS (Ref #112.)
Standards Information Retrieval System.
Standardized Generic Markup Language (Ref #113.)
A markup language for document representation, as specified in ISO 8879, aimed at the text processing applications in the publishing industry.
SNA (Ref #114.)
System Network Architecture. An architecture under Common Communications Support, for networking systems.
SPA (Ref #115.)
Standards Project Authority.
special character (Ref #116.)
A graphic character that is not a letter, a digit, or a space character.
SQL (Ref #117.)
Structured Query Language. A language that can be used within programs written in other languages, or interactively to access database-manager data and to control access to database-manager resources.
subset character set (Ref #118.)
(1) A set of characters that is completely contained in another larger set of characters.
(2) From among all the character sets that can be represented in a single code page, each character set except the largest is a subset character set in that code page. For example, CS 101 (94 characters) is a subset character set in any one of the Country-Extended Code Pages that have CS 697 (190 characters) as the maximal character set. See also full character set and maximal character set.
Syntactic character set (Ref #119.)
A set of 81 graphic characters registered in the IBM registry as character set 00640. This set was originally recommended to the programming language community to be used for syntactic purposes towards maximizing portability and interchangeability across systems and country boundaries. It is contained in most of the primary registered character sets, with a few exceptions. See also :1 id=glinvcs.invariant character set.
system (Ref #120.)
A collection of interconnected hardware and software organized to accomplish a set of specific functions.

T
tag (Ref #121.)
A mechanism used to identify certain attributes having some bearing on handling of character data. Some examples are character set identifier, code page identifier, language identifier, country identifier, and encoding scheme identifier.

U
UDC
See user-defined characters.
uppercase (Ref #122.)
Pertaining to the capital alphabetic characters (whether accented or not) as distinguished from the small alphabetic characters; for example A, B, or C in the Latin alphabet. The concept of case also applies to alphabets such as Cyrillic and Greek, but not to Arabic, Hebrew, Thai and others. See also lowercase.
user
See end-user.
user-defined characters (Ref #123.)
Characters that are defined by individual users or user organizations, for assignment in code pages. Some standards reserve coding space for user-defined characters. These characters have uniqueness only within the sphere of control of the individual user organization.

V
Va and Va+ (Ref #124.)
(To be read as Five-A and Five-A Plus.) Va refers to Table Va in an older version of C-S 3-3220-002. This table assigned different national use characters to 13 code points, to be used with the SBCS-EBCDIC invariant set, in typical data processing applications. Some countries have extended these coded character sets by adding more characters needed to properly support their national language alphabets. Such extended sets are called Va+.

W
ward (Ref #125.)
A section of a double-byte-coded-character set, where the first byte of all the code points belonging in that section are the same. A synonym for a DBCS section. Also known as a DBCS ward.
wysiwyg (Ref #126.)
What-you-see-is-what-you-get. A term used to refer to the characteristics of some text processing applications. Such applications permit the users to visualize and manipulate a replica of the printed document on an appropriate display device.

Back to top


We're here to help
Easy ways to get the answers you need.
E-mail IBM

Stay informed

Globalization News
Stay informed

Locate IBM globalized products

Product languages
Locate IBM globalized products