IBM Support

IBM Datacap 9.0 Language Support

Product documentation


Abstract

This document provides details about the languages that are supported by the different IBM Datacap Version 9.0 components.

Content

The following tables show the languages that are supported in the corresponding Datacap 9.0 component.

Notes

  • OCR-S/OCR-SR: Nuance engine
  • OCR-A: ABBYY engine
  • OCR-N: NovoDynamics engine
  • ICR-C: RecoStar engine
  • Legal Dict.: OCR-S Legal Dictionary
  • Financial Dict.: OCR-S Financial Dictionary
  • Medical Dict.: OCR-S Medical Dictionary
  • ICR-P: Parascript engine
  • Admin/Install doc.: Administration/installation documentation

Languages:

Afrikaans through Czech

Important:

Support for Arabic requires that customers license NovoDynamics NovoVarus separately and install it on the Rulerunner machine where the Datacap Studio actions for Arabic (Datacap.Libraries.NovoDynamics) will be running.

For Chinese (traditional) OCR-S/OCR-SR support, HKSCS extensions are not supported.

For Chinese (simplified) and Chinese (traditional), OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters.

Table 1

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-N OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Afrikaans Supported Supported
Albanian Supported Supported
Arabic Supported Supported Supported Supported</td><td width=
Bosnian (Latin) Supported
Catalan Supported Supported
Chinese (simplified) Supported Supported Supported Supported Supported Supported
Chinese (traditional) Supported Supported
Croatian Supported Supported Supported Supported Supported Supported
Czech Supported Supported Supported Supported Supported Supported

Table 1 continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Afrikaans Supported Supported
Albanian Supported Supported
Arabic
Bosnian (Latin) Supported
Catalan Supported Supported
Chinese (simplified) Supported Supported
Chinese (traditional) Supported
Croatian Supported Supported
Czech Supported Supported

Back to top

Table 2 Danish through Estonian

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Danish Supported Supported
Dutch Supported Supported Supported Supported Supported Supported Supported Supported
Dutch Belgian Supported
English Supported Supported Supported Supported Supported Supported Supported Supported Supported
Esperanto Supported Supported
Estonian Supported Supported

Table 2 Danish through Estonian continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Danish Supported Supported
Dutch Supported Supported Supported
Dutch Belgian Supported
English Supported Supported Supported Supported Supported Supported
Esperanto Supported
Estonian Supported Supported

Back to top

Table 3 Faroese through Greek

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Faroese Supported Supported
Finnish Supported Supported
French Supported Supported Supported Supported Supported Supported Supported Supported
Gaelic Irish Supported Supported
Gaelic Scottish Supported Supported
German Supported Supported Supported Supported Supported Supported Supported Supported
Greek Supported Supported Supported Supported Supported Supported

Table 3 Faroese through Greek continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Faroese Supported Supported
Finnish Supported Supported
French Supported Supported Supported
Gaelic Irish Supported Supported
Gaelic Scottish Supported
German Supported Supported Supported
Greek Supported Supported

Back to top

Table 4 Hebrew through Norwegian

For Japanese, OCR-A is recommended instead of OCR-S/OCR-SR, because OCR-S confidence calculation might return high confidence for replaced characters.

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Hebrew Supported Supported Supported
Hungarian Supported Supported Supported Supported Supported Supported
Icelandic Supported Supported
Italian Supported Supported Supported Supported Supported Supported
Japanese Supported Supported Supported Supported Supported Supported
Latvian Supported Supported
Lithuanian Supported Supported
Maltese Supported Supported
Norwegian Supported Supported

Table 4 Hebrew through Norwegian continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Hebrew Supported
Hungarian Supported Supported
Icelandic Supported Supported
Italian Supported Supported Supported
Japanese Supported Supported
Latvian Supported Supported
Lithuanian Supported
Maltese Supported
Norwegian Supported Supported Supported

Back to top

Table 5 Polish through Sami Southern

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Polish Supported Supported Supported Supported Supported Supported
Portuguese (Brazil) Supported Supported Supported Supported Supported Supported
Portuguese (Portugal) Supported Supported
Rhaeto-Romanic Supported Supported
Romanian Supported Supported Supported Supported Supported Supported
Russian Supported Supported Supported Supported Supported Supported
Sami Supported Supported
Sami Northern Supported Supported
Sami Southern Supported Supported

Table 5 Polish through Sami Southern continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Polish Supported Supported
Portuguese (Brazil) Supported Supported Supported
Portuguese (Portugal) Supported Supported Supported
Rhaeto-Romanic Supported Supported
Romanian Supported Supported
Russian Supported Supported Supported
Sami
Sami Northern
Sami Southern

Back to top

Table 6 Serbian through Turkish

Language Data Entry Datacap Desktop FastDoc Datacap Web Datacap Navigator OCR-S OCR-SR Legal Dict. Financial Dict. Medical Dict.
Serbian (Cyrillic)* Supported Supported
Serbian (Latin) Supported Supported
Slovak Supported Supported Supported Supported Supported Supported
Slovenian Supported Supported
Spanish Supported Supported Supported Supported Supported Supported
Swahili Supported Supported
Swedish Supported Supported Supported Supported Supported Supported
Turkish Supported Supported Supported Supported Supported Supported

Table 6 Serbian through Turkish continued

Language OCR-A ICR-C ICR-P IBM Content Classification Admin/Install doc. Online Help
Serbian (Cyrillic)*
Serbian (Latin) Supported
Slovak Supported Supported
Slovenian Supported
Spanish Supported Supported Supported
Swahili Supported Supported
Swedish Supported Supported Supported
Turkish Supported Supported

*Important: Datacap Version 9.0 does not expose a user interface to select the Serbian Cyrillic recognition option, but support for Serbian (Cyrillic) is invoked through the implementation of actions in Datacap Studio. See the technical document, Setting the OCR/S recognition language to Serbian (Cyrillic).

Back to top

Document information

More support for: Datacap

Software version: 9.0.0

Operating system(s): Windows

Reference #: 7044111

Modified date: 19 July 2016


Translate this page: