Planning for installation

Before you run the IBM® Content Classification installation program, ensure that your system meets the requirements. You might also want to gather information about your system that the installation program needs.

For performance and scalability, you can run the installation program on multiple computers. When you run the installation program, you can install all of the Content Classification components or select the components that you want to install.

The Content Classification components include:
IBM Content Classification Server
The server component provides classification services for client applications. To administer the system, you use the Management Console to:
  • Add knowledge bases and decision plans to an IBM Content Classification system
  • Modify knowledge base and decision plan properties, such as how to collect feedback and whether backup versions should be automatically created
  • Start and stop knowledge base and decision plan instances
  • Add and modify field definitions
  • View the computers on which the system is running
IBM Content Classification Client
The client component is the main interface for the IBM Content Classification server. To help you develop client applications, libraries and samples are provided in the following programming languages:
  • C
  • Java™
  • COM
  • .NET
When you run the installation program, you can choose to install the client component with the server component or install only the client component. You can install the client component on any of the supported operating systems.
Classification Workbench
You use Classification Workbench to create and analyze knowledge bases and decision plans. You can also run reports and graphical diagnostics to evaluate how well a knowledge base or decision plan is performing. You can import improved knowledge bases and decision plans to the IBM Content Classification server to improve the accuracy of classification results.
Classification Workbench is a Windows application. When you run the installation program, you can choose to install only Classification Workbench. You must install the application on Windows.
The Taxonomy Proposer application is a tool that you can use to discover new categories in an uncategorized or partially categorized body of documents. This tool is automatically installed when you install Classification Workbench. If you remove Classification Workbench from a computer, the Taxonomy Proposer is also removed.
Administration and data server
IBM Content Classification system data, such as knowledge bases, decision plans, configuration data, and status data, is stored in the file system. The data server is the component that is responsible for storing and retrieving this data.
The system uses only one data server regardless of how many instances of the IBM Content Classification server component are installed. In a multiple server configuration, each server connects to the same data server.
For performance and scalability, you can run the installation program multiple times to set up multiple IBM Content Classification servers and multiple client installations. Typically, you install the administration and data server on the first IBM Content Classification server that you install. When you install additional clients or servers, you specify information that enables those computers to connect to the computer where the administration and data server is installed.
Listener
The listener is the server-side component that acts as the entry point to IBM Content Classification. Client requests are received by the listener and dispatched to the appropriate server component for processing.
When you install an IBM Content Classification server component, you specify whether you want to install the listener component. The default option is to install the listener on the same computer with the server component.
Classification Center
You can use this web application to configure, run, and monitor classification of content that is stored in IBM Content Manager, IBM FileNet® Content Manager, and file systems. You can also use this application to review documents and, if necessary, reclassify them by specifying different knowledge base categories or decision plan actions. Included with the Classification Center is the Content Extractor, which is a command-line tool that you can use to extract the content that you want to classify from an IBM FileNet Content Manager object store or IBM Content Manager repository. You can import the extracted content into the Classification Workbench and use it to train a knowledge base or test the rules in a decision plan.
Start of change Classification Quick Start Tool End of change
Start of change You can use the Classification Quick Start Tool to evaluate IBM Content Classification and create an initial classification project. You can easily import documents from the file system and organize them into categories. Tune classification by completing tasks that are suggested by the Classification Quick Start Tool to improve classification performance. Then you can view the results, adjust the ratio between automation and accuracy, and export the project to Classification Workbench for further development. End of change