Before you run the IBM® Content
Classification installation
program, ensure that your system meets the requirements. You might
also want to gather information about your system that the installation
program needs.
For performance and scalability, you can run the installation program
on multiple computers. When you run the installation program, you
can install all of the Content Classification components
or select the components that you want to install.
The Content Classification components include:
- IBM Content
Classification Server
- The server component provides classification services for client
applications. To administer the system, you use the Management Console to:
- Add knowledge bases and decision plans to an IBM Content
Classification system
- Modify knowledge base and decision plan properties, such as how
to collect feedback and whether backup versions should be automatically
created
- Start and stop knowledge base and decision plan instances
- Add and modify field definitions
- View the computers on which the system is running
- IBM Content
Classification Client
- The client component is the main interface to the IBM Content
Classification server. To help you
develop client applications, libraries and samples are provided in
several programming languages.
- When you run the installation program, you can choose to install
the client component with the server component or install only the
client component. You can install the client component on any of the
supported operating systems.
- Classification Workbench
- You use Classification Workbench to
create and analyze knowledge bases and decision plans. You can also
run reports and graphical diagnostics to evaluate how well a knowledge
base or decision plan is performing. You can import improved knowledge
bases and decision plans to the IBM Content
Classification server to improve the
accuracy of classification results.
- Classification Workbench is a Windows application, so you must install it
on a computer that runs Windows. When you run the installation
program, you can choose to install only Classification Workbench.
- The Taxonomy Proposer application
is a tool that you can use to discover new categories in an uncategorized
or partially categorized body of documents. This tool is automatically
installed when you install Classification Workbench.
If you remove Classification Workbench from
a computer, the Taxonomy Proposer is
also removed.
- Administration and data server
- IBM Content
Classification system
data, such as knowledge bases, decision plans, configuration data,
and status data, is stored in the file system. The data server is
the component that is responsible for storing and retrieving this
data.
- The system uses only one data server regardless of how many instances
of the IBM Content
Classification server
component are installed. In a multiple server configuration, each
server connects to the same data server.
- For performance and scalability, you can run the installation
program multiple times to set up multiple IBM Content
Classification servers and multiple
client installations. Typically, you install the administration and
data server on the first IBM Content
Classification server
that you install. When you install additional clients or servers,
you specify information that enables those computers to connect to
the computer where the administration and data server is installed.
- Listener
- The listener is the server-side component that acts as the entry
point to IBM Content
Classification. Client
requests are received by the listener and dispatched to the appropriate
server component for processing.
- When you install an IBM Content
Classification server
component, you specify whether you want to install the listener component.
The default option is to install the listener on the same computer
as the server component.
- Classification Center
- You can use this web application to configure, run, and monitor
classification of content that is stored in IBM Content
Manager, IBM FileNet® Content Manager, and file systems. You
can also use this application to review documents and, if necessary,
reclassify them by specifying different knowledge base categories
or decision plan actions.
- Included with the Classification Center is the Content Extractor, a command-line
tool that you can use to extract the content that you want to classify
from an IBM FileNet Content Manager object
store or IBM Content
Manager repository.
You can import the extracted content into Classification Workbench and use it to train a
knowledge base or test the rules in a decision plan.
- Classification Quick Start Tool
- You can use the Classification Quick Start Tool to
evaluate IBM Content
Classification and
create an initial classification project. You can import documents
from the file system and organize them into categories, and then tune
classification by completing the tasks that the Classification Quick Start Tool suggests to improve classification
performance. You can then view the results, adjust the ratio between
automation and accuracy, and export the project to Classification Workbench for further development.
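
Before you install components on several computers, it can help to write down which components go where and check that plan against the constraints described above. The following Python sketch is only a planning aid under stated assumptions: it does not call the installation program or any Content Classification API, and the component labels, the Computer class, and the validate_plan function are hypothetical names introduced for this illustration. It checks two rules taken from the component descriptions: the system uses exactly one administration and data server, and Classification Workbench must be installed on Windows.

```python
from dataclasses import dataclass, field

# Hypothetical component labels for planning purposes only.
# They are NOT option names used by the installation program.
SERVER = "server"
CLIENT = "client"
WORKBENCH = "workbench"
DATA_SERVER = "data_server"
LISTENER = "listener"


@dataclass
class Computer:
    """One computer in a planned deployment (hypothetical model)."""
    name: str
    os: str  # for example "windows" or "linux"
    components: set = field(default_factory=set)


def validate_plan(computers):
    """Return a list of problems found in a planned deployment."""
    problems = []

    # The system uses only one data server, regardless of how many
    # server components are installed.
    data_hosts = [c.name for c in computers if DATA_SERVER in c.components]
    if len(data_hosts) != 1:
        problems.append(
            f"expected exactly one data server, found {len(data_hosts)}"
        )

    # Classification Workbench is a Windows application.
    for c in computers:
        if WORKBENCH in c.components and c.os.lower() != "windows":
            problems.append(f"{c.name}: Classification Workbench requires Windows")

    return problems


# Example plan: two servers that share one data server, plus a
# Windows workstation for Classification Workbench and a client.
plan = [
    Computer("host-a", "linux", {SERVER, LISTENER, DATA_SERVER}),
    Computer("host-b", "linux", {SERVER, LISTENER}),
    Computer("desk-1", "windows", {WORKBENCH, CLIENT}),
]

for problem in validate_plan(plan):
    print(problem)
```

In this example plan the loop prints nothing, because host-a holds the single data server and Classification Workbench is placed on a Windows computer. The sketch places a listener with each server only because that is the default option described above; it does not treat the listener as required.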