Watson Content Analytics provides several sets of application programming interfaces (APIs) so that you can create search and administration applications, modify crawled documents, filter search results, export documents, set up an identity management component to enforce document-level security, and perform ad-hoc text analysis on documents. .
For information about how to use the Watson Content Analytics APIs, see the examples in the ES_INSTALL_ROOT/samples directory.
Use the REST APIs to create search, content mining, and administration applications. The search REST API is available on Watson Content Analytics search servers and is deployed on the search application port, which by default is port 8393 if you use the embedded web application server. If you use WebSphere Application Server, the default port is 9081 or 80 if IBM HTTP Server is configured. The administrative REST API is available on the master server if you use the embedded web application server and uses the same port number as the administrative console, which by default is 8390. If you use WebSphere Application Server, the administrative REST API is available on the search application port, which by default is 9081 or 80 if IBM HTTP Server is configured. You can change these port numbers when you install Watson Content Analytics.
For more information about using the REST APIs, see the API documentation in the ES_INSTALL_ROOT/docs/api/rest directory. Sample scenarios that demonstrate how to perform administrative and search tasks are available in the ES_INSTALL_ROOT/samples/rest directory.
You can use the search and index application programming interfaces to create custom enterprise search applications. The Watson Content Analytics implementation of the search and index API (SIAPI) allows the search server to be accessed remotely.
Access to sensitive information that is contained in multiple repositories is typically controlled and enforced by the managing software. You identify yourself to the host system with a user ID and password. After the system authenticates your user ID and password, the managing software controls which documents you are allowed to see based on your access rights. Unless a single sign-on policy is implemented, you must have several different user IDs and passwords for each repository.
Watson Content Analytics provides an identity management component that enables users to search multiple repositories with a single query and see only the documents that they are allowed to see. You can build this component into your applications so that users can sign on with only one user ID and password when searching secure collections.
See the Javadoc documentation for details about the APIs that can be used to create your own identity management component or customize the provided solution.