No author metadata extraction or mapping in Unix File System Crawler in IBM Content Analytics with Enterprise Search

Technote (troubleshooting)


Problem(Abstract)

Why is there no option to map author field in Unix File System Crawler in IBM Content Analytics with Enterprise Search?


Resolving the problem

For a UNIX File System crawler, there is no author information available from the file system other than the owner userid. Even with that, it is difficult to derive a proper ownership when there are many
owning userid's at the directory / file hierarchy, which are meant for access permission rather than true authorship of a particular document. Thus, it is not very useful to fetch them as native metadata.

For example, it is a common practice that an administrative userid be set up to own the directory where the documents are stored as well as the owning userid for these documents. This is designed to make access control simple to manage. With any such userid, it has no bearing on who actually authored each individual document.

Rate this page:

(0 users)Average rating

Document information


More support for:

Watson Content Analytics

Software version:

3.0

Operating system(s):

AIX, Linux, Windows

Reference #:

1637991

Modified date:

2013-05-20

Translate my page

Machine Translation

Content navigation