No author metadata extraction or mapping in Unix File System Crawler in IBM Content Analytics with Enterprise Search
Why is there no option to map the author field in the UNIX File System crawler in IBM Content Analytics with Enterprise Search?
Resolving the problem
For a UNIX File System crawler, the file system provides no author information other than the owning user ID. Even the owner is a poor stand-in for the author: the user IDs that own directories and files throughout the hierarchy are assigned for access permission, not to record the true authorship of a particular document. For that reason, fetching the owner as native metadata is not very useful.
For example, it is common practice to set up an administrative user ID that owns both the directory where the documents are stored and the documents themselves. This arrangement keeps access control simple to manage, but such a user ID has no bearing on who actually authored each individual document.
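
The point can be seen outside the product with a small Python sketch. It is an illustration only and does not reflect how the crawler is implemented. The helper names file_owner and owners_under are hypothetical, and the temporary directory stands in for a document directory. The sketch reads the owner of each file with os.stat and pwd.getpwuid, which is the only author-like value a UNIX file system exposes.

```python
import os
import pwd
import tempfile


def file_owner(path):
    """Return the login name of the user ID that owns path.

    Falls back to the numeric user ID when it has no passwd entry.
    """
    uid = os.stat(path).st_uid
    try:
        return pwd.getpwuid(uid).pw_name
    except KeyError:
        return str(uid)


def owners_under(root):
    """Map every directory and file under root to its owning user."""
    owners = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        owners[dirpath] = file_owner(dirpath)
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            owners[full_path] = file_owner(full_path)
    return owners


# Hypothetical document directory: every file is created by the same account,
# much like a directory populated under one administrative user ID.
with tempfile.TemporaryDirectory() as docs_dir:
    for name in ("report.txt", "minutes.txt"):
        with open(os.path.join(docs_dir, name), "w") as handle:
            handle.write("sample document\n")
    demo_owners = owners_under(docs_dir)

for path, owner in sorted(demo_owners.items()):
    print(owner, path)
```

Every entry reports the same owner, namely the account that created the files, regardless of who wrote the content. A directory managed under one administrative user ID behaves the same way.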