Fixes are available
APAR status
Closed as program error.
Error description
Reorg and Cleanup Daemon cause Search Collection corruption.
Two processes are known to cause index corruption. The first is the cleanup daemon, which runs by default at night (possibly 2 hours before midnight), so indexing can interfere with it. The cleanup daemon marks expired documents and documents with broken links as deleted. The second is reorg(), which runs when the index contains more than 1000 deleted documents and physically removes them from the index. When both run simultaneously, the collection is corrupted, regardless of whether a crawl is running.
Local fix
The code logic was changed. Because reorg and cleanup corrupt the collection when they run together, reorg now runs only after the cleanup process has finished. In addition, reorg runs only if the number of deleted documents is greater than 10% of the indexed documents. This reduces load on the index, because reorg() is a heavy procedure. Running it on more than 20% of the indexed documents could cause performance issues. These changes are included in the new webscanner.jar.
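The decision described above can be sketched as follows. This is a minimal illustration only, not the code shipped in webscanner.jar: the class name ReorgPolicy, the method shouldRunReorg, and its parameters are hypothetical, and only the ordering rule (reorg after cleanup) and the 10% threshold come from the text.

```java
// Hypothetical sketch; names are illustrative and not taken from webscanner.jar.
final class ReorgPolicy {
    // From the APAR text: reorg runs only when deleted documents
    // exceed 10% of the indexed documents.
    static final long DELETED_THRESHOLD_PERCENT = 10;

    private ReorgPolicy() {}

    /**
     * Returns true only if cleanup is not running and the number of
     * deleted documents is strictly greater than 10% of indexed documents.
     */
    static boolean shouldRunReorg(boolean cleanupRunning, long deletedDocs, long indexedDocs) {
        if (cleanupRunning) {
            return false; // never overlap reorg with the cleanup daemon
        }
        if (indexedDocs <= 0) {
            return false; // nothing indexed, nothing to reorganize
        }
        // Integer arithmetic avoids floating-point rounding at the boundary.
        return deletedDocs * 100 > indexedDocs * DELETED_THRESHOLD_PERCENT;
    }
}
```

For example, with 1000 indexed documents, this sketch skips reorg at 100 deleted documents and allows it at 101, provided cleanup has finished.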
Problem summary
Collection is corrupted because of reorg process failure
Problem conclusion
A fix is available from Fix Central: http://www.ibm.com/eserver/support/fixes/fixcentral/swg/quickorder?apar=PK96690&productid=WebSphere%20Portal&brandid=5 You might need to type or paste the complete address into your Web browser.
Temporary fix
Comments
APAR Information
APAR number
PK96690
Reported component name
WEBSPHERE PORTA
Reported component ID
5724E7600
Reported release
60H
Status
CLOSED PER
PE
NoPE
HIPER
NoHIPER
Special Attention
NoSpecatt
Submitted date
2009-09-21
Closed date
2009-12-03
Last modified date
2010-06-07
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Fix information
Fixed component name
WEBSPHERE PORTA
Fixed component ID
5724E7600
Applicable component levels
R601 PSY
UP
Product
WebSphere Portal
Version
6.0.1.4
Platform
Platform Independent
Document Information
Modified date:
07 June 2010