What are the options for reducing the amount of disk space used by collections?

Technote (FAQ)


How can I reduce the amount of disk space that is used by a search collection without, of course, reducing the number of documents being crawled?


Here are a few of the biggest options:

  • Enable compression.
  • Disable cached content types (this means no HTML preview).
  • Don't store the text in the index (this means no dynamic summaries, but everything is still indexed).
  • Use the light crawler, or at least a subset of its settings (this reduces the size of the crawler database, but has trade-offs you need to be aware of).

A full merge removes deleted documents from the index.

Starting a new crawl reduces the size of a crawl log that has become bloated by refreshes.

Misconfigured distributed indexing can also consume disk space: updates that can't be sent to a client are written to disk and saved for later.
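
Before choosing among these options, it can help to see where a collection's disk space actually goes (index, crawler database, cache, logs). The following is a minimal sketch in Python, not a Watson Explorer tool: it assumes the collection's files live under a single directory on disk, and the function names and any path you pass in are hypothetical.

```python
import os


def dir_size(path):
    """Return the total size in bytes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            # Do not count symlinks; they may point outside the collection.
            if os.path.islink(full):
                continue
            try:
                total += os.path.getsize(full)
            except OSError:
                # A file can disappear while a crawl or merge is running.
                pass
    return total


def usage_by_entry(collection_dir):
    """Map each immediate child of collection_dir to its size in bytes,
    ordered from largest to smallest."""
    sizes = {}
    for entry in os.scandir(collection_dir):
        if entry.is_symlink():
            continue
        if entry.is_dir():
            sizes[entry.name] = dir_size(entry.path)
        elif entry.is_file():
            sizes[entry.name] = entry.stat().st_size
    # Largest consumers first, so the biggest candidates for cleanup lead.
    return dict(sorted(sizes.items(), key=lambda kv: kv[1], reverse=True))
```

Calling `usage_by_entry` on the collection's directory before and after a change (for example, after a full merge or a new crawl) gives a rough view of which files shrank and by how much.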

Document information

More support for:

Watson Explorer
Best Practices

Software version:

5.0, 6.0, 6.1, 7.0, 7.5, 8.0, 8.1, 8.2.0

Operating system(s):

Linux, Solaris, Windows
