What are options for reducing the amount of disk space that is being used for collections?

Technote (FAQ)


Question

How can I reduce the amount of disk space that is used by a search collection without, of course, reducing the number of documents being crawled?

Answer

Here are a few of the big ones

  • compression
  • disable cached content types (this means no html previw)
  • don't store the text in the index (this means no dynamic summaries - but everything is still indexed)
  • leverage the light crawler - or at least a subset of the settings (this will reduce the size of the crawler database but has some tradeoffs you need to be aware of)

A full merge will remove deleted documents.

A new crawl will reduce the size of a crawl log that was bloated by refreshes.

Misconfiguring distributed indexing can cause updates to be saved for later (written to disk) if they can't be sent to a client.

Historical Number

1793

Rate this page:

(0 users)Average rating

Document information


More support for:

Watson Explorer
Best Practices

Software version:

5.0, 6.0, 6.1, 7.0, 7.5, 8.0, 8.1, 8.2.0

Operating system(s):

Linux, Solaris, Windows, Windows 2003 server, Windows 2008 server, Windows Vista

Reference #:

1620326

Modified date:

2013-03-26

Translate my page

Machine Translation

Content navigation