To remove frequently occurring terms from queries, such as enterprise-specific vocabulary, you must specify which words qualify as stop words in an XML file.
The XML file that lists the stop words must comply with a specific schema specified in the XML document. This is an example of an XML file for stop words:
<?xml version="1.0" encoding="UTF-8"?>
<stopWords xmlns="http://www.ibm.com/of/83/stopwordbuilder/xml">
<stopWord>WebSphere Application Server</stopWord>
<stopWord>WAS</stopWord>
<stopWord>...</stopWord>
</stopWords>
A stop word can include white-space characters, but it cannot include punctuation characters, such as a comma (,) or vertical bar (|), because these characters might interfere with the query syntax.
You do not need to enumerate normalizations of the term, such as the removal of accents or umlauts (normalization is handled automatically). For example, if you want to include the term météo as a stop word, you do not need to include the term meteo, too.
When you create the dictionary from your XML file, you can specify the lc parameter to control whether upper and lower case variants of the term are to be ignored or respected. For example, if you create a case-insensitive dictionary and include the term météo, you do not need to include the term METEO, too.
To create a list of stop words: