* If {{Code|stemming}} is activated, words are shortened to a base form by a language-specific stemmer:
<pre class="brush:xquery">
"catch" contains text "catches" using stemming,"Haus" contains text "Häuser" using stemming using language 'de'
</pre>
* With the {{Code|stop words}} option, a list of words can be defined that will be ignored when tokenizing a string. This is particularly helpful if the full-text index takes too much space (a standard stopword list for English texts is provided in the directory {{Code|etc/stopwords.txt}} in the full distributions of BaseX, and available online at http://files.basex.org/etc/stopwords.txt):