Changes

67 bytes removed, 22:08, 25 May 2016
* If {{Code|stemming}} is activated, words are shortened to a base form by a language-specific stemmer:
<pre class="brush:xquery">
"catch" contains text "catches" using stemming,
"Haus" contains text "Häuser" using stemming using language 'de'
</pre>
* With the {{Code|stop words}} option, a list of words can be defined that will be ignored when tokenizing a string. This is particularly helpful if the full-text index takes too much space. A standard stop word list for English texts is provided as {{Code|etc/stopwords.txt}} in the full distributions of BaseX, and it is also available online at http://files.basex.org/etc/stopwords.txt.
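The following queries are a minimal sketch of how stop words might be specified, using the standard XQuery Full Text syntax. The sample strings and the inline word list are illustrative assumptions; the URL is the stop word list mentioned above:
<pre class="brush:xquery">
(: stop words given as an inline list :)
"You and me" contains text "you or me" using stop words ("and", "or"),
(: stop words loaded from the online list :)
"You and me" contains text "stop or words" using stop words at "http://files.basex.org/etc/stopwords.txt"
</pre>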