*<code>1:</code> Standard TF/IDF algorithm, which treats ''document nodes'' as document units.
*<code>2:</code> Each ''text node'' is treated as a document unit in the TF/IDF algorithm. This variant is an alternative to type <code>1</code> if the database contains only a few large XML files.
==Querying Using Thesaurus==
BaseX supports full-text queries using thesauri, but it does not provide a default thesaurus. This is why a query such as
<pre class="brush:xquery">
'computers' contains text 'hardware'
using thesaurus default
</pre>
will return <code>false</code>. However, if a thesaurus is specified explicitly, the result will be <code>true</code>:
<pre class="brush:xquery">
'computers' contains text 'hardware'
using thesaurus at 'XQFTTS_1_0_4/TestSources/usability2.xml'
</pre>
Thesaurus files must use the same format as the thesauri provided by the [http://dev.w3.org/2007/xpath-full-text-10-test-suite XQuery and XPath Full Text 1.0 Test Suite]. Each file is an XML document whose structure is defined by an [http://dev.w3.org/cvsweb/~checkout~/2007/xpath-full-text-10-test-suite/TestSuiteStagingArea/TestSources/thesaurus.xsd?rev=1.3;content-type=application%2Fxml XSD Schema].
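As an illustration, a minimal thesaurus file might look like the following sketch. The terms, the relationship value and the file name <code>thesaurus.xml</code> are hypothetical and chosen only for this example; the namespace and element names are given as they are understood from the test suite and should be checked against the schema linked above.
<pre class="brush:xml">
<thesaurus xmlns="http://www.w3.org/2007/xqftts/thesaurus">
  <!-- one entry per term; the synonyms are nested inside the entry -->
  <entry>
    <term>happy</term>
    <synonym>
      <term>lucky</term>
      <!-- RT = related term -->
      <relationship>RT</relationship>
    </synonym>
  </entry>
</thesaurus>
</pre>
Assuming the file is stored as <code>thesaurus.xml</code> in the current working directory, a query could then reference it in the same way as above:
<pre class="brush:xquery">
'happy' contains text 'lucky'
using thesaurus at 'thesaurus.xml'
</pre>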
==Fuzzy Querying==