Changes

909 bytes added, 17:03, 20 September 2011
Added thesaurus query section
*<code>1:</code> Standard TF/IDF algorithm, which treats ''document nodes'' as document units.
*<code>2:</code> Each ''text node'' is treated as a document unit in the TF/IDF algorithm. This variant is an alternative to type <code>1</code> if the database contains only a few large XML files.
 
==Querying Using Thesaurus==
 
BaseX supports full-text queries using thesauri, but it does not provide a default thesaurus. This is why a query such as
 
<pre class="brush:xquery">
'computers' contains text 'hardware'
using thesaurus default
</pre>
 
will return <code>false</code>. However, if a thesaurus is specified explicitly, the result will be <code>true</code>:
 
<pre class="brush:xquery">
'computers' contains text 'hardware'
using thesaurus at 'XQFTTS_1_0_4/TestSources/usability2.xml'
</pre>
 
Thesaurus files must use the same format as the thesauri provided by the [http://dev.w3.org/2007/xpath-full-text-10-test-suite XQuery and XPath Full Text 1.0 Test Suite]. Each file is an XML document whose structure is defined by an [http://dev.w3.org/cvsweb/~checkout~/2007/xpath-full-text-10-test-suite/TestSuiteStagingArea/TestSources/thesaurus.xsd?rev=1.3;content-type=application%2Fxml XSD Schema].
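 
As a rough illustration of that format, a minimal thesaurus file might look like the sketch below. The element names and the namespace are given as they are understood from the test suite schema and should be checked against the XSD linked above; the file name <code>thesaurus.xml</code> and the terms <code>happy</code> and <code>lucky</code> are hypothetical and do not come from the test suite.
 
<pre class="brush:xml">
<!-- Hypothetical file thesaurus.xml: an illustrative sketch, not a file shipped with BaseX -->
<thesaurus xmlns="http://www.w3.org/2007/xqftts/thesaurus">
  <entry>
    <!-- The term that is looked up -->
    <term>happy</term>
    <synonym>
      <!-- A term related to the entry term -->
      <term>lucky</term>
      <!-- Relationship type; RT is assumed here to mean "related term" -->
      <relationship>RT</relationship>
    </synonym>
  </entry>
</thesaurus>
</pre>
 
Assuming the file is stored where the query processor can resolve it, an expression such as <code>'happy' contains text 'lucky' using thesaurus at 'thesaurus.xml'</code> would then be expected to consult this entry.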
==Fuzzy Querying==