Changes

Full-Text

Revision as of 17:03, 20 September 2011

909 bytes added , 17:03, 20 September 2011

Added thesaurus query section

*<code>1:</code> Standard TF/IDF algorithm, which treats ''document nodes'' as document units.

*<code>2:</code> Each ''text node'' is treated as a document unit in the TF/IDF algorithm. This variant is an alternative to type <code>1</code> if the database contains only a few large XML files.

==Querying Using Thesaurus==

BaseX supports full-text queries using thesauri, but it does not provide a default thesaurus. This is why a query such as

<pre>
'computers' contains text 'hardware'
  using thesaurus default
</pre>

will return <code>false</code>. However, if the thesaurus is specified explicitly, the result will be <code>true</code>:

<pre>
'computers' contains text 'hardware'
  using thesaurus at 'XQFTTS_1_0_4/TestSources/usability2.xml'
</pre>

The format of the thesaurus files must be the same as the format of the thesauri provided by the [http://dev.w3.org/2007/xpath-full-text-10-test-suite XQuery and XPath Full Text 1.0 Test Suite]. Each thesaurus is an XML document whose structure is defined by an [http://dev.w3.org/cvsweb/~checkout~/2007/xpath-full-text-10-test-suite/TestSuiteStagingArea/TestSources/thesaurus.xsd?rev=1.3;content-type=application%2Fxml XSD schema].
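
As a rough illustration of that format, a minimal thesaurus file might look like the sketch below. The namespace URI and the element names (<code>thesaurus</code>, <code>entry</code>, <code>term</code>, <code>synonym</code>, <code>relationship</code>) are given as recalled from the test suite's schema and should be checked against the XSD linked above. The terms and the relationship value <code>RT</code> are hypothetical and are not taken from <code>usability2.xml</code>.

```xml
<!-- Hypothetical thesaurus sketch; verify names and namespace against thesaurus.xsd -->
<thesaurus xmlns="http://www.w3.org/2007/xqftts/thesaurus">
  <entry>
    <!-- The term that is looked up in the thesaurus -->
    <term>happy</term>
    <synonym>
      <!-- A related term and the kind of relationship (assumed value) -->
      <term>glad</term>
      <relationship>RT</relationship>
    </synonym>
  </entry>
</thesaurus>
```

A file of this kind would then be referenced by its path in the <code>using thesaurus at</code> clause of a query.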

==Fuzzy Querying==

Dimitar
