Changes

2,455 bytes added , 12:07, 6 December 2010

Created page with "==Existing Indexes== Indexes can speedup queries by magnitudes. Currently, four indexes exist: <ul> <li> Text Index: This index speeds up text comparisons in p..."

==Existing Indexes==

Indexes can speedup queries by magnitudes.
Currently, four indexes exist:
<ul>
<li> Text Index: This index speeds up text comparisons in predicates.</li>
<li> Attribute Index: This index speeds up attribute value comparisons in predicates.</li>
<li> Full-Text Index: Full-text queries are sped up by this index.</li>
<li> Path Summary: This index speeds up the resolution of location paths.</li>
</ul>

==Examples of using the indexes==

Here are some examples for queries which are rewritten for index access:

==Text-Based Queries:==
<ul>
<li><code>//node()[text() = 'Usability']</code></li>
<li><code>//div[p = 'Usability' or p = 'Testing']</code></li>
<li><code>path/to/relevant[text() = 'Usability Testing']/and/so/on</code></li>
</ul>
==Attribute Index:==
<ul>
<li><code>//node()[@align = 'right']</code></li>
<li><code>descendant::elem[@id = '1']</code></li>
<li><code>range/query[@id >= 1 and @id <= 5]</code></li>
</ul>
==Full-Text Index:==
<ul>
<li><code>//node[text() contains text 'Usability']</code></li>
<li><code>//node[text() contains text 'Usebiliti' using fuzzy]</code></li>
<li><code>//book[chapter contains text ('web' ftor 'WWW' using no stemming)
ftand 'diversity' using stemming distance at most 5 words]</code></li>
</ul>
The full-text index is optimized to support all features of the XQuery Full Text
Recommendation.
BaseX extends the specification by offering a fuzzy match option.
Fuzzy search is based on the Levenshtein algorithm; the longer
query terms are, the more errors will be tolerated.
Default "Case Sensitivity", "Stemming" and "Diacritics" options
will be considered in the index creation. Consequently, all queries
will be sped up which use the default index options.

==Index data structures==

<ul>
<li>Text/Attribute Index 
Both the text and attribute index are based on a balanced B-Tree
and support exact matches and range queries.</li>
<li>Full-Text Index (Standard) 
The standard full-text index is implemented as sorted array
structure. It is optimized for simple and fuzzy searches.</li>
<li>Full-Text Index (Wildcards enabled) 
A second full-text index is implemented as a compressed trie.
Its needs slightly more memory than the standard full-text index,
but it supports more features, such as full wildcard search.
</li>
</ul>

Anonymous user

134.34.226.134

Changes

Indexes (edit)

Revision as of 12:07, 6 December 2010

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools