Difference between revisions of "Options"

From BaseX Documentation
Jump to navigation Jump to search
Line 2: Line 2:
 
[on]/[off] or a value can be set:</p>  
 
[on]/[off] or a value can be set:</p>  
 
   
 
   
<h3>General options</h3>
+
===General options===
<ul>
 
<li><code>debug</code>: Show internal debug info, usually used for development of new features.</li>
 
<li><code>mainmem</code>: Use main-memory mode. Now all data is stored exclusively in main memory and enables even faster evaluation times, but the data is lost after shut down of BaseX.</li>
 
<li><code>timeout [TIME]</code>: Set server timeout in seconds. The timeout describes the maximum execution time of a query at the BaseX server. The timeout is deactivated if <code>TIME</code> is set to <code>0</code>.</li>
 
<li><code>serverport [PORT]</code>: Client/server communication: <code>PORT</code>, used for starting the server.</li>
 
<li><code>restport [PORT]</code>: Client/server communication: <code>PORT</code>, used for starting the REST server.</li>
 
<li><code>host [HOST]</code>: Client/server communication: <code>HOST</code>, used for connecting new clients.</li>
 
<li><code>port [PORT]</code>: Client/server communication: <code>PORT</code>, used for connecting new clients.</li>
 
</ul>
 
 
   
 
   
<h3>Database options</h3>
+
;debug
<ul>
+
: Show internal debug info, usually used for development of new features.
<li><code>dbpath [PATH]</code>: Set a new database <code>PATH</code>.</li>
 
<li><code>chop</code>: Chop all XML whitespace nodes and reduce the memory consumption of the database without lack of information.</li>
 
<li><code>intparse</code>: Use internal XML parser instead of standard Java XML parser and reduce main memory consumption.</li>
 
<li><code>entity</code>: Internal parser parses XML entities.</li>
 
<li><code>dtd</code>: Internal parser parses a DTD and checks the parsed XML data against the DTD.</li>
 
<li><code>pathindex</code>: Create index for paths and speedup path based queries.</li>
 
<li><code>textindex</code>: Create index for text nodes and speedup queries querying the entire text of a node.</li>
 
<li><code>attrindex</code>: Create index for attribute values and speedup queries querying attribute values.</li>
 
<li><code>ftindex</code>: Create index for fulltext and speedup queries querying full text.</li>
 
<li><code>wildcards</code>: Create wildcard optimized full text index that is especially tuned for querying full text using wildcards expressions.</li>
 
<li><code>stemming</code>: Stemm full text tokens before indexing and decrease index size. Should only be assigned for full text queries using stemming, otherwise the full text index could not be assigned for query speedups.</li>
 
<li><code>casesens</code>: Case sensitive full text indexing, i.e. full text tokens are stored case sensitive and enables the assignment of the full text index for case sensitive queries.</li>
 
<li><code>diacritics</code>: Diacritic sensitive full text indexing, i.e. full text tokens are stored with diacritics and enables the assignment of the full text index for diacritic queries.</li>
 
<li><code>language [LANG]</code>: Language for full-text search index, e.g. <code>LANG</code> can be set to <code>English</code> or <code>German</code>.</li>
 
<li><code>scoring [MODE]</code>: Precalculation of full text scores and storage within the full text index structure. Enables TF/IDF based scores in full text queries and increases memory consumption while indexing. <code>MODE</code> can be set to <code>1</code> for document based scoring, <code>2</code> for text node based scoring and <code>0</code> for non scoring.</li>
 
<li><code>stopwords [PATH]</code>: Specify stop word list at <code>PATH</code>, that could be used e.g. to decrease the full text index size. The standard stopword list (english) is provided at <code>'etc/xml/stopWords'</code>.</li>
 
<li><code>lserror [ERROR]</code>: Levenshtein default error used for fuzzy search with BaseX specialized own FTMatchOption, e.g. <code>ERROR</code> could be <code>2</code>.</li>
 
<li><code>createfilter [FILTER]</code>: Path for filtering xml documents, e.g. <code>FILTER</code> could be <code>'*.xml'</code>.</li>
 
<li><code>compress [MODE]</code>: Compress text in database. Activate with setting <code>MODE</code> to <code>1</code></li>
 
</ul>
 
 
   
 
   
<h3>Query options</h3>
+
;mainmem
<ul>
+
: Use main-memory mode. Now all data is stored exclusively in main memory and enables even faster evaluation times, but the data is lost after shut down of BaseX.
<li><code>queryinfo</code>: Show (all) process info.</li>
 
<li><code>serialize</code>: Flag for serialization of query results. I.e. there are any results printed if serialization is turned off.</li>
 
<li><code>wrapoutput</code>: Serialize query results as XML, i.e. the result output is a valid xml document.</li>
 
<li><code>serializer [ARGS]</code>: Serialization parameters <code>ARGS</code>, separated by commas.</li>
 
<li><code>exporter [ARGS]</code>: Exporter serialization parameters <code>ARGS</code>, separated by commas.</li>
 
<li><code>runs [NUM]</code>: Specify number of runs a query is executed. Results are printed a single time and evaluation times are averages of all runs.</li>
 
<li><code>tablemem</code>: Loading database table into main memory and speedup evaluation times.</li>
 
</ul>
 
 
   
 
   
<h3>Print query options</h3>
+
;timeout [TIME]</code>: Set server timeout in seconds. The timeout describes the maximum execution time of a query at the BaseX server. The timeout is deactivated if <code>TIME</code> is set to <code>0
<ul>
+
:.
<li><code>dotplan</code>: Create query plans visualized with [http://www.graphviz.org dotty], see <code>'plan.dot'</code> file in your project folder.</li>
 
<li><code>dotcompact</code>: Compact dot representation.</li>
 
<li><code>dotdisplay</code>: Display dot representation after query execution.</li>
 
<li><code>dotty [PATH]</code>: Program <code>PATH</code> of dotty executable.</li>
 
<li><code>xmlplan</code>: Print query plan as xml document.</li>
 
<li><code>compplan</code>: Create query plan before or after compilation - query plan might change due to optimizations.</li>
 
<li><code>cachequery</code>: Cache the query results.</li>
 
<li><code>maxtext [SIZE]</code>: Maximum text size to be displayed, e.g. <code>SIZE</code> could be <code>1024</code> for displaying maximal 1024 bytes.</li>
 
<li><code>maxstat [NUM]</code>: Set maximum number of index occurrences to print (through the command info index), e.g. <code>NUM</code> could be <code>15</code> for displaying fifteen index occurrences.</li>
 
<li><code>writeback</code>: Write original files back after updates.</li>
 
</ul>
 
 
   
 
   
<h3>DeepFS options</h3>  
+
;serverport [PORT]</code>: Client/server communication: <code>PORT
<ul>  
+
:, used for starting the server.
<li><code>fsmeta</code>: Import file metadata.</li>  
+
<li><code>fscont</code>: Import file content.</li>  
+
;restport [PORT]</code>: Client/server communication: <code>PORT
<li><code>fsxml</code>: Import xml contents.</li>  
+
:, used for starting the REST server.
<li><code>fstextmax [SIZE]</code>: Maximal size of textual imports, e.g. <code>SIZE</code> could be <code>1024</code> for importing less equal 1024 bytes.</li>  
+
<li><code>fsverbose</code>: Verbose debug informations for file system traversal.</li>  
+
;host [HOST]</code>: Client/server communication: <code>HOST
<li><code>fuse</code>: Fuse support.</li>  
+
:, used for connecting new clients.
<li><code>spotlight</code>: Spotlight integration on mac platforms, i.e. the spotlight index is used instead of the internal parser implementations.</li>
+
</ul>
+
;port [PORT]</code>: Client/server communication: <code>PORT
 +
:, used for connecting new clients.
 +
 +
 +
 +
===Database options===
 +
 +
;dbpath [PATH]</code>: Set a new database <code>PATH
 +
:.
 +
 +
;chop
 +
: Chop all XML whitespace nodes and reduce the memory consumption of the database without lack of information.
 +
 +
;intparse
 +
: Use internal XML parser instead of standard Java XML parser and reduce main memory consumption.
 +
 +
;entity
 +
: Internal parser parses XML entities.
 +
 +
;dtd
 +
: Internal parser parses a DTD and checks the parsed XML data against the DTD.
 +
 +
;pathindex
 +
: Create index for paths and speedup path based queries.
 +
 +
;textindex
 +
: Create index for text nodes and speedup queries querying the entire text of a node.
 +
 +
;attrindex
 +
: Create index for attribute values and speedup queries querying attribute values.
 +
 +
;ftindex
 +
: Create index for fulltext and speedup queries querying full text.
 +
 +
;wildcards
 +
: Create wildcard optimized full text index that is especially tuned for querying full text using wildcards expressions.
 +
 +
;stemming
 +
: Stemm full text tokens before indexing and decrease index size. Should only be assigned for full text queries using stemming, otherwise the full text index could not be assigned for query speedups.
 +
 +
;casesens
 +
: Case sensitive full text indexing, i.e. full text tokens are stored case sensitive and enables the assignment of the full text index for case sensitive queries.
 +
 +
;diacritics
 +
: Diacritic sensitive full text indexing, i.e. full text tokens are stored with diacritics and enables the assignment of the full text index for diacritic queries.
 +
 +
;language [LANG]</code>: Language for full-text search index, e.g. <code>LANG</code> can be set to <code>English</code> or <code>German
 +
:.
 +
 +
;scoring [MODE]</code>: Precalculation of full text scores and storage within the full text index structure. Enables TF/IDF based scores in full text queries and increases memory consumption while indexing. <code>MODE</code> can be set to <code>1</code> for document based scoring, <code>2</code> for text node based scoring and <code>0
 +
: for non scoring.
 +
 +
;stopwords [PATH]</code>: Specify stop word list at <code>PATH</code>, that could be used e.g. to decrease the full text index size. The standard stopword list (english) is provided at <code>'etc/xml/stopWords'
 +
:.
 +
 +
;lserror [ERROR]</code>: Levenshtein default error used for fuzzy search with BaseX specialized own FTMatchOption, e.g. <code>ERROR</code> could be <code>2
 +
:.
 +
 +
;createfilter [FILTER]</code>: Path for filtering xml documents, e.g. <code>FILTER</code> could be <code>'*.xml'
 +
:.
 +
 +
;compress [MODE]</code>: Compress text in database. Activate with setting <code>MODE
 +
: to <code>1</code>
 +
 +
 +
 +
===Query options===
 +
 +
;queryinfo
 +
: Show (all) process info.
 +
 +
;serialize
 +
: Flag for serialization of query results. I.e. there are any results printed if serialization is turned off.
 +
 +
;wrapoutput
 +
: Serialize query results as XML, i.e. the result output is a valid xml document.
 +
 +
;serializer [ARGS]</code>: Serialization parameters <code>ARGS
 +
:, separated by commas.
 +
 +
;exporter [ARGS]</code>: Exporter serialization parameters <code>ARGS
 +
:, separated by commas.
 +
 +
;runs [NUM]
 +
: Specify number of runs a query is executed. Results are printed a single time and evaluation times are averages of all runs.
 +
 +
;tablemem
 +
: Loading database table into main memory and speedup evaluation times.
 +
 +
 +
 +
===Print query options===
 +
 +
;dotplan</code>: Create query plans visualized with [http://www.graphviz.org dotty], see <code>'plan.dot'
 +
: file in your project folder.
 +
 +
;dotcompact
 +
: Compact dot representation.
 +
 +
;dotdisplay
 +
: Display dot representation after query execution.
 +
 +
;dotty [PATH]</code>: Program <code>PATH
 +
: of dotty executable.
 +
 +
;xmlplan
 +
: Print query plan as xml document.
 +
 +
;compplan
 +
: Create query plan before or after compilation - query plan might change due to optimizations.
 +
 +
;cachequery
 +
: Cache the query results.
 +
 +
;maxtext [SIZE]</code>: Maximum text size to be displayed, e.g. <code>SIZE</code> could be <code>1024
 +
: for displaying maximal 1024 bytes.
 +
 +
;maxstat [NUM]</code>: Set maximum number of index occurrences to print (through the command info index), e.g. <code>NUM</code> could be <code>15
 +
: for displaying fifteen index occurrences.
 +
 +
;writeback
 +
: Write original files back after updates.
 +
 +
 +
 +
===DeepFS options===
 +
 +
;fsmeta
 +
: Import file metadata.
 +
 +
;fscont
 +
: Import file content.
 +
 +
;fsxml
 +
: Import xml contents.
 +
 +
;fstextmax [SIZE]</code>: Maximal size of textual imports, e.g. <code>SIZE</code> could be <code>1024
 +
: for importing less equal 1024 bytes.
 +
 +
;fsverbose
 +
: Verbose debug informations for file system traversal.
 +
 +
;fuse
 +
: Fuse support.
 +
 +
;spotlight
 +
: Spotlight integration on mac platforms, i.e. the spotlight index is used instead of the internal parser implementations.

Revision as of 16:35, 9 December 2010

The following options are available and can be changed using the Set command. Either they can be turned [on]/[off] or a value can be set:

General options

debug
Show internal debug info, usually used for development of new features.
mainmem
Use main-memory mode. Now all data is stored exclusively in main memory and enables even faster evaluation times, but the data is lost after shut down of BaseX.
timeout [TIME]
Set server timeout in seconds. The timeout describes the maximum execution time of a query at the BaseX server. The timeout is deactivated if TIME is set to 0
.
serverport [PORT]
Client/server communication: PORT
, used for starting the server.
restport [PORT]
Client/server communication: PORT
, used for starting the REST server.
host [HOST]
Client/server communication: HOST
, used for connecting new clients.
port [PORT]
Client/server communication: PORT
, used for connecting new clients.


Database options

dbpath [PATH]
Set a new database PATH
.
chop
Chop all XML whitespace nodes and reduce the memory consumption of the database without lack of information.
intparse
Use internal XML parser instead of standard Java XML parser and reduce main memory consumption.
entity
Internal parser parses XML entities.
dtd
Internal parser parses a DTD and checks the parsed XML data against the DTD.
pathindex
Create index for paths and speedup path based queries.
textindex
Create index for text nodes and speedup queries querying the entire text of a node.
attrindex
Create index for attribute values and speedup queries querying attribute values.
ftindex
Create index for fulltext and speedup queries querying full text.
wildcards
Create wildcard optimized full text index that is especially tuned for querying full text using wildcards expressions.
stemming
Stemm full text tokens before indexing and decrease index size. Should only be assigned for full text queries using stemming, otherwise the full text index could not be assigned for query speedups.
casesens
Case sensitive full text indexing, i.e. full text tokens are stored case sensitive and enables the assignment of the full text index for case sensitive queries.
diacritics
Diacritic sensitive full text indexing, i.e. full text tokens are stored with diacritics and enables the assignment of the full text index for diacritic queries.
language [LANG]
Language for full-text search index, e.g. LANG can be set to English or German
.
scoring [MODE]
Precalculation of full text scores and storage within the full text index structure. Enables TF/IDF based scores in full text queries and increases memory consumption while indexing. MODE can be set to 1 for document based scoring, 2 for text node based scoring and 0
for non scoring.
stopwords [PATH]
Specify stop word list at PATH, that could be used e.g. to decrease the full text index size. The standard stopword list (english) is provided at 'etc/xml/stopWords'
.
lserror [ERROR]
Levenshtein default error used for fuzzy search with BaseX specialized own FTMatchOption, e.g. ERROR could be 2
.
createfilter [FILTER]
Path for filtering xml documents, e.g. FILTER could be '*.xml'
.
compress [MODE]
Compress text in database. Activate with setting MODE
to 1


Query options

queryinfo
Show (all) process info.
serialize
Flag for serialization of query results. I.e. there are any results printed if serialization is turned off.
wrapoutput
Serialize query results as XML, i.e. the result output is a valid xml document.
serializer [ARGS]
Serialization parameters ARGS
, separated by commas.
exporter [ARGS]
Exporter serialization parameters ARGS
, separated by commas.
runs [NUM]
Specify number of runs a query is executed. Results are printed a single time and evaluation times are averages of all runs.
tablemem
Loading database table into main memory and speedup evaluation times.


Print query options

dotplan
Create query plans visualized with dotty, see 'plan.dot'
file in your project folder.
dotcompact
Compact dot representation.
dotdisplay
Display dot representation after query execution.
dotty [PATH]
Program PATH
of dotty executable.
xmlplan
Print query plan as xml document.
compplan
Create query plan before or after compilation - query plan might change due to optimizations.
cachequery
Cache the query results.
maxtext [SIZE]
Maximum text size to be displayed, e.g. SIZE could be 1024
for displaying maximal 1024 bytes.
maxstat [NUM]
Set maximum number of index occurrences to print (through the command info index), e.g. NUM could be 15
for displaying fifteen index occurrences.
writeback
Write original files back after updates.


DeepFS options

fsmeta
Import file metadata.
fscont
Import file content.
fsxml
Import xml contents.
fstextmax [SIZE]
Maximal size of textual imports, e.g. SIZE could be 1024
for importing less equal 1024 bytes.
fsverbose
Verbose debug informations for file system traversal.
fuse
Fuse support.
spotlight
Spotlight integration on mac platforms, i.e. the spotlight index is used instead of the internal parser implementations.