Difference between revisions of "Options"

From BaseX Documentation
Jump to navigation Jump to search
m (Text replacement - "syntaxhighlight" to "pre")
 
(208 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
This page is linked from the [[Getting Started]] Section.
 
This page is linked from the [[Getting Started]] Section.
  
The options listed on this page influence the way how database [[Commands|commands]] are executed and XQuery expressions are evaluated. Options are divided into [[#Global Options|'''global options''']], which are valid for all BaseX instances, and '''local options''', which are specific to a client or session. Values of options are either ''strings'', ''numbers'' or ''booleans''.
+
The options listed on this page influence the way how database [[Commands|commands]] are executed and XQuery expressions are evaluated. Two kinds of options exist:
  
The {{Code|.basex}} [[Configuration#Configuration Files|configuration file]] is parsed by every new local BaseX instance. It contains all global options and, optionally, local options at the end of the file.
+
* '''[[#Global Options|Global Options]]''' are valid for all BaseX instances in the same JVM. This is particularly relevant if you are working with the client/server architecture.
 +
* '''Local options''' (all remaining ones) are specific to a client or session.
  
Various ways exist to access and change options:
+
Values of options are either ''strings'', ''numbers'' or ''booleans''. Options are ''static'' and not bound to a single operation (for example, the next command). Various ways exist to access and change options:
  
* The current value of an option can be requested with the [[Commands#GET|GET]] command. Local options can be changed via [[Commands#SET|SET]]. All values are ''static'': They stay valid until they are changed once again by another operation. If an option is of type ''boolean'', and if no value is specified, its current value will be inverted.
+
* The current value of an option can be requested with the {{Command|SHOW OPTIONS}} command. Local options can be changed via {{Command|SET}} (all global options, except for {{Option|DEBUG}}, can only be changed at startup time). If an option is of type ''boolean'', and if no value is specified, its current value is inverted.
  
* Initial values for global options can also be specified via system properties, which can e.g. be passed on with the [http://docs.oracle.com/javase/1.4.2/docs/tooldocs/windows/java.html#options -D flag] on command line, or using [http://docs.oracle.com/javase/6/docs/api/java/lang/System.html#setProperty(java.lang.String,%20java.lang.String) System.setProperty()] before creating a BaseX instance. The specified keys need to be prefixed with {{Code|org.basex.}}. An example:
+
* The {{Code|.basex}} [[Configuration#Configuration Files|configuration file]] is parsed by every new local BaseX instance. It contains all global options. Local options can be specified at the end of the file after the {{Code|Local Options}} comment:
  
<pre class="brush:bash">
+
<pre lang="perl">
java -Dorg.basex.CHOP=false -cp basex.jar org.basex.BaseX -c"get chop"
+
# General Options
CHOP: false
+
DEBUG = false
 +
...
 +
 
 +
# Local Options
 +
CATALOG = etc/w3-catalog.xml
 +
</pre>
 +
 
 +
* Initial values for global options can also be specified via system properties, which can be passed on with the [https://docs.oracle.com/en/java/javase/11/tools/java.html -D flag] on the command line, or with [https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/lang/System.html#setProperty(java.lang.String,java.lang.String) System.setProperty()] before creating a BaseX instance. The specified keys need to be prefixed with {{Code|org.basex.}}. An example:
 +
 
 +
<pre lang="perl">
 +
java -Dorg.basex.CATALOG=etc/w3-catalog.xml -cp basex.jar org.basex.BaseX -c"SHOW OPTIONS catalog"
 +
CATALOG: etc/w3-catalog.xml
 +
</pre>
 +
 
 +
* If the Mac OS X packaged application is used, global options can be set within the Info.plist file within the Contents folder of the application package. For example:
 +
 
 +
<pre lang="xml">
 +
<key>JVMOptions</key>
 +
<array>
 +
  <string>-Dorg.basex.CATALOG=etc/w3-catalog.xml</string>
 +
</array>
 
</pre>
 
</pre>
  
* If using the Mac OS X packaged application then global options can be set within the Info.plist file within the Contents folder of the application package. For example:
+
* In a [[Web Application]], the default can be adjusted in the {{Code|web.xml}} file as follows:
  
    <key>JVMOptions</key>
+
<pre lang="xml">
    <array>
+
<context-param>
        <string>-Dorg.basex.CHOP=false</string>
+
  <param-name>org.basex.catalog</param-name>
    </array>
+
  <param-value>etc/w3-catalog.xml</param-value>
 +
</context-param>
 +
</pre>
  
* In XQuery, local options can be set via option declarations and pragmas (see [[XQuery Extensions]]).
+
* In XQuery, local options can be set via option declarations and [[XQuery Extensions#Pragmas|pragmas]].
  
If options are implicitly changed by operations in the [[GUI]], the underlying commands will be listed in the [[GUI#Visualizations|Info View]].<br/><br/>
+
If options are changed by operations in the [[GUI]], the underlying commands are output in the [[GUI#Visualizations|Info View]].<br/>
  
 
=Global Options=
 
=Global Options=
Line 36: Line 59:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|DEBUG [boolean]}}
 
|{{Code|DEBUG [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Sends internal debug info to STDERR. This option can be turned on to get additional information for development and debugging purposes. It can also be triggered on [[Command-Line Options#BaseX Standalone|command line]] via <code>-d</code>.
+
|Sends debug information to STDERR. This option can be enabled to get additional information for development and debugging purposes. It can also be triggered on the [[Command-Line Options#BaseX Standalone|command line]] via <code>-d</code>. Note that the amount of output can be overwhelming, so it should not be used productively.
 
|}
 
|}
  
Line 50: Line 73:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|DBPATH [path]}}
 
|{{Code|DBPATH [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|<code>[[Configuration#Database Directory|{home}/BaseXData]]</code> or <code>[[Configuration#Database Directory|{home}/data]]</code>
+
|<code><code>[[Configuration#Database Directory|{home}/data]]</code>
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Points to the directory in which all databases are located.
 
|Points to the directory in which all databases are located.
 +
|}
 +
 +
===LOGPATH===
 +
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|LOGPATH [path]}}
 +
|- valign="top"
 +
| '''Default'''
 +
|<code>.logs</code>
 +
|- valign="top"
 +
| '''Summary'''
 +
|Points to the directory where [[Logging|log files]] are stored. Relative paths are resolved against the {{Option|DBPATH}} directory.
 
|}
 
|}
  
Line 64: Line 101:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|REPOPATH [path]}}
 
|{{Code|REPOPATH [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|<code>[[Configuration#Database Directory|{home}/BaseXRepo]]</code>
+
|<code>[[Configuration#Database Directory|{home}/repo]]</code>
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Points to the [[Repository]], in which all XQuery modules are located.
 
|Points to the [[Repository]], in which all XQuery modules are located.
Line 78: Line 115:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|LANG [language]}}
 
|{{Code|LANG [language]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|English}}
 
|{{Code|English}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the interface language. Currently, seven languages are available: 'English', 'German', 'French', 'Dutch', 'Italian', 'Japanese', and 'Vietnamese'.
+
|Specifies the interface language.
 
|}
 
|}
  
===LANGKEY===
+
===FAIRLOCK===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|LANGKEY [boolean]}}
+
|{{Code|FAIRLOCK [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Prefixes all texts with the internal language keys. This option is helpful if BaseX is translated into another language, and if you want to see where particular texts are displayed.
+
|Defines the locking strategy:
 +
* By default, non-fair locking is used. Read transactions are favored, and transactions that no not access databases can be evaluated even if the limit for parallel transactions (specified via {{Option|PARALLEL}}) is reached. This prevents update operations from blocking all other requests. For example, the DBA can still be used to see which jobs are running even if the queue is full.
 +
* When fair locking is enabled, read and write transactions are treated equally (first in, first out). This avoids starvation of updates and should be used when prompt evaluation of updates is critical.
 
|}
 
|}
  
===GLOBALLOCK===
+
===CACHETIMEOUT===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|GLOBALLOCK [boolean]}}
+
|{{Code|CACHETIMEOUT [seconds]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|{{Code|false}}
+
|{{Code|3600}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Controls if local (database) or global (process) locking will be used for managing read and write operations. The article on [[Transaction Management]] provides more details on concurrency control.
+
|Specifies how many seconds the results of [[Job Module|asynchronously executed jobs]] are cached in main memory.
 
|}
 
|}
  
Line 122: Line 161:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|HOST [host]}}
 
|{{Code|HOST [host]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|localhost}}
 
|{{Code|localhost}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|This host name is used by the client when connecting to a server. This option can also be changed when running the client on [[Command-Line Options#BaseX Client|command line]] via <code>-n</code>.
 
|This host name is used by the client when connecting to a server. This option can also be changed when running the client on [[Command-Line Options#BaseX Client|command line]] via <code>-n</code>.
Line 136: Line 175:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PORT [port]}}
 
|{{Code|PORT [port]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|1984}}
 
|{{Code|1984}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|This port is used by the client when connecting to a server. This option can also be changed when running the client on [[Command-Line Options#BaseX Client|command line]] via <code>-p</code>.
 
|This port is used by the client when connecting to a server. This option can also be changed when running the client on [[Command-Line Options#BaseX Client|command line]] via <code>-p</code>.
Line 150: Line 189:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|SERVERPORT [port]}}
 
|{{Code|SERVERPORT [port]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|1984}}
 
|{{Code|1984}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is the port the database server will be listening to. This option can also be changed when running the server on [[Command-Line Options#BaseX Server|command line]] via <code>-p</code>.
+
|This is the port the database server is listening to. This option can also be changed when running the server on [[Command-Line Options#BaseX Server|command line]] via <code>-p</code>.
 
|}
 
|}
  
Line 164: Line 203:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|USER [name]}}
 
|{{Code|USER [name]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Represents a user name, which is used for accessing the server or an HTTP service:
+
|Represents a username, which is used for accessing the server or an HTTP service:
* The default value will be overwritten if a client specifies its own credentials.
+
* The default value is overwritten if a client specifies its own credentials.
 
* If the default value is empty, login will only be possible if the client specifies credentials.
 
* If the default value is empty, login will only be possible if the client specifies credentials.
 
* The option can also be changed on [[Command-Line Options#BaseX Client|command line]] via <code>-U</code>.
 
* The option can also be changed on [[Command-Line Options#BaseX Client|command line]] via <code>-U</code>.
Line 181: Line 220:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PASSWORD [password]}}
 
|{{Code|PASSWORD [password]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Represents a password, which is used for accessing the server or an HTTP service:
+
|Represents a password, which is used for accessing the server:
* The default value will be overwritten if a client specifies its own credentials.
+
* The default value is overwritten if a client specifies its own credentials.
* If the default value is empty, login will only be possible if the client specifies credentials.
+
* If the default value is empty, authentication will only be possible if the client supplies credentials.
 
* The option can also be changed on [[Command-Line Options#BaseX Client|command line]] via <code>-P</code>.
 
* The option can also be changed on [[Command-Line Options#BaseX Client|command line]] via <code>-P</code>.
 
* Please note that it is a security risk to specify your password in plain text.
 
* Please note that it is a security risk to specify your password in plain text.
Line 199: Line 238:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|AUTHMETHOD [method]}}
 
|{{Code|AUTHMETHOD [method]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''Basic''
 
|''Basic''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the HTTP Authentication, which will be proposed by the [[Web Application|HTTP server]] if a client sends an unauthorized request. Allowed values are {{Code|Basic}} and {{Code|Digest}}.
+
|Specifies the default authentication method used by the [[Web Application|HTTP server]] for negotiating credentials. Allowed values are {{Code|Basic}}, {{Code|Digest}}, and {{Code|Custom}}:
 +
* If basic access is chosen, the client can still request digest authentication.
 +
* This is different for digest access, which cannot be overwritten.
 +
* With custom authentication, the server will not do any authentication.
 
|}
 
|}
  
Line 213: Line 255:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|SERVERHOST [host&#x7c;ip]}}
 
|{{Code|SERVERHOST [host&#x7c;ip]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is the host name or ip address the server is bound to. If the option is set to an empty string (which is the default), the server will be open to all clients.
+
|This is the host name or IP address the server is bound to. If the option is set to an empty string (which is the default), the server accepts requests from all clients.
 
|}
 
|}
  
Line 227: Line 269:
  
 
{| width='100%' width='100%'
 
{| width='100%' width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PROXYHOST [host]}}
 
|{{Code|PROXYHOST [host]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''  
 
|''empty''  
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is the host name of a proxy server. If the value is an empty string, it will be ignored.
+
|This is the host name of a proxy server. If the value is an empty string, it is ignored.
 
|}
 
|}
  
Line 241: Line 283:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PROXYPORT [port]}}
 
|{{Code|PROXYPORT [port]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|0}}
 
|{{Code|0}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is the port number of a proxy server. If the value is set to {{Code|0}}, it will be ignored.
+
|This is the port number of a proxy server. If the value is set to {{Code|0}}, it is ignored.
 
|}
 
|}
  
Line 255: Line 297:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|NONPROXYHOSTS [hosts]}}
 
|{{Code|NONPROXYHOSTS [hosts]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is a list of hosts that should be directly accessed. If the value is an empty string, it will be ignored.
+
|This is a list of hosts that should be directly accessed. If the value is an empty string, it is ignored.
 
|}
 
|}
  
Line 269: Line 311:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|IGNORECERT [boolean]}}
 
|{{Code|IGNORECERT [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This option can be turned on to ignore untrusted certificates when connecting to servers. Please use this option carefully.
+
|Disable SSL hostname verification and ignore untrusted certificates when connecting to servers.
 
|}
 
|}
  
Line 283: Line 325:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TIMEOUT [seconds]}}
 
|{{Code|TIMEOUT [seconds]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|30}}
 
|{{Code|30}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the maximum time a read-only transaction may take. If an operation takes longer than the specified timeout, it will be aborted. Write operations will not be affected by this timeout, as this would corrupt the integrity of the database. The timeout is deactivated if the timeout is set to {{Code|0}}. It is ignored for {{Code|ADMIN}} operations.
+
|Specifies the maximum time a transaction triggered by a client may take. An operation is interrupted when it takes longer than the specified number of seconds. Running update operations are not affected by this timeout, as this would corrupt the integrity of the database. The timeout is disabled if the timeout is set to {{Code|0}}. It is ignored for operations with [[User Management|admin permissions]].
 
|}
 
|}
  
Line 297: Line 339:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|KEEPALIVE [seconds]}}
 
|{{Code|KEEPALIVE [seconds]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|600}}
 
|{{Code|600}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the maximum time a client will be remembered by the server. If there has been no interaction with a client for a longer time than specified by this timeout, it will be disconnected. Running operations will not be affected by this option. The keepalive check is deactivated if the value is set to {{Code|0}}.
+
|Specifies the maximum time a client is remembered by the server. If there has been no interaction with a client for a longer time than specified by this timeout, it will be disconnected. Running operations will not be affected by this option. The check is disabled if the value is set to {{Code|0}}.
 
|}
 
|}
  
Line 311: Line 353:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PARALLEL [number]}}
 
|{{Code|PARALLEL [number]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|8}}
 
|{{Code|8}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Denotes the maximum allowed {{Code|number}} of parallel [[Transaction Management|transactions]].<br/>Note that a higher number of parallel operations may increase disk activity and thus slow down queries. In some cases, a single transaction may even give you better results than any parallel activity. The main reason for allowing parallel operations is to prevent slow transactions from blocking all other operations.
+
|Denotes the maximum allowed number of parallel [[Transaction Management|transactions]]:
 +
* If {{Option|FAIRLOCK}} is enabled, the number of parallel transactions will never exceed the specified value.
 +
* If the option is disabled (which is the default), the limit only applies to transactions that access databases.
 +
* The main reason for allowing parallel operations is to prevent slow transactions from blocking all other operations. A higher number of parallel operations may increase disk activity and thus slow down queries. In some cases, a single transaction may even give you better results than any parallel activity.
 
|}
 
|}
  
Line 325: Line 370:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|LOG [boolean]}}
 
|{{Code|LOG [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Turns [[Logging]] of server operations and HTTP requests on/off. This option can also be changed when running the server on [[Command-Line Options#BaseX Server|command line]] via <code>-z</code>.
 
|Turns [[Logging]] of server operations and HTTP requests on/off. This option can also be changed when running the server on [[Command-Line Options#BaseX Server|command line]] via <code>-z</code>.
Line 339: Line 384:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|LOGMSGMAXLEN [length]}}
 
|{{Code|LOGMSGMAXLEN [length]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|1000}}
 
|{{Code|1000}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Specifies the maximum length of a single [[Logging|log message]].
 
|Specifies the maximum length of a single [[Logging|log message]].
 +
|}
 +
 +
===LOGTRACE===
 +
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|LOGTRACE [boolean]}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|true}}
 +
|- valign="top"
 +
| '''Summary'''
 +
|If BaseX is running as a [[Web Application]], trace output (generated via {{Code|fn:trace}}, {{Function|Profiling|prof:dump}} and similar functions) is written to the [[Logging|database logs]]. When this option is disabled, trace output is redirected to standard error, as it is known from the standalone version of BaseX.
 
|}
 
|}
  
 
==HTTP Services==
 
==HTTP Services==
  
If BaseX is run as web servlet, the HTTP options must be specified in the <code>[https://github.com/BaseXdb/basex/tree/master/basex-api/src/main/webapp/WEB-INF webapp/WEB-INF]</code> directory and the {{Code|jetty.xml}} and {{Code|web.xml}} configuration files.
+
Most HTTP options are defined in the {{Code|jetty.xml}} and {{Code|web.xml}} configuration files in the <code>[https://github.com/BaseXdb/basex/tree/master/basex-api/src/main/webapp/WEB-INF webapp/WEB-INF]</code> directory. Some additional BaseX-specific options exist that are set before the web server is started:
  
 
===WEBPATH===
 
===WEBPATH===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|WEBPATH [path]}}
 
|{{Code|WEBPATH [path]}}
|-
+
|- valign="top"
 +
| '''Default'''
 +
|<code>[[Configuration#Database Directory|{home}/webapp]]</code>
 +
|- valign="top"
 +
| '''Summary'''
 +
|Points to the directory in which all the [[Web Application]] contents are stored, including XQuery, Script, [[RESTXQ]] and configuration files:
 +
* The option is ignored if BaseX is deployed as [[Web Application#Servlet_Container|web servlet]].
 +
* It cannot be assigned via the {{Code|web.xml}} file, as it is evaluated before the configuration files are parsed.
 +
|}
 +
 
 +
===GZIP===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|GZIP [boolean]}}
 +
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|<code>[[Configuration#Database Directory|{home}/BaseXWeb]]</code> or <code>[[Configuration#Database Directory|{home}/webapp]]</code>
+
|<code>false</code>
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Points to the directory in which all the [[Web Application]] contents are stored, including XQuery, Script, [[RESTXQ]] and configuration files. This option is ignored if BaseX is deployed as [[Web Application#Servlet_Container|web servlet]].
+
|Jetty provides a [https://www.eclipse.org/jetty/documentation/current/gzip-filter.html Gzip handler] for dynamically uncompressing requests and compressing responses. This feature can be enabled if Jetty is started via the [[Web Application|BaseX HTTP Server]]:
 +
* The option can also be enabled on [[Command-Line Options#HTTP Server|command line]] via <code>-g</code>.
 +
* It cannot be assigned via the {{Code|web.xml}} file, as the option is evaluated before the configuration files are parsed.
 +
* In addition to the [https://github.com/eclipse/jetty.project/blob/7cc552013eb4d05cb603ba0bc85d176c97957cd4/jetty-server/src/main/java/org/eclipse/jetty/server/handler/gzip/GzipHandler.java#L187-L211 Jetty defaults] (GET requests, exclusion of binaries, MSIE 6.0, etc.), POST and PUT requests are supported.
 
|}
 
|}
  
Line 371: Line 449:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|RESTXQPATH [path]}}
 
|{{Code|RESTXQPATH [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 +
| '''Summary'''
 +
|Points to the directory which contains the [[RESTXQ]] modules of a web application. Relative paths are resolved against the {{Option|WEBPATH}} directory.
 +
|}
 +
 
 +
===PARSERESTXQ===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|PARSERESTXQ}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|3}}
 +
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Points to the directory which contains the [[RESTXQ]] modules of a web application. Relative paths will be resolved against the [[#WEBPATH|WEBPATH]] directory.
+
|Timeout after which the RESTXQ directory is parsed for changes:
 +
* If {{Code|0}} is specified, the directory is parsed every time a RESTXQ function is called.
 +
* A positive value defines the idle time, in seconds, after which parsing is enforced. The default value is {{Code|3}}: Changes in the RESTXQ directory is detected after 3 seconds without RESTXQ function calls.
 +
* Monitoring is completely disabled if a negative value is specified.
 +
 
 +
See [[RESTXQ#Preliminaries|RESTXQ Preliminaries]] for more details.
 
|}
 
|}
  
===CACHERESTXQ===
+
===RESTXQERRORS===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|CACHERESTXQ}}
+
|{{Code|RESTXQERRORS}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|{{Code|false}}
+
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Caches [[RESTXQ]] modules once when starting the web application.<br/>The option is helpful in productive environments with a high load, but files should not be replaced while the web server is running.
+
|Reports parsing errors in XQuery modules in the RESTXQ directory and returns the full error message and stack trace to the client. By default, this option is enabled. In a production environment, it can be disabled to suppress errors that should not be seen by the user of an API (the full error information can still be looked up in the database logs). See [[RESTXQ#Error Handling|RESTXQ Error Handling]] for more details.
 
|}
 
|}
  
Line 399: Line 496:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|RESTPATH [path]}}
 
|{{Code|RESTPATH [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Points to the directory which contains XQuery files and command scripts, which can be evaluated via the [[REST#GET Requests|REST run operation]]. Relative paths will be resolved against the [[#WEBPATH|WEBPATH]] directory.
+
|Points to the directory which contains XQuery files and command scripts, which can be evaluated via the [[REST#GET Requests|REST run operation]]. Relative paths are resolved against the {{Option|WEBPATH}} directory.
 
|}
 
|}
  
Line 413: Line 510:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|HTTPLOCAL [boolean]}}
 
|{{Code|HTTPLOCAL [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|By default, if BaseX is run as [[Web Application]], a database server instance will be started as soon as the first HTTP service is called. The server can then be addressed by other BaseX clients in parallel to the HTTP services.<br/>If the option is set to {{Code|false}}, the database server will be disabled.
+
|By default, if BaseX is run as [[Web Application]], the database server instance is started in addition, which can then be addressed by [[Clients]] via the database port (see {{Option|PORT}}). If the option is set to {{Code|true}}, no database server is launched.
 
|}
 
|}
  
Line 427: Line 524:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|STOPPORT [port]}}
 
|{{Code|STOPPORT [port]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|{{Code|8985}}
+
|{{Code|8081}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This is the port on which the [[Startup#BaseX HTTP Server|HTTP Server]] can be locally closed:
+
|If Jetty is started from BaseX, this is the port on which the [[Startup#BaseX HTTP Server|HTTP Server]] can be closed:
 
* The listener for stopping the web server will only be started if the specified value is greater than {{Code|0}}.
 
* The listener for stopping the web server will only be started if the specified value is greater than {{Code|0}}.
 
* The option is ignored if BaseX is used as a [[Web Application]] or started via [[Web Application#Maven|Maven]].
 
* The option is ignored if BaseX is used as a [[Web Application]] or started via [[Web Application#Maven|Maven]].
Line 448: Line 545:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|MAINMEM [boolean]}}
 
|{{Code|MAINMEM [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If this option is turned on, new databases will be exclusively created in main memory. Most queries will be evaluated faster in main memory mode, but all data is lost if BaseX is shut down. The value of this option will be assigned once to a new database, and cannot be changed after that.
+
|If this option is turned on, new databases arecreated in main memory:
 +
* Most queries are evaluated faster in main-memory mode, but all data is lost if the BaseX instance in which the database was created is shut down.
 +
* It is not possible to store binary resources in a main-memory database.
 +
* A main-memory database will have no disk representation. However, it is possible to export the database via the {{Command|EXPORT}} command, and create a new database from the exported file in a second step.
 +
* This option will not be available for {{Function|Database|db:create}} because the database would not be accessible anymore after database creation, i.e., outside the query scope.
 
|}
 
|}
  
Line 462: Line 563:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ADDCACHE [boolean]}}
 
|{{Code|ADDCACHE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|If this option is activated, data structures of documents will first be cached to disk before being added to the final database. This option is helpful when larger documents need to be added, and if the existing heuristics cannot estimate the input size (e.g. when adding directories or sending input streams).
 
|If this option is activated, data structures of documents will first be cached to disk before being added to the final database. This option is helpful when larger documents need to be added, and if the existing heuristics cannot estimate the input size (e.g. when adding directories or sending input streams).
Line 478: Line 579:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|CREATEFILTER [filter]}}
 
|{{Code|CREATEFILTER [filter]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|*.xml}}
 
|{{Code|*.xml}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|File filter in the [[Commands#Glob Syntax|Glob Syntax]], which is applied whenever new databases are created, or resources are added to a database.
 
|File filter in the [[Commands#Glob Syntax|Glob Syntax]], which is applied whenever new databases are created, or resources are added to a database.
Line 492: Line 593:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ADDARCHIVES [boolean]}}
 
|{{Code|ADDARCHIVES [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|If this option is set to {{Code|true}}, files within archives (ZIP, GZIP, TAR, TGZ, DOCX, etc.) are parsed whenever new databases are created or resources are added to a database.
 
|If this option is set to {{Code|true}}, files within archives (ZIP, GZIP, TAR, TGZ, DOCX, etc.) are parsed whenever new databases are created or resources are added to a database.
Line 506: Line 607:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ARCHIVENAME [boolean]}}
 
|{{Code|ARCHIVENAME [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If this option is set to {{Code|true}}, the file name of parsed archives will be included in the document paths.
+
|If this option is set to {{Code|true}}, the file name of parsed archives is included in the document paths.
 
|}
 
|}
  
Line 520: Line 621:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|SKIPCORRUPT [boolean]}}
 
|{{Code|SKIPCORRUPT [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Skips corrupt (i.e., not well-formed) files while creating a database or adding new documents. If this option is activated, document updates are slowed down, as all files will be parsed twice. Next, main memory consumption will be higher as parsed files will be cached in main memory.
+
|Skips corrupt (i.e., not well-formed) files while creating a database or adding new documents. If this option is activated, document updates are slowed down, as all files are parsed twice. Next, main-memory consumption is higher, as parsed files are cached in main memory.
 
|}
 
|}
  
Line 534: Line 635:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ADDRAW [boolean]}}
 
|{{Code|ADDRAW [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If this option is activated, and if new resources are added to a database, all files that are not filtered by the [[#CREATEFILTER|CREATEFILTER]] option will be added as ''raw'' files (i.e., in their binary representation).
+
|If this option is enabled, all resources that are filtered out by the {{Option|CREATEFILTER}} option while being added to a database are stored as [[Binary Data|raw files]] instead (i.e., in their binary representation).
 
|}
 
|}
  
Line 548: Line 649:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|PARSER [type]}}
 
|{{Code|PARSER [type]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|XML}}
 
|{{Code|XML}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Defines a [[Parsers|parser]] for importing new files to the database. Currently, 'XML', 'JSON', 'CSV', 'TEXT', 'HTML' are available as parsers. HTML will be parsed as normal XML files if [http://home.ccil.org/~cowan/XML/tagsoup/ Tagsoup] is not found in the classpath.
+
|Defines a [[Parsers|parser]] for importing new files to the database. Available parsers are {{Code|XML}}, {{Code|JSON}}, {{Code|CSV}}, {{Code|TEXT}}, {{Code|HTML}}, and {{Code|RAW}}. HTML input is parsed as XML if [[Parsers#HTMLParser|Tagsoup]] is not found in the class path.
 
|}
 
|}
  
Line 562: Line 663:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|CSVPARSER [options]}}
 
|{{Code|CSVPARSER [options]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the way how CSV data will be parsed. The available options are listed in the [[CSV Module#Options|CSV Module]].
+
|Specifies the way how CSV data is parsed. Keys and values are delimited with <code>=</code>, and multiple options are delimited with <code>,</code>. The available options (except for the additional <code>encoding</code> option) are described in the [[CSV Module#Options|CSV Module]].
 +
|- valign="top"
 +
| '''Examples'''
 +
|<code>encoding=CP1252,header=true</code> parses the input as CP1252 and the first line as header.
 
|}
 
|}
  
Line 576: Line 680:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|JSONPARSER [options]}}
 
|{{Code|JSONPARSER [options]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the way how JSON data will be parsed. The available options are listed in the [[JSON Module#Options|JSON Module]].
+
|Specifies the way how JSON data is parsed. Keys and values are delimited with <code>=</code>, and multiple options are delimited with <code>,</code>. The available options (except for the additional <code>encoding</code> option) are described in the [[JSON Module#Options|JSON Module]].
 +
|- valign="top"
 +
| '''Examples'''
 +
|<code>format=jsonml,lax=yes</code> interprets the input as JSONML and uses lax parsing.
 
|}
 
|}
  
Line 590: Line 697:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|HTMLPARSER [options]}}
 
|{{Code|HTMLPARSER [options]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the way how HTML data will be parsed. Available options are listed in the [[Parsers]] article.
+
|Specifies the way how HTML data is parsed. Keys and values are delimited with <code>=</code>, and multiple options are delimited with <code>,</code>. The available options are described in the [[Parsers#Options|Parsers]] article.
|}
+
|- valign="top"
 
+
| '''Examples'''
===TEXTPARSER===
+
|
 
+
* <code>encoding=Shift-JIS,nons=true</code> parses the input as Shift-JIS and suppresses namespaces.
{| width='100%'
+
* <code>lexical=true</code> preserves comments.
|-
 
| width='120' | '''Signature'''
 
|{{Code|TEXTPARSER [options]}}
 
|-
 
| '''Default'''
 
|''empty''
 
|-
 
| '''Summary'''
 
|Specifies the way how TEXT data will be parsed. Available options are listed in the [[Parsers]] article.
 
 
|}
 
|}
  
 
==XML Parsing==
 
==XML Parsing==
  
===CHOP===
+
===STRIPWS===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|CHOP [boolean]}}
+
|{{Code|STRIPWS [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|{{Code|true}}
+
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Many XML documents include whitespaces that have been added to improve readability. The {{Code|CHOP}} option controls the [http://www.w3.org/TR/REC-xml/#sec-white-space white-space processing mode] of the XML parser:
+
|Many XML documents include whitespaces that have been added to improve readability. This option controls the [https://www.w3.org/TR/REC-xml/#sec-white-space white-space processing mode] of the XML parser:
* By default, this option is set to {{Code|true}}. This way, leading and trailing whitespaces from text nodes will be chopped and all empty text nodes will be discarded.
+
* If the option is set to {{Code|true}}, leading and trailing whitespaces from text nodes are stripped and empty text nodes are discarded.
* The flag should be turned off if a document contains [[Full-Text#Mixed Content|mixed content]].
+
* The flag should be disabled if a document contains [[Full-Text#Mixed Content|mixed content]].
* The flag can also be turned off on [[Command-Line Options#BaseX Standalone|command line]] via <code>-w</code>.
+
* The flag can also be turned on via the [[Command-Line Options#BaseX Standalone|command line]] and <code>-w</code>.
* If the <code>xml:space="preserve"</code> attribute is attached to an element, chopping will be turned off for all descendant text nodes. In the following example document, the whitespaces in the text nodes of the {{Code|text}} element will not be chopped:
+
* If the option is enabled, whitespaces of an element and its descendants can locally be preserved with the <code>xml:space="preserve"</code> attribute:
<pre class="brush:xml">
+
 
 +
<pre lang="xml">
 
<xml>
 
<xml>
 
   <title>
 
   <title>
     Demonstrating the CHOP flag
+
     Demonstrating the STRIPWS flag
 
   </title>
 
   </title>
 
   <text xml:space="preserve">To <b>be</b>, or not to <b>be</b>, that is the question.</text>
 
   <text xml:space="preserve">To <b>be</b>, or not to <b>be</b>, that is the question.</text>
 
</xml>
 
</xml>
 
</pre>
 
</pre>
 +
If whitespaces are stripped, <code>indent=yes</code> can be assigned to the {{Option|SERIALIZER}} option to get properly indented XML output.<br/>
 +
See [[BaseX_10#Whitespaces|BaseX 10: Whitespaces]] for more information on whitespace handling.
 
|}
 
|}
  
Line 646: Line 747:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|STRIPNS [boolean]}}
 
|{{Code|STRIPNS [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Strips all namespaces from an XML document and all elements while parsing.
+
|Strips all namespaces from an XML document while parsing.
 
|}
 
|}
  
Line 660: Line 761:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|INTPARSE [boolean]}}
 
|{{Code|INTPARSE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Uses the internal XML parser instead of the standard Java XML parser. The internal parser is faster, more fault tolerant and supports common HTML entities out-of-the-box, but it does not support all features needed for parsing DTDs.
+
|Uses the internal XML parser instead of the standard Java XML parser. Reasons for using the internal parser are:
 +
* Performance: Documents, in particular small ones, are parsed faster.
 +
* Entity support: More than 2000 [https://html.spec.whatwg.org/multipage/named-characters.html HTML character references] are decoded.
 +
* Fault tolerance: Invalid characters are replaced with the Unicode replacement character <code>FFFD</code> (&#xFFFD;).
 +
By default, Java’s Xerces parser is used as it supports all features of the XML standard, including {{Option|XINCLUDE}}, or recursive entity expansion if specified in DTDs.
 
|}
 
|}
  
Line 674: Line 779:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|DTD [boolean]}}
 
|{{Code|DTD [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Parses referenced DTDs and resolves XML entities. By default, this option is switched to {{Code|false}}, as many DTDs are located externally, which may completely block the process of creating new databases. The [[#CATFILE|CATFILE]] option can be changed to locally resolve DTDs.
+
|Parses referenced DTDs and resolves XML entities. By default, this option is switched to {{Code|false}}, as many DTDs are located externally, which may completely block the process of creating new databases. The {{Option|CATALOG}} option can be changed to locally resolve DTDs.
 
|}
 
|}
  
Line 688: Line 793:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|XINCLUDE [boolean]}}
 
|{{Code|XINCLUDE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Resolves XInclude inclusion tags and merges referenced XML documents. By default, this option is switched to {{Code|true}}. This option is only available if the standard Java XML Parser is used (see [[#INTPARSE|INTPARSE]]).
+
|Resolves XInclude inclusion tags and merges referenced XML documents. By default, this option is switched to {{Code|true}}. This option is only available if the standard Java XML Parser is used (see {{Option|INTPARSE}}).
 
|}
 
|}
  
===CATFILE===
+
===CATALOG===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|CATFILE [path]}}
+
|{{Code|CATALOG [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies a catalog file to locally resolve DTDs. See the entry on [[Catalog Resolver]]s for more details.
+
|Semicolon-separated list of XML catalog files to resolve URIs. See [[Catalog Resolver]]s for more details.
 
|}
 
|}
  
 
==Indexing==
 
==Indexing==
  
The following options control the creation of index structures. The current values will be considered if a new database is created. See [[Indexes]] for more details.
+
The following options control the creation of index structures. The current values are considered if a new database is created. See [[Indexes]] for more details.
  
 
===TEXTINDEX===
 
===TEXTINDEX===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TEXTINDEX [boolean]}}
 
|{{Code|TEXTINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Creates a text index whenever a new database is created. A text index speeds up queries with equality comparisons on text nodes. See [[Index#Text Index|Text Index]] for more details.
 
|Creates a text index whenever a new database is created. A text index speeds up queries with equality comparisons on text nodes. See [[Index#Text Index|Text Index]] for more details.
Line 734: Line 839:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ATTRINDEX [boolean]}}
 
|{{Code|ATTRINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Creates an attribute index whenever a new database is created. An attribute index speeds up queries with equality comparisons on attribute values. See [[Index#Attribute Index|Attribute Index]] for more details.
 
|Creates an attribute index whenever a new database is created. An attribute index speeds up queries with equality comparisons on attribute values. See [[Index#Attribute Index|Attribute Index]] for more details.
Line 746: Line 851:
  
 
===TOKENINDEX===
 
===TOKENINDEX===
 
{{Mark|Introduced with Version 8.4:}}
 
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TOKENINDEX [boolean]}}
 
|{{Code|TOKENINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Creates a token index whenever a new database is created. A token index speeds up searches for single tokens in attribute values. See [[Index#Token Index|Token Index]] for more details.
 
|Creates a token index whenever a new database is created. A token index speeds up searches for single tokens in attribute values. See [[Index#Token Index|Token Index]] for more details.
Line 764: Line 867:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|FTINDEX [boolean]}}
 
|{{Code|FTINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Creates a full-text index whenever a new database is created. A full-text index speeds up queries with full-text expressions. See [[Index#Full-Text Index|Full-Text Index]] for more details.
 
|Creates a full-text index whenever a new database is created. A full-text index speeds up queries with full-text expressions. See [[Index#Full-Text Index|Full-Text Index]] for more details.
Line 778: Line 881:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TEXTINCLUDE [names]}}
 
|{{Code|TEXTINCLUDE [names]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes will be indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
+
|Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes are indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
 
|}
 
|}
  
Line 792: Line 895:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|ATTRINCLUDE [names]}}
 
|{{Code|ATTRINCLUDE [names]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Defines name patterns for the attributes to be indexed. By default, all attribute nodes will be indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
+
|Defines name patterns for the attributes to be indexed. By default, all attribute nodes are indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
 
|}
 
|}
  
 
===TOKENINCLUDE===
 
===TOKENINCLUDE===
 
{{Mark|Introduced with Version 8.4:}}
 
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TOKENINCLUDE [names]}}
 
|{{Code|TOKENINCLUDE [names]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Defines name patterns for the attributes to be indexed. By default, tokens in all attribute nodes will be indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
+
|Defines name patterns for the attributes to be indexed. By default, tokens in all attribute nodes are indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
 
|}
 
|}
  
Line 822: Line 923:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|FTINCLUDE [names]}}
 
|{{Code|FTINCLUDE [names]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes will be indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
+
|Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes are indexed.<br/>Name patterns are separated by commas. See [[Indexes#Selective Indexing|Selective Indexing]] for more details.
 
|}
 
|}
  
Line 836: Line 937:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|MAXLEN [int]}}
 
|{{Code|MAXLEN [int]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|96}}
 
|{{Code|96}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the maximum length of strings that are to be indexed by the name, path, value, and full-text index structures. The value of this option will be assigned once to a new database, and cannot be changed after that.
+
|Specifies the maximum length for strings to be stored in [[Indexes|index structures]]. The value of this option is assigned once to a new database, and can only be changed by creating a new database or doing a [[Commands#OPTIMIZE|full optimization]].
 
|}
 
|}
  
Line 850: Line 951:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|MAXCATS [int]}}
 
|{{Code|MAXCATS [int]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|100}}
 
|{{Code|100}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the maximum number of distinct values (categories) that will be stored together with the element/attribute names or unique paths in the [[Index#Name Index|Name Index]] or [[Index#Path Index|Path Index]]. The value of this option will be assigned once to a new database, and cannot be changed after that.
+
|Specifies the maximum number of distinct values (categories) that are stored together with the element/attribute names or unique paths in the [[Index#Name Index|Name Index]] or [[Index#Path Index|Path Index]]. The value of this option is assigned once to a new database, and cannot be changed after that.
 
|}
 
|}
  
Line 864: Line 965:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|UPDINDEX [boolean]}}
 
|{{Code|UPDINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If turned on, incremental indexing will be enabled:
+
|If turned on, incremental indexing is enabled:
* The current value of this option will be assigned to new databases. It can be changed for existing databases by running [[Commands#OPTIMIZE|OPTIMIZE ALL]] or [[Database_Module#db:optimize|db:optimize($db, true())]].
+
* The current value of this option is assigned to new databases. It can be changed for existing databases by running {{Command|OPTIMIZE}} with the {{Code|ALL}} keyword or {{Function|Database|db:optimize}} and {{Code|true()}} as second argument.
* After each update, the value indexes will be refreshed as well. Incremental updates are currently not available for the full-text index and database statistics.
+
* After each update, the value indexes are refreshed as well. Incremental updates are currently not available for the full-text index and database statistics.
 
* Find more details in the article on [[Index#Updates|Index Structures]].
 
* Find more details in the article on [[Index#Updates|Index Structures]].
 
|}
 
|}
Line 881: Line 982:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|AUTOOPTIMIZE [boolean]}}
 
|{{Code|AUTOOPTIMIZE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If turned on, auto optimization will be applied to new databases:
+
|If turned on, auto optimization is applied to new databases:
* With each update, outdated indexes and database statistics will be recreated.
+
* With each update, outdated indexes and database statistics are recreated.
 
* As a result, the index structures will always be up-to-date.
 
* As a result, the index structures will always be up-to-date.
 
* However, updates can take much longer, so this option should only be activated for medium-sized databases.
 
* However, updates can take much longer, so this option should only be activated for medium-sized databases.
* The value of this option will be assigned once to a new database. It can be reassigned by running [[Commands#OPTIMIZE|OPTIMIZE]] or [[Database_Module#db:optimize|db:optimize]].
+
* The value of this option is assigned once to a new database. It can be reassigned by running {{Command|OPTIMIZE}} or {{Function|Database|db:optimize}}.
 
|}
 
|}
  
===INDEXSPLITSIZE===
+
===SPLITSIZE===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|INDEXSPLITSIZE [num]}}
+
|{{Code|SPLITSIZE [num]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|0}}
 
|{{Code|0}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This option affects the [[Indexes#Index Construction|construction]] of new text and attribute indexes. It specifies the number of index build operations that are performed before writing partial index data to disk. By default, if the value is set to 0, some dynamic split heuristics are applied. By setting the value to its maximum (2147483647), the index will never be split.
+
|This option affects the [[Indexes#Performance|construction]] of new value indexes. It controls the number of index build operations that are performed before writing partial index data to disk:
 +
* By default, if the value is set to {{Code|0}}, some heuristics are applied, based on the current memory consumption. Usually, this works fine.
 +
* If explicit garbage collection is disabled when running Java (e.g. via the JVM option {{Code|-XX:+DisableExplicitGC}}), you may need to choose a custom split size.
 +
* You can e.g. start with {{Code|1000000}} (one million) index operations and adjust this value in the next steps.
 +
* The larger the assigned value is, the fewer splits take place, and main-memory consumption increases.
 
|}
 
|}
  
===FTINDEXSPLITSIZE===
+
==Full-Text Indexing==
 
 
{| width='100%'
 
|-
 
| width='120' | '''Signature'''
 
|{{Code|FTINDEXSPLITSIZE [num]}}
 
|-
 
| '''Default'''
 
|{{Code|0}}
 
|-
 
| '''Summary'''
 
|This option affects the [[Indexes#Index Construction|construction]] of new full-text indexes. It specifies the number of index build operations that are performed before writing partial index data to disk. By default, if the value is set to 0, some dynamic split heuristics are applied. By setting the value to its maximum (2147483647), the index will never be split.
 
|}
 
 
 
==Full-Text==
 
  
 
===STEMMING===
 
===STEMMING===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|STEMMING [boolean]}}
 
|{{Code|STEMMING [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If {{Code|true}}, all tokens will be stemmed during full-text indexing, using a language-specific stemmer implementation. By default, tokens will not be stemmed. See [[Full-Text#Indexes#Full-Text Index|Indexes#Full-Text Index]] for more details.
+
|If {{Code|true}}, all tokens are stemmed during full-text indexing, using a language-specific stemmer implementation. By default, tokens will not be stemmed. See [[Indexes#Full-Text Index|Full-Text Index]] for more details.
 
|}
 
|}
  
Line 946: Line 1,037:
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|CASESENS [boolean]}}
 
|{{Code|CASESENS [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If {{Code|true}}, the case of tokens will be preserved during full-text indexing. By default, case will be ignored (all tokens will be indexed in lower case). See [[Full-Text#Indexes#Full-Text Index|Indexes#Full-Text Index]] for more details.
+
|If {{Code|true}}, the case of tokens is preserved during full-text indexing. By default, case is ignored (all tokens are indexed in lower case). See [[Indexes#Full-Text Index|Full-Text Index]] for more details.
 
|}
 
|}
  
Line 957: Line 1,048:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|DIACRITICS [boolean]}}
 
|{{Code|DIACRITICS [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If set to {{Code|true}}, diacritics will be preserved during full-text indexing. By default, diacritics will be removed. See [[Full-Text#Indexes#Full-Text Index|Indexes#Full-Text Index]] for more details.
+
|If set to {{Code|true}}, diacritics are preserved during full-text indexing. By default, diacritics are removed. See [[Indexes#Full-Text Index|Full-Text Index]] for more details.
 
|}
 
|}
  
Line 971: Line 1,062:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|LANGUAGE [lang]}}
 
|{{Code|LANGUAGE [lang]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|en}}
 
|{{Code|en}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|The specified language will influence the way how an input text will be tokenized. This option is mainly important if tokens are to be stemmed, or if the tokenization of a language differs from Western languages. See [[Full-Text#Indexes#Full-Text Index|Indexes#Full-Text Index]] for more details.
+
|The specified language will influence the way how texts are tokenized and stemmed. It can be the name of a language or a language code. See [[Indexes#Full-Text Index|Full-Text Index]] for more details.
 
|}
 
|}
  
Line 985: Line 1,076:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|STOPWORDS [path]}}
 
|{{Code|STOPWORDS [path]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|A new full-text index will drop tokens that are listed in the specified stopword list. A stopword list may decrease the size of the full text index. See [[Full-Text#Indexes#Full-Text Index|Indexes#Full-Text Index]] for more details.
+
|If a text file with stop words is specified, frequently used terms contained in that file are ignored when a full-text index is created. A stop word list may decrease the size of the full-text index and speed up your queries. See [[Indexes#Full-Text Index|Full-Text Index]] for more details.
 
|}
 
|}
  
Line 1,001: Line 1,092:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|QUERYINFO [boolean]}}
 
|{{Code|QUERYINFO [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Prints more information on internal query rewritings, optimizations, and performance. By default, this info is shown in the [[GUI#Visualizations|Info View]] in the GUI. It can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-V</code>.  
 
|Prints more information on internal query rewritings, optimizations, and performance. By default, this info is shown in the [[GUI#Visualizations|Info View]] in the GUI. It can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-V</code>.  
|}
 
 
===XQUERY3===
 
 
{| width='100%'
 
|-
 
| width='120' | '''Signature'''
 
|{{Code|XQUERY3}}
 
|-
 
| '''Default'''
 
|{{Code|true}}
 
|-
 
| '''Summary'''
 
|Enables all [[XQuery 3.0]] features supported by BaseX. If this option is set to {{Code|false}}, the XQuery parser will only accept expressions of the XQuery 1.0 specification.
 
 
|}
 
|}
  
Line 1,029: Line 1,106:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|MIXUPDATES}}
 
|{{Code|MIXUPDATES}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Allows queries to both contain updating and non-updating expressions. All updating constraints will be turned off, and nodes to be returned will be copied before they are modified by an updating expression. By default, this option is set to {{Code|false}}, because the XQuery Update Facility does not allow an updating query to [[XQuery Update#Returning Results|return results]].
+
|Allows queries to both contain updating and non-updating expressions. All updating constraints are turned off, and nodes to be returned are copied before they are modified by an updating expression. By default, in compliance with the XQuery Update Facility, this option is set to {{Code|false}}. See [[XQuery Update#Returning Results|Returning Results]] for more details.
 
|}
 
|}
  
 
===BINDINGS===
 
===BINDINGS===
 +
 +
{{Announce|Updated with Version 11}}: The Clark notation was replaced with the [[XQuery 3.0#Expanded QNames|Expanded QNames]] notation.
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|BINDINGS [vars]}}
 
|{{Code|BINDINGS [vars]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Contains external variables to be bound to a query. The string must comply with the following rules:
 
|Contains external variables to be bound to a query. The string must comply with the following rules:
Line 1,056: Line 1,135:
 
* Commas in values must be duplicated.
 
* Commas in values must be duplicated.
 
* Variables may optionally be introduced with a leading dollar sign.
 
* Variables may optionally be introduced with a leading dollar sign.
* If a variable uses a namespace different to the default namespace, it can be specified with the [http://www.jclark.com/xml/xmlns.htm Clark Notation] or [http://www.w3.org/TR/xquery-30/#id-basics Expanded QName Notation].
+
* If a variable uses a namespace different from the default namespace, it can be specified with the [[XQuery 3.0#Expanded QNames|Expanded QNames]] notation.
 
This option can also be used on [[Command-Line Options#BaseX Standalone|command line]] with the flag <code>-b</code>.
 
This option can also be used on [[Command-Line Options#BaseX Standalone|command line]] with the flag <code>-b</code>.
|-
+
|- valign="top"
 
| '''Examples'''
 
| '''Examples'''
 
|
 
|
Line 1,064: Line 1,143:
 
* <code>a=1,,2</code> &nbsp; binds the value {{Code|1,2}} to the variable $a
 
* <code>a=1,,2</code> &nbsp; binds the value {{Code|1,2}} to the variable $a
 
* <code>{URI}a=x</code> &nbsp; binds the value {{Code|x}} to the variable $a with the namespace {{Code|URI}}.
 
* <code>{URI}a=x</code> &nbsp; binds the value {{Code|x}} to the variable $a with the namespace {{Code|URI}}.
* In the following [[Commands#Command_Scripts| Command Script]], the value {{Code|hello world!}} is bound to the variable $GREETING:
+
* In the following [[Commands#Command_Scripts| Command Script]], the value {{Code|hello world!}} is bound to the variable {{Code|$GREETING}}:
<pre class="brush:xml">
+
<pre lang='xquery'>
 
SET BINDINGS GREETING="hello world!"
 
SET BINDINGS GREETING="hello world!"
 
XQUERY declare variable $GREETING external; $GREETING
 
XQUERY declare variable $GREETING external; $GREETING
Line 1,071: Line 1,150:
 
|}
 
|}
  
===QUERYPATH===
+
===INLINELIMIT===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|INLINELIMIT}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|50}}
 +
|- valign="top"
 +
| '''Summary'''
 +
|This option controls inlining of XQuery functions:
 +
* The XQuery compiler inlines functions to speed up query evaluation.
 +
* Inlining will only take place if a function body is not too large (i.e., if it does not contain too many expressions).
 +
* With this option, this maximum number of expressions can be specified.
 +
* Function inlining can be turned off by setting the value to {{Code|0}}.
 +
* The limit can be locally overwritten via the [[XQuery Extensions#Function Inlining|%basex:inline]] annotation (follow the link to get more information on function inlining).
 +
|}
 +
 
 +
===UNROLLLIMIT===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|UNROLLLIMIT}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|5}}
 +
|- valign="top"
 +
| '''Summary'''
 +
|This option controls the unroll limit:
 +
* Loops with few iterations are ''unrolled'' by the XQuery compiler to enable further optimizations.
 +
* If the limit is increased, more optimizations will take place, but the memory consumption and compile time will increase.
 +
* See [[XQuery Optimizations#Loop Unrolling|Loop Unrolling]] for more details.
 +
|}
 +
 
 +
===ENFORCEINDEX===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|QUERYPATH [path]}}
+
|{{Code|ENFORCEINDEX [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|''empty''
+
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Contains the path (''base URI'') to the executed query (default: ''empty''). This directory will be used to resolve relative paths to documents, query modules, and other resources addressed in a query.
+
|Enforces index rewritings in path expressions. See [[Indexes#Enforce Rewritings|Enforce Rewritings]] for details.
 
|}
 
|}
  
===INLINELIMIT===
+
===COPYNODE===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|INLINELIMIT}}
+
|{{Code|COPYNODE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
|{{Code|100}}
+
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|The XQuery compiler inlines functions to speed up query evaluation. Inlining will only take place if a function body is not too large (i.e., if it does not contain too many expressions). With this option, this maximum number of expressions can be specified.<br/>Function inlining can be turned off by setting the value to {{Code|0}}. The limit can be locally overridden via the <code>[[XQuery_3.0#Annotations|%basex:inline]]</code> annotation.
+
|When creating new nodes in XQuery via [https://www.w3.org/TR/xquery-31/#id-constructors Node Constructors], copies of all enclosed nodes are created, and the copied nodes get new node identities. As a result, the following query yields <code>false</code>:
 +
<pre lang='xquery'>
 +
let $a := <a/>
 +
let $b := <b>{ $a }</b>
 +
return $b/a is $a
 +
</pre>
 +
This step can be expensive and memory consuming. If the option is disabled, child nodes will only be linked to the new parent nodes, and the upper query returns <code>true</code>.<br/>The option should be used carefully as it changes the semantics of XQuery. It should preferably be used in [[XQuery Extensions#Database Pragmas|Pragmas]].
 
|}
 
|}
  
Line 1,102: Line 1,223:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|TAILCALLS}}
 
|{{Code|TAILCALLS}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|256}}
 
|{{Code|256}}
|-
+
|- valign="top"
 +
| '''Summary'''
 +
|Specifies how many stack frames of [https://en.wikipedia.org/wiki/Tail_call tail-calls] are allowed on the stack at any time. When this limit is reached, tail-call optimization takes place and some call frames are eliminated. The feature can be turned off by setting the value to {{Code|-1}}.
 +
|}
 +
 
 +
===WITHDB===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|WITHDB}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|true}}
 +
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies how many stack frames of [http://en.wikipedia.org/wiki/Tail_call tail-calls] are allowed on the stack at any time. When this limit is reached, tail-call optimization takes place and some call frames are eliminated. The feature can be turned off by setting the value to {{Code|-1}}.
+
|By default, resources specified via {{Code|fn:doc}} and {{Code|fn:collection}} are looked up both in the database and in the file system. If you always use {{Function|Database|db:get}} to access databases, it is recommendable to disable this option:
 +
* No locks are created for the two functions (see [[Transaction Management#Limitations|limitations of database locking]] for more details).
 +
* Access to local and external resources is faster, as the database lookup is skipped.
 
|}
 
|}
  
Line 1,116: Line 1,253:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|DEFAULTDB}}
 
|{{Code|DEFAULTDB}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If this option is turned on, paths specified in the {{Code|fn:doc}} and {{Code|fn:collections}} functions will first be resolved against a database that has been opened in the global context outside the query (e.g. by the [[Commands#OPEN|OPEN]] command). If the path does not match any existing resources, it will be resolved as described in the article on [[Databases#Access Resources|accessing database resources]].
+
|If this option is turned on, paths specified in the {{Code|fn:doc}} and {{Code|fn:collection}} functions will first be resolved against a database that has been opened in the global context outside the query (e.g. by the {{Command|OPEN}} command). If the path does not match any existing resources, it is resolved as described in the article on [[Databases#Access Resources|accessing database resources]].
 
|}
 
|}
  
Line 1,130: Line 1,267:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|FORCECREATE [boolean]}}
 
|{{Code|FORCECREATE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|By activating this option, the XQuery {{Code|doc()}} and {{Code|collection()}} functions will create database instances for the addressed input files.
+
|By enabling this option, database instances are created if the XQuery functions {{Code|fn:doc}} and {{Code|fn:collection}} are called.
 
|}
 
|}
  
Line 1,144: Line 1,281:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|CHECKSTRINGS [boolean]}}
 
|{{Code|CHECKSTRINGS [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 +
| '''Summary'''
 +
|By default, characters from external sources that are invalid in XML will trigger an error. If the option is set to <code>false</code>, these characters are replaced with the Unicode replacement character <code>FFFD</code> (&#xFFFD;). The option affects [[Java Bindings]] and string conversion and input functions such as {{Function|Archive|archive:create}}, {{Function|Archive|archive:extract-text}}, and {{Function|Archive|archive:update}}.
 +
|}
 +
 
 +
===WRAPJAVA===
 +
 
 +
{| width='100%'
 +
|- valign="top"
 +
| width='120' | '''Signature'''
 +
|{{Code|WRAPJAVA [mode]}}
 +
|- valign="top"
 +
| '''Default'''
 +
|{{Code|some}}
 +
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|If this option is turned off, strings from external sources will be adopted as is, i. e., without being checked for valid XML characters:
+
|Defines the way how values that result from Java code invocation are converted to XQuery items. Allowed values: {{Code|none}}, {{Code|all}}, {{Code|some}}, {{Code|instance}}, and {{Code|void}}. See [[Java_Bindings#Conversion to XQuery|Java Bindings]] for further details.
* This option affects [[Java Bindings]] and the string conversion and input functions [[Archive Module#archive:create|archive:create]], [[Archive Module#archive:extract-text|archive:extract-text]], [[Archive Module#archive:update|archive:update]], [[Conversion Module#convert:binary-to-string|convert:binary-to-string]], [[Fetch Module#fetch:text|fetch:text]], [[File Module#file:read-text|file:read-text]], and [[ZIP Module#zip:text-entry|zip:text-entry]].
 
* Please be aware that an inconsiderate use of this option may cause unexpected behavior when storing or outputting strings.
 
 
|}
 
|}
  
Line 1,160: Line 1,309:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|LSERROR [error]}}
 
|{{Code|LSERROR [error]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|0}}
 
|{{Code|0}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|This option specifies the maximum Levenshtein error for the BaseX-specific fuzzy match option. See the page on [[Full-Text#Fuzzy_Querying|Full-Texts]] for more information on fuzzy querying.
+
|This option specifies the maximum Levenshtein error for fuzzy full-text matching. By default, if {{Code|0}} is assigned, the error value is calculated dynamically. See [[Full-Text#Fuzzy_Querying|Fuzzy Querying]] for more details.
 
|}
 
|}
  
Line 1,174: Line 1,323:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|RUNQUERY [boolean]}}
 
|{{Code|RUNQUERY [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies if a query will be executed or parsed only. This option can also be changed on [[Command-Line Options#BaseX Standalone|command line]] via <code>-R</code>.
+
|Specifies if a query is executed or parsed only. This option can also be changed on [[Command-Line Options#BaseX Standalone|command line]] via <code>-R</code>.
 
|}
 
|}
  
Line 1,188: Line 1,337:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|RUNS [num]}}
 
|{{Code|RUNS [num]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|1}}
 
|{{Code|1}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies how often a query will be evaluated. The result is serialized only once, and the measured times are averages of all runs. This option can also be changed on [[Command-Line Options#BaseX Standalone|command line]] via <code>-r</code>.
+
|Specifies how often a query is evaluated. The result is serialized only once, and the measured times are averages of all runs. This option can also be changed on [[Command-Line Options#BaseX Standalone|command line]] via <code>-r</code>.
 
|}
 
|}
  
Line 1,204: Line 1,353:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|SERIALIZE [boolean]}}
 
|{{Code|SERIALIZE [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Results of XQuery expressions will be serialized if this option is turned on. For debugging purposes and performance measurements, this option can be set to {{Code|false}}. It can also be turned off on [[Command-Line Options#BaseX Standalone|command line]] via <code>-z</code>.  
+
|Results of XQuery expressions are serialized if this option is turned on. For debugging purposes and performance measurements, this option can be set to {{Code|false}}. It can also be turned off on [[Command-Line Options#BaseX Standalone|command line]] via <code>-z</code>.  
 
|}
 
|}
  
Line 1,218: Line 1,367:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|SERIALIZER [params]}}
 
|{{Code|SERIALIZER [params]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Parameters for [[Serialization|serializing]] query results. The string must comply with the following rules:
+
|Parameters for [[Serialization|serializing]] query results:
* Variable names and values must be separated by equality signs.
+
* Variable names and values are separated by equality signs.
* Multiple variables must be delimited by commas.
+
* Multiple variables are delimited by commas.
* Commas in values must be duplicated.
+
* Commas must be duplicated if they appear as literals in values.
 
The option can also be used on [[Command-Line Options#BaseX Standalone|command line]] with the flag <code>-s</code>.
 
The option can also be used on [[Command-Line Options#BaseX Standalone|command line]] with the flag <code>-s</code>.
|-
+
|- valign="top"
 
| '''Examples'''
 
| '''Examples'''
 
|
 
|
 +
* <code>indent=yes</code> : enables automatic indentation of XML nodes. This is recommended if whitespaces have been stripped from a document (see {{Options|STRIPWS}}).
 
* <code>encoding=US-ASCII,omit-xml-declaration=no</code> : sets the encoding to {{Code|US-ASCII}} and prints the XML declaration.
 
* <code>encoding=US-ASCII,omit-xml-declaration=no</code> : sets the encoding to {{Code|US-ASCII}} and prints the XML declaration.
 
* <code>item-separator=,,</code> : separates serialized items by a single comma.
 
* <code>item-separator=,,</code> : separates serialized items by a single comma.
Line 1,241: Line 1,391:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|EXPORTER [params]}}
 
|{{Code|EXPORTER [params]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|''empty''
 
|''empty''
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Contains parameters for exporting all resources of a database. Keys and values are separated by equality signs, multiple parameters are delimited by commas. See [[Serialization]] for more details.
+
|Contains parameters for exporting resources of a database and writing files after updates via the {{Option|WRITEBACK}} option. Keys and values are separated by equality signs, multiple parameters are delimited by commas. See [[Serialization]] for more details.
 +
|- valign="top"
 +
| '''Examples'''
 +
|
 +
* <code>indent=no,omit-xml-declaration=no</code> : disables automatic indentation of XML nodes, outputs the XML declaration.
 
|}
 
|}
  
Line 1,255: Line 1,409:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|XMLPLAN [boolean]}}
 
|{{Code|XMLPLAN [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
 
|Prints the execution plan of an XQuery expression in its XML representation. This option can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-x</code>.  
 
|Prints the execution plan of an XQuery expression in its XML representation. This option can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-x</code>.  
 
|}
 
|}
  
===COMPPLAN===
+
===FULLPLAN===
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
|{{Code|COMPPLAN [boolean]}}
+
|{{Code|FULLPLAN [boolean]}}
|-
+
|- valign="top"
| '''Default'''
 
|{{Code|true}}
 
|-
 
| '''Summary'''
 
|Generates the query plan, which can be activated via [[#XMLPLAN|XMLPLAN], before or after query compilation. This option can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-X</code>.
 
|}
 
 
 
===DOTPLAN===
 
 
 
{| width='100%'
 
|-
 
| width='120' | '''Signature'''
 
|{{Code|DOTPLAN [boolean]}}
 
|-
 
| '''Default'''
 
|{{Code|false}}
 
|-
 
| '''Summary'''
 
|Saves the query plan of an XQuery expression as {{Code|.dot}} file in the current working directory, using the [https://en.wikipedia.org/wiki/DOT_(graph_description_language) graph description language]. The output file can e.g. be visualized with [http://www.graphviz.org Graphviz].
 
|}
 
 
 
===DOTCOMPACT===
 
 
 
{| width='100%'
 
|-
 
| width='120' | '''Signature'''
 
|{{Code|DOTCOMPACT [boolean]}}
 
|-
 
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Chooses a compact dot representation.
+
|Attaches the file path, line and column of the expressions in the original query string to the query plan. Values (items and sequences) have no input information attached.
 
|}
 
|}
  
Line 1,313: Line 1,439:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|AUTOFLUSH [boolean]}}
 
|{{Code|AUTOFLUSH [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|true}}
 
|{{Code|true}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Flushes database buffers to disk after each update. If this option is set to {{Code|false}}, bulk operations (multiple single updates) will be evaluated faster. As a drawback, the chance of data loss increases if the database is not explicitly flushed via the [[Commands#FLUSH|FLUSH]] command.
+
|Flushes database buffers to disk after each update. If this option is set to {{Code|false}}, bulk operations (multiple single updates) are evaluated faster. As a drawback, the chance of data loss increases if the database is not explicitly flushed via the {{Command|FLUSH}} command.
 
|}
 
|}
  
Line 1,327: Line 1,453:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|WRITEBACK [boolean]}}
 
|{{Code|WRITEBACK [boolean]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|false}}
 
|{{Code|false}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Propagates updates on main-memory instances of files that have been retrieved via {{Code|fn:doc}} or {{Code|fn:collection}} back to disk. This option can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-u</code>. Please note that, when turning this option on, your original files will not be backed up.
+
|Propagates updates on main-memory instances of files that have been retrieved via {{Code|fn:doc}} and {{Code|fn:collection}} back to disk:
 +
* This option can also be activated on [[Command-Line Options#BaseX Standalone|command line]] via <code>-u</code>.
 +
* Please take in mind that no backup is created from your original files.
 +
* The serialization options can be controlled via the {{Option|EXPORTER}} option.
 
|}
 
|}
  
Line 1,341: Line 1,470:
  
 
{| width='100%'
 
{| width='100%'
|-
+
|- valign="top"
 
| width='120' | '''Signature'''
 
| width='120' | '''Signature'''
 
|{{Code|MAXSTAT [num]}}
 
|{{Code|MAXSTAT [num]}}
|-
+
|- valign="top"
 
| '''Default'''
 
| '''Default'''
 
|{{Code|30}}
 
|{{Code|30}}
|-
+
|- valign="top"
 
| '''Summary'''
 
| '''Summary'''
|Specifies the maximum number of index occurrences printed by the <code>[[Commands#INFO|INFO INDEX]]</code> command.
+
|Specifies the maximum number of index occurrences printed by the {{Command|INFO INDEX}} command.
 
|}
 
|}
  
 
=Changelog=
 
=Changelog=
 +
 +
;Version 11.0
 +
* Removed: <code>TEXTPARSER</code>
 +
* Updated: {{Option|BINDINGS}}: The Clark notation was replaced with the [[XQuery 3.0#Expanded QNames|Expanded QNames]] notation.
 +
 +
;Version 10.0
 +
* Removed: <code>COMPPLAN</code>, <code>IGNOREHOSTNAME</code>
 +
* Updated: {{Option|IGNORECERT}}: Additionally disable hostname verification.
 +
* Updated: {{Option|CATALOG}}: Renamed (before: {{Code|CATFILE}}).
 +
* Updated: {{Option|STRIPWS}}: Renamed (before: {{Code|CHOP}}), new default: false.
 +
 +
;Version 9.7
 +
* Updated: {{Option|GZIP}}: Support for POST and PUT requests.
 +
 +
;Version 9.6
 +
* Added: {{Option|UNROLLLIMIT}}, {{Option|WRAPJAVA}}
 +
 +
;Version 9.5
 +
* Updated: {{Option|INLINELIMIT}}: default reduced to 50.
 +
* Updated: {{Option|RESTXQERRORS}}: additionally suppress stack trace in HTTP response
 +
 +
;Version 9.4
 +
* Added: {{Option|LOGTRACE}}
 +
 +
;Version 9.3
 +
* Added: {{Option|WITHDB}}, {{Option|GZIP}}
 +
 +
;Version 9.2
 +
* Added: {{Option|RESTXQERRORS}}, {{Option|FULLPLAN}}
 +
* Removed: <code>DOTPLAN</code>, <code>DOTCOMPACT</code>
 +
 +
;Version 9.0
 +
* Added: {{Option|ENFORCEINDEX}}, {{Option|COPYNODE}}, {{Option|IGNOREHOSTNAME}}
 +
 +
;Version 8.6
 +
* Added: {{Option|FAIRLOCK}}, {{Option|PARSERESTXQ}}
 +
* Removed: {{Code|GLOBALLOCK}} (exclusive use of database lock)
 +
* Removed: {{Code|QUERYPATH}} (will now be internally assigned)
 +
* Removed: {{Code|CACHERESTXQ}} (replaced with PARSERESTXQ)
 +
 +
;Version 8.5
 +
* Added: {{Option|CACHETIMEOUT}}, {{Option|LOGPATH}}
 +
* Updated: {{Option|AUTHMETHOD}}: {{Code|custom}} value added.
  
 
;Version 8.4
 
;Version 8.4
 
+
* Added: {{Option|TOKENINDEX}}, {{Option|TOKENINCLUDE}}
* Added: <code>[[#TOKENINDEX|TOKENINDEX]]</code>, <code>[[#TOKENINCLUDE|TOKENINCLUDE]]</code>
+
* Added: {{Option|SPLITSIZE}} (replacing <code>INDEXSPLITSIZE</code> and <code>FTINDEXSPLITSIZE</code>)
 +
* Removed: <code>INDEXSPLITSIZE</code>, <code>FTINDEXSPLITSIZE</code>
  
 
;Version 8.3
 
;Version 8.3
 
+
* Added: {{Option|CACHERESTXQ}}, {{Option|TEXTINCLUDE}}, {{Option|ATTRINCLUDE}}, {{Option|FTINCLUDE}}, {{Option|ARCHIVENAME}}
* Added: <code>[[#CACHERESTXQ|CACHERESTXQ]]</code>, <code>[[#TEXTINCLUDE|TEXTINCLUDE]]</code>, <code>[[#ATTRINCLUDE|ATTRINCLUDE]]</code>, <code>[[#FTINCLUDE|FTINCLUDE]]</code>, <code>[[#ARCHIVENAME|ARCHIVENAME]]</code>
 
  
 
;Version 8.2
 
;Version 8.2
 
 
* Removed: <code>EVENTPORT</code>, <code>CACHEQUERY</code>
 
* Removed: <code>EVENTPORT</code>, <code>CACHEQUERY</code>
  
 
;Version 8.1
 
;Version 8.1
 
+
* Added: {{Option|IGNORECERT}}, {{Option|RESTPATH}}
* Added: <code>[[#IGNORECERT|IGNORECERT]]</code>, <code>[[#RESTPATH|RESTPATH]]</code>
 
  
 
;Version 8.0
 
;Version 8.0
 
+
* Added: {{Option|MIXUPDATES}}, {{Option|AUTOOPTIMIZE}}, {{Option|AUTHMETHOD}}, {{Option|XINCLUDE}}
* Added: <code>[[#MIXUPDATES|MIXUPDATES]]</code>, <code>[[#AUTOOPTIMIZE|AUTOOPTIMIZE]]</code>, <code>[[#AUTHMETHOD|AUTHMETHOD]]</code>, <code>[[#XINCLUDE|XINCLUDE]]</code>
+
* Updated: {{Option|PROXYPORT}}: default set to 0; will be ignored. {{Option|PROXYHOST}}, {{Option|NONPROXYHOSTS}}: empty strings will be ignored.
* Updated: <code>[[#PROXYPORT|PROXYPORT]]</code>: default set to 0; will be ignored. <code>[[#PROXYHOST|PROXYHOST]]</code>, <code>[[#NONPROXYHOSTS|NONPROXYHOSTS]]</code>: empty strings will be ignored.
 
  
 
;Version 7.8.1
 
;Version 7.8.1
* Updated: <code>[[#ADDARCHIVES|ADDARCHIVES]]</code>: parsing of TAR and TGZ files.
+
* Updated: {{Option|ADDARCHIVES}}: parsing of TAR and TGZ files.
  
 
;Version 7.8
 
;Version 7.8
  
* Added: <code>[[#CSVPARSER|CSVPARSER]]</code>, <code>[[#JSONPARSER|JSONPARSER]]</code>, <code>[[#TEXTPARSER|TEXTPARSER]]</code>, <code>[[#HTMLPARSER|HTMLPARSER]]</code>, <code>[[#INLINELIMIT|INLINELIMIT]]</code>, <code>[[#TAILCALLS|TAILCALLS]]</code>, <code>[[#DEFAULTDB|DEFAULTDB]]</code>, <code>[[#RUNQUERY|RUNQUERY]]</code>
+
* Added: {{Option|CSVPARSER}}, {{Option|JSONPARSER}}, TEXTPARSER, {{Option|HTMLPARSER}}, {{Option|INLINELIMIT}}, {{Option|TAILCALLS}}, {{Option|DEFAULTDB}}, {{Option|RUNQUERY}}
* Updated: <code>[[#WRITEBACK|WRITEBACK]]</code> only applies to main-memory document instances.
+
* Updated: {{Option|WRITEBACK}} only applies to main-memory document instances.
* Updated: <code>[[#DEBUG|DEBUG]]</code> option can be changed at runtime by users with admin permissions.
+
* Updated: {{Option|DEBUG}} option can be changed at runtime by users with admin permissions.
* Updated: default of <code>[[#INTPARSE|INTPARSE]]</code> is now {{Code|false}}.
+
* Updated: default of {{Option|INTPARSE}} is now {{Code|false}}.
* Removed: <code>HTMLOPT</code> (replaced with <code>[[#HTMLPARSER|HTMLPARSER]]</code>), <code>PARSEROPT</code> (replaced with parser-specific options), <code>DOTDISPLAY</code>, <code>DOTTY</code>
+
* Removed: <code>HTMLOPT</code> (replaced with {{Option|HTMLPARSER}}), <code>PARSEROPT</code> (replaced with parser-specific options), <code>DOTDISPLAY</code>, <code>DOTTY</code>
  
 
;Version 7.7
 
;Version 7.7
 
+
* Added: {{Option|ADDCACHE}}, {{Option|CHECKSTRINGS}}, {{Option|FTINDEXSPLITSIZE}}, {{Option|INDEXSPLITSIZE}}
* Added: <code>[[#ADDCACHE|ADDCACHE]]</code>, <code>[[#CHECKSTRINGS|CHECKSTRINGS]]</code>, <code>[[#FTINDEXSPLITSIZE|FTINDEXSPLITSIZE]]</code>, <code>[[#INDEXSPLITSIZE|INDEXSPLITSIZE]]</code>
 
  
 
;Version 7.6
 
;Version 7.6
 
+
* Added: {{Option|GLOBALLOCK}}
* Added: <code>[[#GLOBALLOCK|GLOBALLOCK]]</code>
 
 
* Added: store local options in configuration file after {{Code|# Local Options}} comments.
 
* Added: store local options in configuration file after {{Code|# Local Options}} comments.
  
 
;Version 7.5
 
;Version 7.5
 
 
* Added: options can now be set via system properties
 
* Added: options can now be set via system properties
 
* Added: a pragma expression can be used to locally change database options
 
* Added: a pragma expression can be used to locally change database options
* Added: <code>[[#USER|USER]]</code>, <code>[[#PASSWORD|PASSWORD]]</code>, <code>[[#LOG|LOG]]</code>, <code>[[#LOGMSGMAXLEN|LOGMSGMAXLEN]]</code>, <code>[[#WEBPATH|WEBPATH]]</code>, <code>[[#RESTXQPATH|RESTXQPATH]]</code><code>[[#HTTPLOCAL|HTTPLOCAL]]</code>, <code>[[#CREATEONLY|CREATEONLY]]</code>, <code>[[#STRIPNS|STRIPNS]]</code>
+
* Added: {{Option|USER}}, {{Option|PASSWORD}}, {{Option|LOG}}, {{Option|LOGMSGMAXLEN}}, {{Option|WEBPATH}}, {{Option|RESTXQPATH}}{{Option|HTTPLOCAL}}, {{Option|CREATEONLY}}, {{Option|STRIPNS}}
 
* Removed: {{Code|HTTPPATH}}; {{Code|HTTPPORT}}: {{Code|jetty.xml}} configuration file is used instead
 
* Removed: {{Code|HTTPPATH}}; {{Code|HTTPPORT}}: {{Code|jetty.xml}} configuration file is used instead
 
* Removed: global options cannot be changed anymore during the lifetime of a BaseX instance
 
* Removed: global options cannot be changed anymore during the lifetime of a BaseX instance
  
 
;Version 7.3
 
;Version 7.3
 
+
* Updated: {{Option|KEEPALIVE}}, {{Option|TIMEOUT}}: default values changed
* Updated: <code>[[#KEEPALIVE|KEEPALIVE]]</code>, <code>[[#TIMEOUT|TIMEOUT]]</code>: default values changed
 
 
* Removed: {{Code|WILDCARDS}}; new index supports both fuzzy and wildcard queries
 
* Removed: {{Code|WILDCARDS}}; new index supports both fuzzy and wildcard queries
 
* Removed: {{Code|SCORING}}; new scoring model will focus on lengths of text nodes and match options
 
* Removed: {{Code|SCORING}}; new scoring model will focus on lengths of text nodes and match options
  
 
;Version 7.2
 
;Version 7.2
 
+
* Added: {{Option|PROXYHOST}}, {{Option|PROXYPORT}}, {{Option|NONPROXYHOSTS}}, {{Option|HTMLOPT}}
* Added: <code>[[#PROXYHOST|PROXYHOST]]</code>, <code>[[#PROXYPORT|PROXYPORT]]</code>, <code>[[#NONPROXYHOSTS|NONPROXYHOSTS]]</code>, <code>[[#HTMLOPT|HTMLOPT]]</code>
+
* Updated: {{Option|TIMEOUT}}: ignore timeout for admin users
* Updated: <code>[[#TIMEOUT|TIMEOUT]]</code>: ignore timeout for admin users
 
  
 
;Version 7.1
 
;Version 7.1
 
+
* Added: {{Option|ADDRAW}}, {{Option|MAXLEN}}, {{Option|MAXCATS}}, {{Option|UPDINDEX}}
* Added: <code>[[#ADDRAW|ADDRAW]]</code>, <code>[[#MAXLEN|MAXLEN]]</code>, <code>[[#MAXCATS|MAXCATS]]</code>, <code>[[#UPDINDEX|UPDINDEX]]</code>
+
* Updated: {{Option|BINDINGS}}
* Updated: <code>[[#BINDINGS|BINDINGS]]</code>
 
  
 
;Version 7.0
 
;Version 7.0
 
+
* Added: {{Option|SERVERHOST}}, {{Option|KEEPALIVE}}, {{Option|AUTOFLUSH}}, {{Option|QUERYPATH}}
* Added: <code>[[#SERVERHOST|SERVERHOST]]</code>, <code>[[#KEEPALIVE|KEEPALIVE]]</code>, <code>[[#AUTOFLUSH|AUTOFLUSH]]</code>, <code>[[#QUERYPATH|QUERYPATH]]</code>
 

Latest revision as of 17:39, 1 December 2023

This page is linked from the Getting Started Section.

The options listed on this page influence the way how database commands are executed and XQuery expressions are evaluated. Two kinds of options exist:

  • Global Options are valid for all BaseX instances in the same JVM. This is particularly relevant if you are working with the client/server architecture.
  • Local options (all remaining ones) are specific to a client or session.

Values of options are either strings, numbers or booleans. Options are static and not bound to a single operation (for example, the next command). Various ways exist to access and change options:

  • The current value of an option can be requested with the SHOW OPTIONS command. Local options can be changed via SET (all global options, except for DEBUG, can only be changed at startup time). If an option is of type boolean, and if no value is specified, its current value is inverted.
  • The .basex configuration file is parsed by every new local BaseX instance. It contains all global options. Local options can be specified at the end of the file after the Local Options comment:
# General Options
DEBUG = false
...

# Local Options
CATALOG = etc/w3-catalog.xml
  • Initial values for global options can also be specified via system properties, which can be passed on with the -D flag on the command line, or with System.setProperty() before creating a BaseX instance. The specified keys need to be prefixed with org.basex.. An example:
java -Dorg.basex.CATALOG=etc/w3-catalog.xml -cp basex.jar org.basex.BaseX -c"SHOW OPTIONS catalog"
CATALOG: etc/w3-catalog.xml
  • If the Mac OS X packaged application is used, global options can be set within the Info.plist file within the Contents folder of the application package. For example:
<key>JVMOptions</key>
<array>
  <string>-Dorg.basex.CATALOG=etc/w3-catalog.xml</string>
</array>
  • In a Web Application, the default can be adjusted in the web.xml file as follows:
<context-param>
  <param-name>org.basex.catalog</param-name>
  <param-value>etc/w3-catalog.xml</param-value>
</context-param>
  • In XQuery, local options can be set via option declarations and pragmas.

If options are changed by operations in the GUI, the underlying commands are output in the Info View.

Contents

Global Options[edit]

Global options are constants. They can only be set in the configuration file or via system properties (see above). One exception is the DEBUG option, which can also be changed at runtime by users with admin permissions.

General Options[edit]

DEBUG[edit]

Signature DEBUG [boolean]
Default false
Summary Sends debug information to STDERR. This option can be enabled to get additional information for development and debugging purposes. It can also be triggered on the command line via -d. Note that the amount of output can be overwhelming, so it should not be used productively.

DBPATH[edit]

Signature DBPATH [path]
Default {home}/data
Summary Points to the directory in which all databases are located.

LOGPATH[edit]

Signature LOGPATH [path]
Default .logs
Summary Points to the directory where log files are stored. Relative paths are resolved against the DBPATH directory.

REPOPATH[edit]

Signature REPOPATH [path]
Default {home}/repo
Summary Points to the Repository, in which all XQuery modules are located.

LANG[edit]

Signature LANG [language]
Default English
Summary Specifies the interface language.

FAIRLOCK[edit]

Signature FAIRLOCK [boolean]
Default false
Summary Defines the locking strategy:
  • By default, non-fair locking is used. Read transactions are favored, and transactions that no not access databases can be evaluated even if the limit for parallel transactions (specified via PARALLEL) is reached. This prevents update operations from blocking all other requests. For example, the DBA can still be used to see which jobs are running even if the queue is full.
  • When fair locking is enabled, read and write transactions are treated equally (first in, first out). This avoids starvation of updates and should be used when prompt evaluation of updates is critical.

CACHETIMEOUT[edit]

Signature CACHETIMEOUT [seconds]
Default 3600
Summary Specifies how many seconds the results of asynchronously executed jobs are cached in main memory.

Client/Server Architecture[edit]

HOST[edit]

Signature HOST [host]
Default localhost
Summary This host name is used by the client when connecting to a server. This option can also be changed when running the client on command line via -n.

PORT[edit]

Signature PORT [port]
Default 1984
Summary This port is used by the client when connecting to a server. This option can also be changed when running the client on command line via -p.

SERVERPORT[edit]

Signature SERVERPORT [port]
Default 1984
Summary This is the port the database server is listening to. This option can also be changed when running the server on command line via -p.

USER[edit]

Signature USER [name]
Default empty
Summary Represents a username, which is used for accessing the server or an HTTP service:
  • The default value is overwritten if a client specifies its own credentials.
  • If the default value is empty, login will only be possible if the client specifies credentials.
  • The option can also be changed on command line via -U.

PASSWORD[edit]

Signature PASSWORD [password]
Default empty
Summary Represents a password, which is used for accessing the server:
  • The default value is overwritten if a client specifies its own credentials.
  • If the default value is empty, authentication will only be possible if the client supplies credentials.
  • The option can also be changed on command line via -P.
  • Please note that it is a security risk to specify your password in plain text.

AUTHMETHOD[edit]

Signature AUTHMETHOD [method]
Default Basic
Summary Specifies the default authentication method used by the HTTP server for negotiating credentials. Allowed values are Basic, Digest, and Custom:
  • If basic access is chosen, the client can still request digest authentication.
  • This is different for digest access, which cannot be overwritten.
  • With custom authentication, the server will not do any authentication.

SERVERHOST[edit]

Signature SERVERHOST [host|ip]
Default empty
Summary This is the host name or IP address the server is bound to. If the option is set to an empty string (which is the default), the server accepts requests from all clients.

PROXYHOST[edit]

Signature PROXYHOST [host]
Default empty
Summary This is the host name of a proxy server. If the value is an empty string, it is ignored.

PROXYPORT[edit]

Signature PROXYPORT [port]
Default 0
Summary This is the port number of a proxy server. If the value is set to 0, it is ignored.

NONPROXYHOSTS[edit]

Signature NONPROXYHOSTS [hosts]
Default empty
Summary This is a list of hosts that should be directly accessed. If the value is an empty string, it is ignored.

IGNORECERT[edit]

Signature IGNORECERT [boolean]
Default false
Summary Disable SSL hostname verification and ignore untrusted certificates when connecting to servers.

TIMEOUT[edit]

Signature TIMEOUT [seconds]
Default 30
Summary Specifies the maximum time a transaction triggered by a client may take. An operation is interrupted when it takes longer than the specified number of seconds. Running update operations are not affected by this timeout, as this would corrupt the integrity of the database. The timeout is disabled if the timeout is set to 0. It is ignored for operations with admin permissions.

KEEPALIVE[edit]

Signature KEEPALIVE [seconds]
Default 600
Summary Specifies the maximum time a client is remembered by the server. If there has been no interaction with a client for a longer time than specified by this timeout, it will be disconnected. Running operations will not be affected by this option. The check is disabled if the value is set to 0.

PARALLEL[edit]

Signature PARALLEL [number]
Default 8
Summary Denotes the maximum allowed number of parallel transactions:
  • If FAIRLOCK is enabled, the number of parallel transactions will never exceed the specified value.
  • If the option is disabled (which is the default), the limit only applies to transactions that access databases.
  • The main reason for allowing parallel operations is to prevent slow transactions from blocking all other operations. A higher number of parallel operations may increase disk activity and thus slow down queries. In some cases, a single transaction may even give you better results than any parallel activity.

LOG[edit]

Signature LOG [boolean]
Default true
Summary Turns Logging of server operations and HTTP requests on/off. This option can also be changed when running the server on command line via -z.

LOGMSGMAXLEN[edit]

Signature LOGMSGMAXLEN [length]
Default 1000
Summary Specifies the maximum length of a single log message.

LOGTRACE[edit]

Signature LOGTRACE [boolean]
Default true
Summary If BaseX is running as a Web Application, trace output (generated via fn:trace, prof:dump and similar functions) is written to the database logs. When this option is disabled, trace output is redirected to standard error, as it is known from the standalone version of BaseX.

HTTP Services[edit]

Most HTTP options are defined in the jetty.xml and web.xml configuration files in the webapp/WEB-INF directory. Some additional BaseX-specific options exist that are set before the web server is started:

WEBPATH[edit]

Signature WEBPATH [path]
Default {home}/webapp
Summary Points to the directory in which all the Web Application contents are stored, including XQuery, Script, RESTXQ and configuration files:
  • The option is ignored if BaseX is deployed as web servlet.
  • It cannot be assigned via the web.xml file, as it is evaluated before the configuration files are parsed.

GZIP[edit]

Signature GZIP [boolean]
Default false
Summary Jetty provides a Gzip handler for dynamically uncompressing requests and compressing responses. This feature can be enabled if Jetty is started via the BaseX HTTP Server:
  • The option can also be enabled on command line via -g.
  • It cannot be assigned via the web.xml file, as the option is evaluated before the configuration files are parsed.
  • In addition to the Jetty defaults (GET requests, exclusion of binaries, MSIE 6.0, etc.), POST and PUT requests are supported.

RESTXQPATH[edit]

Signature RESTXQPATH [path]
Default empty
Summary Points to the directory which contains the RESTXQ modules of a web application. Relative paths are resolved against the WEBPATH directory.

PARSERESTXQ[edit]

Signature PARSERESTXQ
Default 3
Summary Timeout after which the RESTXQ directory is parsed for changes:
  • If 0 is specified, the directory is parsed every time a RESTXQ function is called.
  • A positive value defines the idle time, in seconds, after which parsing is enforced. The default value is 3: Changes in the RESTXQ directory is detected after 3 seconds without RESTXQ function calls.
  • Monitoring is completely disabled if a negative value is specified.

See RESTXQ Preliminaries for more details.

RESTXQERRORS[edit]

Signature RESTXQERRORS
Default true
Summary Reports parsing errors in XQuery modules in the RESTXQ directory and returns the full error message and stack trace to the client. By default, this option is enabled. In a production environment, it can be disabled to suppress errors that should not be seen by the user of an API (the full error information can still be looked up in the database logs). See RESTXQ Error Handling for more details.

RESTPATH[edit]

Signature RESTPATH [path]
Default empty
Summary Points to the directory which contains XQuery files and command scripts, which can be evaluated via the REST run operation. Relative paths are resolved against the WEBPATH directory.

HTTPLOCAL[edit]

Signature HTTPLOCAL [boolean]
Default false
Summary By default, if BaseX is run as Web Application, the database server instance is started in addition, which can then be addressed by Clients via the database port (see PORT). If the option is set to true, no database server is launched.

STOPPORT[edit]

Signature STOPPORT [port]
Default 8081
Summary If Jetty is started from BaseX, this is the port on which the HTTP Server can be closed:
  • The listener for stopping the web server will only be started if the specified value is greater than 0.
  • The option is ignored if BaseX is used as a Web Application or started via Maven.
  • This option can also be changed when running the HTTP server on command line via -s.

Create Options[edit]

General[edit]

MAINMEM[edit]

Signature MAINMEM [boolean]
Default false
Summary If this option is turned on, new databases arecreated in main memory:
  • Most queries are evaluated faster in main-memory mode, but all data is lost if the BaseX instance in which the database was created is shut down.
  • It is not possible to store binary resources in a main-memory database.
  • A main-memory database will have no disk representation. However, it is possible to export the database via the EXPORT command, and create a new database from the exported file in a second step.
  • This option will not be available for db:create because the database would not be accessible anymore after database creation, i.e., outside the query scope.

ADDCACHE[edit]

Signature ADDCACHE [boolean]
Default false
Summary If this option is activated, data structures of documents will first be cached to disk before being added to the final database. This option is helpful when larger documents need to be added, and if the existing heuristics cannot estimate the input size (e.g. when adding directories or sending input streams).

Parsing[edit]

CREATEFILTER[edit]

Signature CREATEFILTER [filter]
Default *.xml
Summary File filter in the Glob Syntax, which is applied whenever new databases are created, or resources are added to a database.

ADDARCHIVES[edit]

Signature ADDARCHIVES [boolean]
Default true
Summary If this option is set to true, files within archives (ZIP, GZIP, TAR, TGZ, DOCX, etc.) are parsed whenever new databases are created or resources are added to a database.

ARCHIVENAME[edit]

Signature ARCHIVENAME [boolean]
Default false
Summary If this option is set to true, the file name of parsed archives is included in the document paths.

SKIPCORRUPT[edit]

Signature SKIPCORRUPT [boolean]
Default false
Summary Skips corrupt (i.e., not well-formed) files while creating a database or adding new documents. If this option is activated, document updates are slowed down, as all files are parsed twice. Next, main-memory consumption is higher, as parsed files are cached in main memory.

ADDRAW[edit]

Signature ADDRAW [boolean]
Default false
Summary If this option is enabled, all resources that are filtered out by the CREATEFILTER option while being added to a database are stored as raw files instead (i.e., in their binary representation).

PARSER[edit]

Signature PARSER [type]
Default XML
Summary Defines a parser for importing new files to the database. Available parsers are XML, JSON, CSV, TEXT, HTML, and RAW. HTML input is parsed as XML if Tagsoup is not found in the class path.

CSVPARSER[edit]

Signature CSVPARSER [options]
Default empty
Summary Specifies the way how CSV data is parsed. Keys and values are delimited with =, and multiple options are delimited with ,. The available options (except for the additional encoding option) are described in the CSV Module.
Examples encoding=CP1252,header=true parses the input as CP1252 and the first line as header.

JSONPARSER[edit]

Signature JSONPARSER [options]
Default empty
Summary Specifies the way how JSON data is parsed. Keys and values are delimited with =, and multiple options are delimited with ,. The available options (except for the additional encoding option) are described in the JSON Module.
Examples format=jsonml,lax=yes interprets the input as JSONML and uses lax parsing.

HTMLPARSER[edit]

Signature HTMLPARSER [options]
Default empty
Summary Specifies the way how HTML data is parsed. Keys and values are delimited with =, and multiple options are delimited with ,. The available options are described in the Parsers article.
Examples
  • encoding=Shift-JIS,nons=true parses the input as Shift-JIS and suppresses namespaces.
  • lexical=true preserves comments.

XML Parsing[edit]

STRIPWS[edit]

Signature STRIPWS [boolean]
Default false
Summary Many XML documents include whitespaces that have been added to improve readability. This option controls the white-space processing mode of the XML parser:
  • If the option is set to true, leading and trailing whitespaces from text nodes are stripped and empty text nodes are discarded.
  • The flag should be disabled if a document contains mixed content.
  • The flag can also be turned on via the command line and -w.
  • If the option is enabled, whitespaces of an element and its descendants can locally be preserved with the xml:space="preserve" attribute:
<xml>
  <title>
    Demonstrating the STRIPWS flag
  </title>
  <text xml:space="preserve">To <b>be</b>, or not to <b>be</b>, that is the question.</text>
</xml>

If whitespaces are stripped, indent=yes can be assigned to the SERIALIZER option to get properly indented XML output.
See BaseX 10: Whitespaces for more information on whitespace handling.

STRIPNS[edit]

Signature STRIPNS [boolean]
Default false
Summary Strips all namespaces from an XML document while parsing.

INTPARSE[edit]

Signature INTPARSE [boolean]
Default false
Summary Uses the internal XML parser instead of the standard Java XML parser. Reasons for using the internal parser are:
  • Performance: Documents, in particular small ones, are parsed faster.
  • Entity support: More than 2000 HTML character references are decoded.
  • Fault tolerance: Invalid characters are replaced with the Unicode replacement character FFFD (�).

By default, Java’s Xerces parser is used as it supports all features of the XML standard, including XINCLUDE, or recursive entity expansion if specified in DTDs.

DTD[edit]

Signature DTD [boolean]
Default false
Summary Parses referenced DTDs and resolves XML entities. By default, this option is switched to false, as many DTDs are located externally, which may completely block the process of creating new databases. The CATALOG option can be changed to locally resolve DTDs.

XINCLUDE[edit]

Signature XINCLUDE [boolean]
Default true
Summary Resolves XInclude inclusion tags and merges referenced XML documents. By default, this option is switched to true. This option is only available if the standard Java XML Parser is used (see INTPARSE).

CATALOG[edit]

Signature CATALOG [path]
Default empty
Summary Semicolon-separated list of XML catalog files to resolve URIs. See Catalog Resolvers for more details.

Indexing[edit]

The following options control the creation of index structures. The current values are considered if a new database is created. See Indexes for more details.

TEXTINDEX[edit]

Signature TEXTINDEX [boolean]
Default true
Summary Creates a text index whenever a new database is created. A text index speeds up queries with equality comparisons on text nodes. See Text Index for more details.

ATTRINDEX[edit]

Signature ATTRINDEX [boolean]
Default true
Summary Creates an attribute index whenever a new database is created. An attribute index speeds up queries with equality comparisons on attribute values. See Attribute Index for more details.

TOKENINDEX[edit]

Signature TOKENINDEX [boolean]
Default true
Summary Creates a token index whenever a new database is created. A token index speeds up searches for single tokens in attribute values. See Token Index for more details.

FTINDEX[edit]

Signature FTINDEX [boolean]
Default false
Summary Creates a full-text index whenever a new database is created. A full-text index speeds up queries with full-text expressions. See Full-Text Index for more details.

TEXTINCLUDE[edit]

Signature TEXTINCLUDE [names]
Default empty
Summary Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes are indexed.
Name patterns are separated by commas. See Selective Indexing for more details.

ATTRINCLUDE[edit]

Signature ATTRINCLUDE [names]
Default empty
Summary Defines name patterns for the attributes to be indexed. By default, all attribute nodes are indexed.
Name patterns are separated by commas. See Selective Indexing for more details.

TOKENINCLUDE[edit]

Signature TOKENINCLUDE [names]
Default empty
Summary Defines name patterns for the attributes to be indexed. By default, tokens in all attribute nodes are indexed.
Name patterns are separated by commas. See Selective Indexing for more details.

FTINCLUDE[edit]

Signature FTINCLUDE [names]
Default empty
Summary Defines name patterns for the parent elements of texts that are indexed. By default, all text nodes are indexed.
Name patterns are separated by commas. See Selective Indexing for more details.

MAXLEN[edit]

Signature MAXLEN [int]
Default 96
Summary Specifies the maximum length for strings to be stored in index structures. The value of this option is assigned once to a new database, and can only be changed by creating a new database or doing a full optimization.

MAXCATS[edit]

Signature MAXCATS [int]
Default 100
Summary Specifies the maximum number of distinct values (categories) that are stored together with the element/attribute names or unique paths in the Name Index or Path Index. The value of this option is assigned once to a new database, and cannot be changed after that.

UPDINDEX[edit]

Signature UPDINDEX [boolean]
Default false
Summary If turned on, incremental indexing is enabled:
  • The current value of this option is assigned to new databases. It can be changed for existing databases by running OPTIMIZE with the ALL keyword or db:optimize and true() as second argument.
  • After each update, the value indexes are refreshed as well. Incremental updates are currently not available for the full-text index and database statistics.
  • Find more details in the article on Index Structures.

AUTOOPTIMIZE[edit]

Signature AUTOOPTIMIZE [boolean]
Default false
Summary If turned on, auto optimization is applied to new databases:
  • With each update, outdated indexes and database statistics are recreated.
  • As a result, the index structures will always be up-to-date.
  • However, updates can take much longer, so this option should only be activated for medium-sized databases.
  • The value of this option is assigned once to a new database. It can be reassigned by running OPTIMIZE or db:optimize.

SPLITSIZE[edit]

Signature SPLITSIZE [num]
Default 0
Summary This option affects the construction of new value indexes. It controls the number of index build operations that are performed before writing partial index data to disk:
  • By default, if the value is set to 0, some heuristics are applied, based on the current memory consumption. Usually, this works fine.
  • If explicit garbage collection is disabled when running Java (e.g. via the JVM option -XX:+DisableExplicitGC), you may need to choose a custom split size.
  • You can e.g. start with 1000000 (one million) index operations and adjust this value in the next steps.
  • The larger the assigned value is, the fewer splits take place, and main-memory consumption increases.

Full-Text Indexing[edit]

STEMMING[edit]

Signature STEMMING [boolean]
Default false
Summary If true, all tokens are stemmed during full-text indexing, using a language-specific stemmer implementation. By default, tokens will not be stemmed. See Full-Text Index for more details.

CASESENS[edit]

Signature CASESENS [boolean]
Default false
Summary If true, the case of tokens is preserved during full-text indexing. By default, case is ignored (all tokens are indexed in lower case). See Full-Text Index for more details.

DIACRITICS[edit]

Signature DIACRITICS [boolean]
Default false
Summary If set to true, diacritics are preserved during full-text indexing. By default, diacritics are removed. See Full-Text Index for more details.

LANGUAGE[edit]

Signature LANGUAGE [lang]
Default en
Summary The specified language will influence the way how texts are tokenized and stemmed. It can be the name of a language or a language code. See Full-Text Index for more details.

STOPWORDS[edit]

Signature STOPWORDS [path]
Default empty
Summary If a text file with stop words is specified, frequently used terms contained in that file are ignored when a full-text index is created. A stop word list may decrease the size of the full-text index and speed up your queries. See Full-Text Index for more details.

Query Options[edit]

QUERYINFO[edit]

Signature QUERYINFO [boolean]
Default false
Summary Prints more information on internal query rewritings, optimizations, and performance. By default, this info is shown in the Info View in the GUI. It can also be activated on command line via -V.

MIXUPDATES[edit]

Signature MIXUPDATES
Default false
Summary Allows queries to both contain updating and non-updating expressions. All updating constraints are turned off, and nodes to be returned are copied before they are modified by an updating expression. By default, in compliance with the XQuery Update Facility, this option is set to false. See Returning Results for more details.

BINDINGS[edit]

Updated with Version 11: The Clark notation was replaced with the Expanded QNames notation.

Signature BINDINGS [vars]
Default empty
Summary Contains external variables to be bound to a query. The string must comply with the following rules:
  • Variable names and values must be separated by equality signs.
  • Multiple variables must be delimited by commas.
  • Commas in values must be duplicated.
  • Variables may optionally be introduced with a leading dollar sign.
  • If a variable uses a namespace different from the default namespace, it can be specified with the Expanded QNames notation.

This option can also be used on command line with the flag -b.

Examples
  • $a=1,$b=2   binds the values 1 and 2 to the variables $a and $b
  • a=1,,2   binds the value 1,2 to the variable $a
  • {URI}a=x   binds the value x to the variable $a with the namespace URI.
  • In the following Command Script, the value hello world! is bound to the variable $GREETING:
SET BINDINGS GREETING="hello world!"
XQUERY declare variable $GREETING external; $GREETING

INLINELIMIT[edit]

Signature INLINELIMIT
Default 50
Summary This option controls inlining of XQuery functions:
  • The XQuery compiler inlines functions to speed up query evaluation.
  • Inlining will only take place if a function body is not too large (i.e., if it does not contain too many expressions).
  • With this option, this maximum number of expressions can be specified.
  • Function inlining can be turned off by setting the value to 0.
  • The limit can be locally overwritten via the %basex:inline annotation (follow the link to get more information on function inlining).

UNROLLLIMIT[edit]

Signature UNROLLLIMIT
Default 5
Summary This option controls the unroll limit:
  • Loops with few iterations are unrolled by the XQuery compiler to enable further optimizations.
  • If the limit is increased, more optimizations will take place, but the memory consumption and compile time will increase.
  • See Loop Unrolling for more details.

ENFORCEINDEX[edit]

Signature ENFORCEINDEX [boolean]
Default false
Summary Enforces index rewritings in path expressions. See Enforce Rewritings for details.

COPYNODE[edit]

Signature COPYNODE [boolean]
Default true
Summary When creating new nodes in XQuery via Node Constructors, copies of all enclosed nodes are created, and the copied nodes get new node identities. As a result, the following query yields false:
let $a := <a/>
let $b := <b>{ $a }</b>
return $b/a is $a

This step can be expensive and memory consuming. If the option is disabled, child nodes will only be linked to the new parent nodes, and the upper query returns true.
The option should be used carefully as it changes the semantics of XQuery. It should preferably be used in Pragmas.

TAILCALLS[edit]

Signature TAILCALLS
Default 256
Summary Specifies how many stack frames of tail-calls are allowed on the stack at any time. When this limit is reached, tail-call optimization takes place and some call frames are eliminated. The feature can be turned off by setting the value to -1.

WITHDB[edit]

Signature WITHDB
Default true
Summary By default, resources specified via fn:doc and fn:collection are looked up both in the database and in the file system. If you always use db:get to access databases, it is recommendable to disable this option:
  • No locks are created for the two functions (see limitations of database locking for more details).
  • Access to local and external resources is faster, as the database lookup is skipped.

DEFAULTDB[edit]

Signature DEFAULTDB
Default false
Summary If this option is turned on, paths specified in the fn:doc and fn:collection functions will first be resolved against a database that has been opened in the global context outside the query (e.g. by the OPEN command). If the path does not match any existing resources, it is resolved as described in the article on accessing database resources.

FORCECREATE[edit]

Signature FORCECREATE [boolean]
Default false
Summary By enabling this option, database instances are created if the XQuery functions fn:doc and fn:collection are called.

CHECKSTRINGS[edit]

Signature CHECKSTRINGS [boolean]
Default true
Summary By default, characters from external sources that are invalid in XML will trigger an error. If the option is set to false, these characters are replaced with the Unicode replacement character FFFD (�). The option affects Java Bindings and string conversion and input functions such as archive:create, archive:extract-text, and archive:update.

WRAPJAVA[edit]

Signature WRAPJAVA [mode]
Default some
Summary Defines the way how values that result from Java code invocation are converted to XQuery items. Allowed values: none, all, some, instance, and void. See Java Bindings for further details.

LSERROR[edit]

Signature LSERROR [error]
Default 0
Summary This option specifies the maximum Levenshtein error for fuzzy full-text matching. By default, if 0 is assigned, the error value is calculated dynamically. See Fuzzy Querying for more details.

RUNQUERY[edit]

Signature RUNQUERY [boolean]
Default true
Summary Specifies if a query is executed or parsed only. This option can also be changed on command line via -R.

RUNS[edit]

Signature RUNS [num]
Default 1
Summary Specifies how often a query is evaluated. The result is serialized only once, and the measured times are averages of all runs. This option can also be changed on command line via -r.

Serialization Options[edit]

SERIALIZE[edit]

Signature SERIALIZE [boolean]
Default true
Summary Results of XQuery expressions are serialized if this option is turned on. For debugging purposes and performance measurements, this option can be set to false. It can also be turned off on command line via -z.

SERIALIZER[edit]

Signature SERIALIZER [params]
Default empty
Summary Parameters for serializing query results:
  • Variable names and values are separated by equality signs.
  • Multiple variables are delimited by commas.
  • Commas must be duplicated if they appear as literals in values.

The option can also be used on command line with the flag -s.

Examples
  • indent=yes : enables automatic indentation of XML nodes. This is recommended if whitespaces have been stripped from a document (see Template:Options).
  • encoding=US-ASCII,omit-xml-declaration=no : sets the encoding to US-ASCII and prints the XML declaration.
  • item-separator=,, : separates serialized items by a single comma.

EXPORTER[edit]

Signature EXPORTER [params]
Default empty
Summary Contains parameters for exporting resources of a database and writing files after updates via the WRITEBACK option. Keys and values are separated by equality signs, multiple parameters are delimited by commas. See Serialization for more details.
Examples
  • indent=no,omit-xml-declaration=no : disables automatic indentation of XML nodes, outputs the XML declaration.

XMLPLAN[edit]

Signature XMLPLAN [boolean]
Default false
Summary Prints the execution plan of an XQuery expression in its XML representation. This option can also be activated on command line via -x.

FULLPLAN[edit]

Signature FULLPLAN [boolean]
Default false
Summary Attaches the file path, line and column of the expressions in the original query string to the query plan. Values (items and sequences) have no input information attached.

Other Options[edit]

AUTOFLUSH[edit]

Signature AUTOFLUSH [boolean]
Default true
Summary Flushes database buffers to disk after each update. If this option is set to false, bulk operations (multiple single updates) are evaluated faster. As a drawback, the chance of data loss increases if the database is not explicitly flushed via the FLUSH command.

WRITEBACK[edit]

Signature WRITEBACK [boolean]
Default false
Summary Propagates updates on main-memory instances of files that have been retrieved via fn:doc and fn:collection back to disk:
  • This option can also be activated on command line via -u.
  • Please take in mind that no backup is created from your original files.
  • The serialization options can be controlled via the EXPORTER option.

MAXSTAT[edit]

Signature MAXSTAT [num]
Default 30
Summary Specifies the maximum number of index occurrences printed by the INFO INDEX command.

Changelog[edit]

Version 11.0
Version 10.0
  • Removed: COMPPLAN, IGNOREHOSTNAME
  • Updated: IGNORECERT: Additionally disable hostname verification.
  • Updated: CATALOG: Renamed (before: CATFILE).
  • Updated: STRIPWS: Renamed (before: CHOP), new default: false.
Version 9.7
  • Updated: GZIP: Support for POST and PUT requests.
Version 9.6
Version 9.5
  • Updated: INLINELIMIT: default reduced to 50.
  • Updated: RESTXQERRORS: additionally suppress stack trace in HTTP response
Version 9.4
Version 9.3
Version 9.2
Version 9.0
Version 8.6
  • Added: FAIRLOCK, PARSERESTXQ
  • Removed: GLOBALLOCK (exclusive use of database lock)
  • Removed: QUERYPATH (will now be internally assigned)
  • Removed: CACHERESTXQ (replaced with PARSERESTXQ)
Version 8.5
Version 8.4
Version 8.3
Version 8.2
  • Removed: EVENTPORT, CACHEQUERY
Version 8.1
Version 8.0
Version 7.8.1
Version 7.8
Version 7.7
Version 7.6
  • Added: GLOBALLOCK
  • Added: store local options in configuration file after # Local Options comments.
Version 7.5
  • Added: options can now be set via system properties
  • Added: a pragma expression can be used to locally change database options
  • Added: USER, PASSWORD, LOG, LOGMSGMAXLEN, WEBPATH, RESTXQPATHHTTPLOCAL, CREATEONLY, STRIPNS
  • Removed: HTTPPATH; HTTPPORT: jetty.xml configuration file is used instead
  • Removed: global options cannot be changed anymore during the lifetime of a BaseX instance
Version 7.3
  • Updated: KEEPALIVE, TIMEOUT: default values changed
  • Removed: WILDCARDS; new index supports both fuzzy and wildcard queries
  • Removed: SCORING; new scoring model will focus on lengths of text nodes and match options
Version 7.2
Version 7.1
Version 7.0