Changes

Jump to navigation Jump to search
567 bytes added ,  12:53, 13 March 2019
no edit summary
==Introduction==
XML documents often rely on Document Type Definitions (DTDs). While parsing a document with BaseX, entities Entities can be resolved with respect to that particular DTD. By default, the DTD is only used for entity resolution.
XHTML, for example, defines its doctype via the following line:
Fetching <code>xhtml1-strict.dtd</code> obviously involves network traffic. When dealing with single files, this may seem tolerable, but importing large collections benefits from caching these resources. Depending on the remote server, you will experience significant speed improvements when caching DTDs locally.
 
To address these issues, the [https://www.oasis-open.org/committees/download.php/14809/xml-catalogs.html XML Catalogs Standard] defines an entity catalog that maps both external identifiers and arbitrary URI references to URI references.
==Usage==
BaseX relies on the Apache-maintained [http://xml.apache.org/commons XML Commons Resolver]. The {{Code|''xml-resolver-1.2.jar}} '' library is included in the full distributions of BaseX. If the resolver is not found in the classpath, and if Java 8 is used, Java’s built-in resolver will be applied (via <code>com.sun.org.apache.xml.internal.resolver.*</code>).
To enable entity resolving you have to provide a valid XML Catalog file, so that the parser knows where to look for mirrored DTDs.
</pre>
This rewrites all systemIds starting with: <code><nowiki>http://www.w3.org/TR/xhtml1/DTD/</nowiki></code> to <code>file:///path/to/dtds/</code>. For example, if the following XML file is parsed: <pre class="brush:xml" start="0"><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml"/></pre> The XHTML DTD <code>xhtml1-stricttransitional.dtd</code> and all its linked resources will now be loaded from the specified path.
The catalog file {{Code|''etc/w3-catalog.xml}} '' in the full distributions can be used out of the box. It defines rewriting for the most popular some common W3 DTD files.
===GUI Mode===
When running BaseX in GUI mode, simply enable DTD parsing and provide the path to your XML Catalog file in the ''Parsing'' Tab of the Database Creation Dialog.
===Console & Server Mode===
To enable Entity Resolving in Console Mode, enable the {{Option|DTD}} option and assign a the path to your XML catalog file path to the {{Option|CATFILE}} option. All subsequent <code>ADD</code> commands for adding documents will use the specified catalog file to resolve entities.
The '''paths''' Paths to your catalog file and the actual DTDs are either absolute or relative to the ''current working directory''. When using BaseX in Clientclient-Server-Modeserver mode, this is relative to they are resolved against the working directory of the ''server's'' working directory.
===Additional Notes===
Entity resolving only works if the [[Parsers#XML Parsers|internal XML parser]] is switched off (which is the default case).
If you use the internal parser, you can manually specify whether you want to parse DTDs and entities or not.
If no The runtime properties of the catalog resolver can be changed by setting system properties, or adding a ''CatalogManager.properties'' property file is found in to the classpath. By default, and if the following system property {{Code|xml.catalog resolver properties .ignoreMissing}} is not assigned, no warnings will be set (see output to standard error if the properties file or resources linked from that file are not found. See [https://xerces.apache.org/xml-commons/components/resolver/resolver-article.html#ctrlresolver Controlling the Catalog Resolver] for more information):* {{Code|xml.catalog.ignoreMissing}}: true* {{Code|xml.catalog.staticCatalog}}: false
==Links==
* [https://www.oasis-open.org/committees/download.php/14809/xml-catalogs.html XML Catalogs. OASIS Standard, Version 1.1. 07-October-2005]
* [http://en.wikipedia.org/wiki/Document_Type_Definition Wikipedia on Document Type Definitions]
* [http://xml.apache.org/commons/components/resolver/resolver-article.html Apache XML Commons Article on Entity Resolving]
* [http://java.sun.com/webservices/docs/1.6/jaxb/catalog.html XML Entity and URI Resolvers], Sun
* [http://www.oasis-open.org/committees/download.php/14810/xml-catalogs.pdf XML Catalogs. OASIS Standard, Version 1.1. 07-October-2005.]
Bureaucrats, editor, reviewer, Administrators
13,550

edits

Navigation menu