Changes

Jump to navigation Jump to search
84 bytes added ,  18:00, 28 February 2012
With [http://home.ccil.org/~cowan/XML/tagsoup/ tagsoup] HTML can be imported in BaseX without any problems. Tagsoup ensures that only well-formed HTML arrives at the XML parser (correct opening and closing tags, etc.). Hence if tagsoup is not available on a system, there will be a lot of cases where importing HTML fails.
If tagsoup is available on the classpath or you run BaseX from the sources and use the Maven build manager, BaseX will automatically use tagsoup to prepare HTML input. Otherwise you may be faced with XML syntax issues during the import process. This applies regardless of whether you use the GUI or the standalone mode. TagSoup is also included in the complete BaseX distributions (BaseX.zip, BaseX.exe, etc.).
===Tagsoup installationon Debian===
# apt-get install libtagsoup-java #Debian example.
===GUI===
administrator, Bureaucrats, editor, reviewer, Administrators
408

edits

Navigation menu