Changes

41 bytes added, 15:44, 8 March 2012
TagSoup offers a variety of options to customize the import of HTML. For the complete list
please visit the [http://home.ccil.org/~cowan/XML/tagsoup/ TagSoup] website. BaseX supports
most of these options, with a few exceptions:
* '''encoding''': BaseX tries to guess the input encoding, but the user can override this if necessary.
* '''files''': Not supported, as input documents are piped directly to the XML parser.
* '''method''': Set to 'xml' by default. If this is set to 'html', closing tags may be missing, for instance.
* '''version''': Dismissed, as TagSoup always falls back to 'version 1.0', no matter what the input is.
* '''standalone''': Deactivated.
* '''pyx''': Not supported, as the XML parser can't handle this kind of input.
* '''pyxin''': See the pyx option.
* '''output-encoding''': Not supported; BaseX already takes care of that.
* '''reuse''': Not supported.
* '''help''': Not supported.
* '''version''': Not supported.
These options can be changed like any other option in BaseX, for example via