Web::HTML::ParserData
Data for HTML parser
DESCRIPTION
The Web::HTML::ParserData
module contains data for HTML and XML parsers,
extracted from the HTML Standard.
CONSTANTS
Following constants returning namespace URLs are defined (but not exported): HTML_NS
(HTML namespace),
SVG_NS
(SVG namespace),
MML_NS
(MathML namespace),
XML_NS
(XML namespace),
XMLNS_NS
(XML Namespaces namespace),
and XLINK_NS
(XLink namespace).
VARIABLES
There are following data from the HTML Standard:
$AllVoidElements
-
A hash reference, whose keys are HTML void element names (conforming or non-conforming) and values are true. This list is equal to the list of HTML elements whose "syntax_category" is "void" or "obsolete void" in the JSON data file
<https://github.com/manakai/data-web-defs/blob/master/doc/elements.txt>
. $MathMLTextIntegrationPoints
-
The local names of the MathML text integration point elements
<http://www.whatwg.org/specs/web-apps/current-work/#mathml-text-integration-point>
. Keys are local names and values are true values. $MathMLTextIntegrationPointMathMLElements
-
The tag names of the start tags that are interpreted as MathML elements in MathML text integration point
<http://www.whatwg.org/specs/web-apps/current-work/#tree-construction>
. Keys are tag names (in lowercase) and values are true values. $SVGHTMLIntegrationPoints
-
The local names of the HTML integration point SVG elements
<http://www.whatwg.org/specs/web-apps/current-work/#html-integration-point>
. Keys are local names and values are true values. $MathMLHTMLIntegrationPoints
-
The local names of the HTML integration point MathML elements
<http://www.whatwg.org/specs/web-apps/current-work/#html-integration-point>
. Keys are local names and values are true values.Note that the
annotation-xml
element is NOT in this list (but sometimes it is an HTML integration point). $ForeignContentBreakers
-
The tag names of the start tags that will close foreign elements if they appear in foreign content parsing mode
<http://www.whatwg.org/specs/web-apps/current-work/#parsing-main-inforeign>
. Keys are tag names (in lowercase) and values are true values.Note that the
font
tag name is NOT in this list (but it sometimes closes foreign elements). $MathMLAttrNameFixup
-
Table in adjust MathML attributes
<http://www.whatwg.org/specs/web-apps/current-work/#adjust-mathml-attributes>
. $SVGAttrNameFixup
-
Table in adjust SVG attributes
<http://www.whatwg.org/specs/web-apps/current-work/#adjust-svg-attributes>
. $ForeignAttrNamespaceFixup
-
Table in adjust foreign attributes
<http://www.whatwg.org/specs/web-apps/current-work/#adjust-foreign-attributes>
. $SVGElementNameFixup
-
Table in the rules for parsing tokens in foreign content, any other start tag, an element in the SVG namespace
<http://www.whatwg.org/specs/web-apps/current-work/#parsing-main-inforeign>
. $NamedCharRefs
-
HTML named character references.
$CharRefReplacements
-
The code point replacement table for HTML character references, as specified in HTML Standard
<http://www.whatwg.org/specs/web-apps/current-work/#tokenizing-character-references>
. Keys are original code points (as specified in character references), represented as strings in shortest decimal form, and values are corresponding replaced code points, represented as integers.Note that surrogate code points are not included in this list (but replaced by U+FFFD). Note also that some code points are replaced by the same code point.
$NoncharacterCodePoints
-
The Unicode noncharacter code points. Keys are code points, represented as strings in shortest decimal form, and values are some true values.
Note that variables not mentioned in this section should not be used. They might be removed in later revision of this module.
SPECIFICATION
- HTML
-
HTML Standard
<http://www.whatwg.org/specs/web-apps/current-work/>
.
SOURCES
data-web-defs <https://github.com/manakai/data-web-defs/>
.
data-chars <https://github.com/manakai/data-chars/>
.
LICENSE
You are granted a license to use, reproduce and create derivative works of this file.
The JSON file contains data extracted from HTML Standard. "Written by Ian Hickson (Google, ian@hixie.ch) - Parts © Copyright 2004-2014 Apple Inc., Mozilla Foundation, and Opera Software ASA; You are granted a license to use, reproduce and create derivative works of this document."