The manakai project

Web::HTML::ParserData

Data for HTML parser

DESCRIPTION

The Web::HTML::ParserData module contains data for HTML and XML parsers, extracted from the HTML Standard.

CONSTANTS

Following constants returning namespace URLs are defined (but not exported): HTML_NS (HTML namespace), SVG_NS (SVG namespace), MML_NS (MathML namespace), XML_NS (XML namespace), XMLNS_NS (XML Namespaces namespace), and XLINK_NS (XLink namespace).

VARIABLES

There are following data from the HTML Standard:

$AllVoidElements

A hash reference, whose keys are HTML void element names (conforming or non-conforming) and values are true. This list is equal to the list of HTML elements whose "syntax_category" is "void" or "obsolete void" in the JSON data file <https://github.com/manakai/data-web-defs/blob/master/doc/elements.txt>.

$MathMLTextIntegrationPoints

The local names of the MathML text integration point elements <http://www.whatwg.org/specs/web-apps/current-work/#mathml-text-integration-point>. Keys are local names and values are true values.

$MathMLTextIntegrationPointMathMLElements

The tag names of the start tags that are interpreted as MathML elements in MathML text integration point <http://www.whatwg.org/specs/web-apps/current-work/#tree-construction>. Keys are tag names (in lowercase) and values are true values.

$SVGHTMLIntegrationPoints

The local names of the HTML integration point SVG elements <http://www.whatwg.org/specs/web-apps/current-work/#html-integration-point>. Keys are local names and values are true values.

$MathMLHTMLIntegrationPoints

The local names of the HTML integration point MathML elements <http://www.whatwg.org/specs/web-apps/current-work/#html-integration-point>. Keys are local names and values are true values.

Note that the annotation-xml element is NOT in this list (but sometimes it is an HTML integration point).

$ForeignContentBreakers

The tag names of the start tags that will close foreign elements if they appear in foreign content parsing mode <http://www.whatwg.org/specs/web-apps/current-work/#parsing-main-inforeign>. Keys are tag names (in lowercase) and values are true values.

Note that the font tag name is NOT in this list (but it sometimes closes foreign elements).

$MathMLAttrNameFixup

Table in adjust MathML attributes <http://www.whatwg.org/specs/web-apps/current-work/#adjust-mathml-attributes>.

$SVGAttrNameFixup

Table in adjust SVG attributes <http://www.whatwg.org/specs/web-apps/current-work/#adjust-svg-attributes>.

$ForeignAttrNamespaceFixup

Table in adjust foreign attributes <http://www.whatwg.org/specs/web-apps/current-work/#adjust-foreign-attributes>.

$SVGElementNameFixup

Table in the rules for parsing tokens in foreign content, any other start tag, an element in the SVG namespace <http://www.whatwg.org/specs/web-apps/current-work/#parsing-main-inforeign>.

$NamedCharRefs

HTML named character references.

$CharRefReplacements

The code point replacement table for HTML character references, as specified in HTML Standard <http://www.whatwg.org/specs/web-apps/current-work/#tokenizing-character-references>. Keys are original code points (as specified in character references), represented as strings in shortest decimal form, and values are corresponding replaced code points, represented as integers.

Note that surrogate code points are not included in this list (but replaced by U+FFFD). Note also that some code points are replaced by the same code point.

$NoncharacterCodePoints

The Unicode noncharacter code points. Keys are code points, represented as strings in shortest decimal form, and values are some true values.

Note that variables not mentioned in this section should not be used. They might be removed in later revision of this module.

SPECIFICATION

HTML

HTML Standard <http://www.whatwg.org/specs/web-apps/current-work/>.

SOURCES

data-web-defs <https://github.com/manakai/data-web-defs/>.

data-chars <https://github.com/manakai/data-chars/>.

LICENSE

You are granted a license to use, reproduce and create derivative works of this file.

The JSON file contains data extracted from HTML Standard. "Written by Ian Hickson (Google, ian@hixie.ch) - Parts © Copyright 2004-2014 Apple Inc., Mozilla Foundation, and Opera Software ASA; You are granted a license to use, reproduce and create derivative works of this document."