An HTML and XML tokenizer
THIS MODULE IS DEPRECATED. DON'T USE THIS MODULE FOR NEW APPLICATIONS.
The Web::HTML::Tokenizer module provides an implementation of an HTML and XML tokenizer.
Despite its name,
this module can be used for XML documents as well as HTML.
It is not intended to be used directly by general-purpose applications; instead it is used as part of an HTML or XML parser,
such as Web::HTML::Parser and Web::XML::Parser.
The module is intended to be a conforming HTML tokenizer according to the Web Applications 1.0 specification (though it is meaningless to discuss the conformance of a tokenizer on its own). When the XML flag is set, it can also tokenize XML documents in a way consistent with the HTML tokenization specification. You might consider it an implementation of the XML5 tokenization algorithm as "patched" by later HTML5 development.
HTML Living Standard
Copyright 2007-2014 Wakaba <email@example.com>.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.