HTML-Parser
view release on metacpan or search on metacpan
(F) The $p->marked_sections() method was invoked in a HTML::Parser
module that was compiled without support for marked sections.
Unknown boolean attribute (%d)
(F) Something is wrong with the internal logic that set up aliases
for boolean attributes.
Only code or array references allowed as handler
(F) The second argument for $p->handler must be either a subroutine
reference, then name of a subroutine or method, or a reference to an
array.
No handler for %s events
(F) The first argument to $p->handler must be a valid event name;
i.e. one of "start", "end", "text", "process", "declaration" or
"comment".
Unrecognized identifier %s in argspec
(F) The identifier is not a known argspec name. Use one of the names
mentioned in the argspec section above.
Literal string is longer than 255 chars in argspec
(F) The current implementation limits the length of literals in an
argspec to 255 characters. Make the literal shorter.
Backslash reserved for literal string in argspec
(F) The backslash character "\" is not allowed in argspec literals.
It is reserved to permit quoting inside a literal in a later
version.
Unterminated literal string in argspec
(F) The terminating quote character for a literal was not found.
Bad argspec (%s)
(F) Only identifier names, literals, spaces and commas are allowed
in argspecs.
Missing comma separator in argspec
(F) Identifiers in an argspec must be separated with ",".
Parsing of undecoded UTF-8 will give garbage when decoding entities
(W) The first chunk parsed appears to contain undecoded UTF-8 and
one or more argspecs that decode entities are used for the callback
handlers.
The result of decoding will be a mix of encoded and decoded
characters for any entities that expand to characters with code
above 127. This is not a good thing.
The recommended solution is to apply Encode::decode_utf8() on the
data before feeding it to the $p->parse(). For $p->parse_file() pass
a file that has been opened in ":utf8" mode.
The alternative solution is to enable the "utf8_mode" and not decode
before passing strings to $p->parse(). The parser can process raw
undecoded UTF-8 sanely if the "utf8_mode" is enabled, or if the
"attr", @attr or "dtext" argspecs are avoided.
Parsing string decoded with wrong endian selection
(W) The first character in the document is U+FFFE. This is not a
legal Unicode character but a byte swapped "BOM". The result of
parsing will likely be garbage.
Parsing of undecoded UTF-32
(W) The parser found the Unicode UTF-32 "BOM" signature at the start
of the document. The result of parsing will likely be garbage.
Parsing of undecoded UTF-16
(W) The parser found the Unicode UTF-16 "BOM" signature at the start
of the document. The result of parsing will likely be garbage.
SEE ALSO
HTML::Entities, HTML::PullParser, HTML::TokeParser, HTML::HeadParser,
HTML::LinkExtor, HTML::Form
HTML::TreeBuilder (part of the *HTML-Tree* distribution)
<http://www.w3.org/TR/html4/>
More information about marked sections and processing instructions may
be found at <http://www.is-thought.co.uk/book/sgml-8.htm>.
COPYRIGHT
Copyright 1996-2016 Gisle Aas. All rights reserved.
Copyright 1999-2000 Michael A. Chase. All rights reserved.
This library is free software; you can redistribute it and/or modify it
under the same terms as Perl itself.
( run in 0.896 second using v1.01-cache-2.11-cpan-d7f47b0818f )