Plack-App-MCCS
view release on metacpan or search on metacpan
local/lib/perl5/x86_64-linux-thread-multi/HTML/Parser.pm view on Meta::CPAN
(F) The second argument for $p->handler must be either a subroutine
reference, then name of a subroutine or method, or a reference to an
array.
=item No handler for %s events
(F) The first argument to $p->handler must be a valid event name; i.e. one
of "start", "end", "text", "process", "declaration" or "comment".
=item Unrecognized identifier %s in argspec
(F) The identifier is not a known argspec name.
Use one of the names mentioned in the argspec section above.
=item Literal string is longer than 255 chars in argspec
(F) The current implementation limits the length of literals in
an argspec to 255 characters. Make the literal shorter.
=item Backslash reserved for literal string in argspec
(F) The backslash character "\" is not allowed in argspec literals.
It is reserved to permit quoting inside a literal in a later version.
=item Unterminated literal string in argspec
(F) The terminating quote character for a literal was not found.
=item Bad argspec (%s)
(F) Only identifier names, literals, spaces and commas
are allowed in argspecs.
=item Missing comma separator in argspec
(F) Identifiers in an argspec must be separated with ",".
=item Parsing of undecoded UTF-8 will give garbage when decoding entities
(W) The first chunk parsed appears to contain undecoded UTF-8 and one
or more argspecs that decode entities are used for the callback
handlers.
The result of decoding will be a mix of encoded and decoded characters
for any entities that expand to characters with code above 127. This
is not a good thing.
The recommended solution is to apply Encode::decode_utf8() on the data before
feeding it to the $p->parse(). For $p->parse_file() pass a file that has been
opened in ":utf8" mode.
The alternative solution is to enable the C<utf8_mode> and not decode before
passing strings to $p->parse(). The parser can process raw undecoded UTF-8
sanely if the C<utf8_mode> is enabled, or if the C<attr>, C<@attr> or C<dtext>
argspecs are avoided.
=item Parsing string decoded with wrong endian selection
(W) The first character in the document is U+FFFE. This is not a
legal Unicode character but a byte swapped C<BOM>. The result of parsing
will likely be garbage.
=item Parsing of undecoded UTF-32
(W) The parser found the Unicode UTF-32 C<BOM> signature at the start
of the document. The result of parsing will likely be garbage.
=item Parsing of undecoded UTF-16
(W) The parser found the Unicode UTF-16 C<BOM> signature at the start of
the document. The result of parsing will likely be garbage.
=back
=head1 SEE ALSO
L<HTML::Entities>, L<HTML::PullParser>, L<HTML::TokeParser>, L<HTML::HeadParser>,
L<HTML::LinkExtor>, L<HTML::Form>
L<HTML::TreeBuilder> (part of the I<HTML-Tree> distribution)
L<http://www.w3.org/TR/html4/>
More information about marked sections and processing instructions may
be found at L<http://www.is-thought.co.uk/book/sgml-8.htm>.
=head1 COPYRIGHT
Copyright 1996-2016 Gisle Aas. All rights reserved.
Copyright 1999-2000 Michael A. Chase. All rights reserved.
This library is free software; you can redistribute it and/or
modify it under the same terms as Perl itself.
=cut
( run in 0.757 second using v1.01-cache-2.11-cpan-39bf76dae61 )