HTML-Parser

        (F) The $p->marked_sections() method was invoked in an HTML::Parser
        module that was compiled without support for marked sections.

    Unknown boolean attribute (%d)
        (F) Something is wrong with the internal logic that set up aliases
        for boolean attributes.

    Only code or array references allowed as handler
        (F) The second argument for $p->handler must be either a subroutine
        reference, the name of a subroutine or method, or a reference to an
        array.

    No handler for %s events
        (F) The first argument to $p->handler must be a valid event name;
        i.e. one of "start", "end", "text", "process", "declaration" or
        "comment".
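
        The two handler diagnostics above can be illustrated with a
        minimal sketch. It assumes the version 3 API (api_version => 3);
        the variable names and the "bogus" event name are made up for
        the example and are not part of the module.

```perl
use strict;
use warnings;
use HTML::Parser ();

my $p = HTML::Parser->new(api_version => 3);

# Handler given as a subroutine reference.
my @tags;
$p->handler(start => sub { push @tags, $_[0] }, "tagname");

# Handler given as an array reference: the argument list of every
# "text" event is pushed onto @text as a new array reference.
my @text;
$p->handler(text => \@text, "dtext");

$p->parse("<p>Hello <b>world</b></p>");
$p->eof;

my $text = join "", map { $_->[0] } @text;
print "tags: @tags\n";
print "text: $text\n";

# "bogus" is a hypothetical, invalid event name. The error is fatal,
# so it is trapped with eval here.
my $ok = eval { $p->handler(bogus => sub { }, "tagname"); 1 };
print "rejected: $@" unless $ok;
```

        Because these errors are fatal, code that builds event names or
        argspecs at run time may want to wrap the $p->handler call in
        eval as shown.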

    Unrecognized identifier %s in argspec
        (F) The identifier is not a known argspec name. Use one of the names
        mentioned in the argspec section above.

    Literal string is longer than 255 chars in argspec
        (F) The current implementation limits the length of literals in an
        argspec to 255 characters. Make the literal shorter.

    Backslash reserved for literal string in argspec
        (F) The backslash character "\" is not allowed in argspec literals.
        It is reserved to permit quoting inside a literal in a later
        version.

    Unterminated literal string in argspec
        (F) The terminating quote character for a literal was not found.

    Bad argspec (%s)
        (F) Only identifier names, literals, spaces and commas are allowed
        in argspecs.

    Missing comma separator in argspec
        (F) Identifiers in an argspec must be separated with ",".
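
        As a rough illustration of the argspec rules above, the
        following sketch passes a quoted literal followed by
        comma-separated identifiers, then deliberately omits a comma to
        provoke the error. The $record helper and the sample markup are
        assumptions made for the example.

```perl
use strict;
use warnings;
use HTML::Parser ();

my @events;
my $record = sub { push @events, [@_] };   # hypothetical helper

my $p = HTML::Parser->new(
    api_version => 3,
    # A literal ('S' or 'E') and identifiers, separated by commas.
    start_h => [ $record, "'S', tagname, attr" ],
    end_h   => [ $record, "'E', tagname" ],
);
$p->parse('<a href="x.html">link</a>');
$p->eof;

# Show the literal and the tag name recorded for each event.
printf "%s %s\n", @{$_}[0, 1] for @events;

# Two identifiers with no comma between them: a fatal argspec error.
my $err = eval { $p->handler(start => $record, "tagname attr"); 1 } ? "" : $@;
print "argspec error: $err" if $err;
```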

    Parsing of undecoded UTF-8 will give garbage when decoding entities
        (W) The first chunk parsed appears to contain undecoded UTF-8 and
        one or more argspecs that decode entities are used for the callback
        handlers.

        The result of decoding will be a mix of encoded and decoded
        characters for any entities that expand to characters with code
        above 127. This is not a good thing.

        The recommended solution is to apply Encode::decode_utf8() to the
        data before feeding it to $p->parse(). For $p->parse_file(), pass
        a file that has been opened in ":utf8" mode.

        The alternative solution is to enable the "utf8_mode" and not decode
        before passing strings to $p->parse(). The parser can process raw
        undecoded UTF-8 sanely if the "utf8_mode" is enabled, or if the
        "attr", @attr or "dtext" argspecs are avoided.
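
        Both approaches can be sketched side by side. This is only an
        illustration: the sample bytes and the variable names are
        assumptions, and real input would normally come from a file or
        a network response.

```perl
use strict;
use warnings;
use Encode ();
use HTML::Parser ();

# Raw UTF-8 bytes: "caf", e-acute encoded as two bytes, then an entity.
my $bytes = "<p>caf\xC3\xA9 &eacute;</p>";

# Approach 1: decode to a character string before parsing.
my $chars_out = "";
my $p1 = HTML::Parser->new(
    api_version => 3,
    text_h      => [ sub { $chars_out .= $_[0] }, "dtext" ],
);
$p1->parse(Encode::decode_utf8($bytes));
$p1->eof;

# Approach 2: feed the undecoded bytes and enable utf8_mode, so that
# decoded entities are emitted as UTF-8 bytes as well.
my $bytes_out = "";
my $p2 = HTML::Parser->new(
    api_version => 3,
    text_h      => [ sub { $bytes_out .= $_[0] }, "dtext" ],
);
$p2->utf8_mode(1);
$p2->parse($bytes);
$p2->eof;

# The first result is counted in characters, the second in bytes.
printf "decoded first: %d characters\n", length $chars_out;
printf "utf8_mode:     %d bytes\n",      length $bytes_out;
```

        In the first case the handler receives character strings; in
        the second it receives UTF-8 encoded byte strings, so the two
        should not be mixed without decoding one of them.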

    Parsing string decoded with wrong endian selection
        (W) The first character in the document is U+FFFE. This is not a
        legal Unicode character but a byte-swapped "BOM". The result of
        parsing will likely be garbage.

    Parsing of undecoded UTF-32
        (W) The parser found the Unicode UTF-32 "BOM" signature at the start
        of the document. The result of parsing will likely be garbage.

    Parsing of undecoded UTF-16
        (W) The parser found the Unicode UTF-16 "BOM" signature at the start
        of the document. The result of parsing will likely be garbage.
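
        One way to avoid the BOM-related warnings above is to decode the
        input with the Encode module before parsing. The following is a
        sketch under that assumption; $raw stands in for bytes read from
        a UTF-16 file and is generated here only to keep the example
        self-contained.

```perl
use strict;
use warnings;
use Encode ();
use HTML::Parser ();

# Stand-in for bytes read from a UTF-16 file, BOM included.
my $raw = Encode::encode("UTF-16", "<p>hi</p>");

# The generic "UTF-16" encoding reads the BOM to pick the byte order
# and removes it from the decoded string.
my $html = Encode::decode("UTF-16", $raw);

my $text = "";
my $p = HTML::Parser->new(
    api_version => 3,
    text_h      => [ sub { $text .= $_[0] }, "dtext" ],
);
$p->parse($html);
$p->eof;

print "text: $text\n";
```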

SEE ALSO
    HTML::Entities, HTML::PullParser, HTML::TokeParser, HTML::HeadParser,
    HTML::LinkExtor, HTML::Form

    HTML::TreeBuilder (part of the *HTML-Tree* distribution)

    <http://www.w3.org/TR/html4/>

    More information about marked sections and processing instructions may
    be found at <http://www.is-thought.co.uk/book/sgml-8.htm>.

COPYRIGHT
     Copyright 1996-2016 Gisle Aas. All rights reserved.
     Copyright 1999-2000 Michael A. Chase.  All rights reserved.

    This library is free software; you can redistribute it and/or modify it
    under the same terms as Perl itself.
