CrawlerCommons-RobotRulesParser

lib/CrawlerCommons/RobotRulesParser.pm

URL string that is parsed into a URI object to provide the scheme, authority,
and path for sitemap directive values.  If a directive's value begins with a
'/', that value overrides the path provided by this URL context string.

=item * C<$content>

The text content of the robots.txt file to be parsed.

=item * C<$content_type>

The content type of the robots.txt content to be parsed.  Defaults to
C<text/plain>.  If the type is C<text/html>, the parser attempts to strip out
HTML tags from the content.

=item * C<$robot_name>

A string naming the user-agent(s) whose rules should be extracted.

=back
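
The two behaviours described above (tag stripping for C<text/html> content and
the path override for sitemap values that begin with '/') can be illustrated
with a minimal, self-contained sketch.  The helpers C<strip_tags_if_html> and
C<resolve_sitemap> are hypothetical names introduced only for this example;
they are not part of this module, and the regular expressions are assumptions
about the general idea rather than the parser's actual implementation.

    use strict;
    use warnings;

    # Hypothetical helper (not part of this module): remove anything that
    # looks like an HTML tag, but only when the content type is text/html.
    # A missing or empty type is treated as text/plain.
    sub strip_tags_if_html {
        my ($content, $content_type) = @_;
        $content_type = 'text/plain'
            unless defined $content_type && length $content_type;
        return $content unless $content_type =~ m{^text/html\b}i;
        (my $stripped = $content) =~ s/<[^>]*>//g;
        return $stripped;
    }

    # Hypothetical helper: a sitemap value that starts with '/' replaces the
    # path of the context URL; any other value is returned unchanged.
    sub resolve_sitemap {
        my ($context_url, $value) = @_;
        return $value unless $value =~ m{^/};
        # Keep only "scheme://authority" from the context URL.
        my ($origin) = $context_url =~ m{^([a-z][a-z0-9+.-]*://[^/?#]+)}i
            or return $value;
        return $origin . $value;
    }

    my $html = "<pre>User-agent: *\nDisallow: /tmp</pre>";
    print strip_tags_if_html($html, 'text/html'), "\n";
    print resolve_sitemap('http://example.com/robots.txt', '/sitemap.xml'), "\n";

In the sketch, content passed with any type other than C<text/html> is returned
untouched, and a sitemap value that is already an absolute URL is left as
given.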

=cut