CrawlerCommons-RobotRulesParser
view release on metacpan or search on metacpan
lib/CrawlerCommons/RobotRulesParser.pm view on Meta::CPAN
URL string that's parsed in a URI object to provide scheme, authority, and path
for sitemap directive values. If the directive's value begins with a '/', it
overrides the path value provided by this URL context string.
=item * C<$content>
The text content of the robots.txt file to be parsed.
=item * C<$content_type>
The content-type of the robots.txt content to be parsed. Assumes text/plain by
default. If type is text/html, the parser will attempt to strip-out html tags
from the content.
=item * C<$robot_name>
A string signifying for which user-agent(s) the rules should be extracted.
=back
=cut
( run in 1.305 second using v1.01-cache-2.11-cpan-524268b4103 )