HTML-Microformats
view release on metacpan or search on metacpan
lib/HTML/Microformats.pm view on Meta::CPAN
return $self;
}
use Module::Pluggable
require => 1,
inner => 0,
search_path => ['HTML::Microformats::Format'],
only => qr/^HTML::Microformats::Format::[^:]+$/,
sub_name => 'modules',
;
sub formats
{
my $class = shift || __PACKAGE__;
return
sort { lc $a cmp lc $b }
map { s/^HTML::Microformats::Format:://; $_ }
$class->modules;
}
1;
__END__
=head1 NAME
HTML::Microformats - parse microformats in HTML
=head1 SYNOPSIS
use HTML::Microformats;
my $doc = HTML::Microformats
->new_document($html, $uri)
->assume_profile(qw(hCard hCalendar));
print $doc->json(pretty => 1);
use RDF::TrineShortcuts qw(rdf_query);
my $results = rdf_query($sparql, $doc->model);
=head1 DESCRIPTION
The HTML::Microformats module is a wrapper for parser and handler
modules of various individual microformats (each of those modules
has a name like HTML::Microformats::Format::Foo).
The general pattern of usage is to create an HTML::Microformats
object (which corresponds to an HTML document) using the
C<new_document> method; then ask for the data, as a Perl hashref,
a JSON string, or an RDF::Trine model.
=head2 Constructor
=over 4
=item C<< $doc = HTML::Microformats->new_document($html, $uri, %opts) >>
Constructs a document object.
$html is the HTML or XHTML source (string) or an XML::LibXML::Document.
$uri is the document URI, important for resolving relative URL references.
%opts are additional parameters; currently only one option is defined:
$opts{'type'} is set to 'text/html' or 'application/xhtml+xml', to
control how $html is parsed.
=back
=head2 Profile Management
HTML::Microformats uses HTML profiles (i.e. the profile attribute on the
HTML <head> element) to detect which Microformats are used on a page. Any
microformats which do not have a profile URI declared will not be parsed.
Because many pages fail to properly declare which profiles they use, there
are various profile management methods to tell HTML::Microformats to
assume the presence of particular profile URIs, even if they're actually
missing.
=over 4
=item C<< $doc->profiles >>
This method returns a list of profile URIs declared by the document.
=item C<< $doc->has_profile(@profiles) >>
This method returns true if and only if one or more of the profile URIs
in @profiles is declared by the document.
=item C<< $doc->add_profile(@profiles) >>
Using C<add_profile> you can add one or more profile URIs, and they are
treated as if they were found on the document.
For example:
$doc->add_profile('http://microformats.org/profile/rel-tag')
This is useful for adding profile URIs declared outside the document itself
(e.g. in HTTP headers).
Returns a reference to the document.
=item C<< $doc->assume_profile(@microformats) >>
For example:
$doc->assume_profile(qw(hCard adr geo))
This method acts similarly to C<add_profile> but allows you to use
names of microformats rather than URIs.
Microformat names are case sensitive, and must match
HTML::Microformats::Format::Foo module names.
Returns a reference to the document.
=item C<< $doc->assume_all_profiles >>
( run in 1.565 second using v1.01-cache-2.11-cpan-119454b85a5 )