HTML-ListScraper

 view release on metacpan or  search on metacpan

testdata/del.icio.us.html  view on Meta::CPAN

				<li><a href="/popular/css">css</a></li>
				<li><a href="/popular/webdesign">webdesign</a></li>
				<li><a href="/popular/gallery">gallery</a></li>
				<li><a href="/popular/design">design</a></li>
				<li><a href="/popular/directory">directory</a></li>
			</ul>
			</div>
	</div>
	</li>
</ol>
<ol>
	<li>
		<h4><a href="http://www.trustedreviews.com/printers/review/2007/04/21/The-Inkjet-Investigation/p1" rel="nofollow"><img src="http://images.del.icio.us/static/img/thumbnails/d/9/e/d8bb988cc3426e1f0f4eb07a90fc7.png" width="90" height="68" alt="Trusted...
		<span class="savethis">
			<a href="/post?url=http%3A%2F%2Fwww.trustedreviews.com%2Fprinters%2Freview%2F2007%2F04%2F21%2FThe-Inkjet-Investigation%2Fp1&amp;title=TrustedReviews%20-%20The%20Inkjet%20Investigation&amp;jump=no&amp;partner=delfp" rel="nofollow">save this</a>
		</span>
		</h4>
        <div class="meta">
            <strong><span class="label">people</span> <span class="num"><span class="numbox"><a href="/url/b1cf054305a9cfdd915aa5a54e745a53">96</a></span></span></strong>
        </div>
		<div class="tags">	
		<p><span class="smaller">first posted by</span> <a href="/epiphanius">epiphanius</a></p>
			<span class="label">tags</span>
	    	<div>
			<ul>
				<li><a href="/popular/hardware">hardware</a></li>
				<li><a href="/popular/printers">printers</a></li>
				<li><a href="/popular/inkjet">inkjet</a></li>
				<li><a href="/popular/printing">printing</a></li>
				<li><a href="/popular/printer">printer</a></li>
			</ul>
			</div>
	</div>
	</li>
</ol>
</div>
<p class="hotnow" style="margin: 0;">STILL COOL</p>
</div>

<div id="curated">
<h3>tags to watch <span class="linkmore"><a href="/tag/">more ...</a></span></h3>
<ol>
	<li><h4><a href="/popular/html">html</a></h4>
	<ol>
		<li class="first"><a href="http://www.smashingmagazine.com/2007/02/21/printing-the-web-solutions-and-techniques/">Printing the Web</a></li>
		<li><a href="http://www.degreetutor.com/library/career-starter/115-secrets">Secrets of Self Taught Web Developers - DegreeTutor.com</a></li>
		<li><a href="http://arapehlivanian.com/2007/02/14/understanding-and-solving-the-javascriptcss-entanglement-phenomenon/">JavaScript/CSS entanglement phenomenon</a></li>
	</ol>
	</li>
	<li><h4><a href="/popular/audio">audio</a></h4>
	<ol>
		<li class="first"><a href="http://sourceforge.net/projects/buzz-like">SourceForge.net: VioLet Composer</a></li>
		<li><a href="http://www.last.fm/">Last.FM - Your personal online radio station.</a></li>
		<li><a href="http://www.midomi.com/">midomi</a></li>
	</ol>
	</li>
	<li><h4><a href="/popular/cooking">cooking</a></h4>
	<ol>
		<li class="first"><a href="http://blog.ruhlman.com/2007/02/guest_blogging_.html">ruhlman.com: Guest Blogging: A Bourdain Throwdown</a></li>
		<li><a href="http://www.cookingforengineers.com/">Cooking For Engineers</a></li>
		<li><a href="http://cookthink.com/blog/?p=293">cookthink » Blog Archive » A formula for marinating</a></li>
	</ol>
	</li>
	<li><h4><a href="/popular/realestate">realestate</a></h4>
	<ol>
		<li class="first"><a href="http://www.realtytrac.com/">RealtyTrac</a></li>
		<li><a href="http://www.realestateabc.com/">ABCs of Real Estate</a></li>
		<li><a href="http://www.rentometer.com/">Rentometer.com, by iiProperty: enter an address and get rental comps back!</a></li>
	</ol>
	</li>
	<li><h4><a href="/popular/tv">tv</a></h4>
	<ol>
		<li class="first"><a href="http://www.getdemocracy.com/">Democracy - Internet TV Platform - Free and Open Source</a></li>
		<li><a href="http://www.joost.com/">Joostâ„¢</a></li>
		<li><a href="http://wwitv.com/portal.htm">wwiTV.com - Your guide to Live TV broadcasts on the Internet</a></li>
	</ol>
	</li>
</ol>
</div>

<div class="cleardiv">&nbsp;</div>

</div><!--main-->

<script type="text/javascript">document.write('<div id="bottom">&nbsp;<\/div><div style="visibility:hidden">')</script>
<div id="footer"><div id="footer-inner">
<div id="footer-hr"><hr /></div>
<ul>
<li class="first"><img src="http://images.del.icio.us/static/img/delicious.small.gif" width="10" height="10" alt="" /> <a href="http://del.icio.us/">del.icio.us</a></li><li><a href="http://del.icio.us/about">about</a></li><li><a href="http://blog.del...
</div></div>
<script type="text/javascript">document.write('<\/div>'); window.onresize = footer; footer()</script>
<div style="clear:both"><!-- ie bugfix --></div>


<script type="text/javascript">if(Mp3.go) Mp3.go()</script>

</body>
</html>



( run in 0.441 second using v1.01-cache-2.11-cpan-df04353d9ac )