XHTML results from the CPAN

XHTML

HTML-ExtractContent

view release on metacpan or search on metacpan

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="ja">
<head>
  <link rel="start" href="http://orezdnu.org/" />
  <link rev="made" href="http://orezdnu.org/" />
  <title>Sample for content extraction test (1)</title>
</head>
<body>
  <div id="content">
    <h1>Sample for content extraction test (1)</h1>
    <p>This file is for a simple test that the single content of the page can

t/input2.html view on Meta::CPAN

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="ja">
<head>
  <link rel="start" href="http://orezdnu.org/" />
  <link rev="made" href="http://orezdnu.org/" />
  <title>Sample for content extraction test (2)</title>
</head>
<body>
  <div id="content">
    <h1>Sample for content extraction test (2)</h1>

( run in 1.486 second using v1.01-cache-2.11-cpan-85f18b9d64f )