aboutsummaryrefslogtreecommitdiffstats
path: root/www/p5-HTML-ExtractMain/pkg-descr
blob: 5dba2ddcf96b0b103c29f595003c581f7e19589c (plain) (blame)
1
2
3
4
5
HTML::ExtractMain is a module which takes HTML content, and uses the
Readability algorithm to detect the main body of the page, usually
skipping headers, footers, navigation, etc.

WWW: http://search.cpan.org/dist/HTML-ExtractMain/