aboutsummaryrefslogtreecommitdiffstats
path: root/textproc/html2text/pkg-descr
blob: 930d340e557c5513c8d02187e9b75fd53dce8553 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
html2text is  a command  line utility, written  in C++,  that converts
HTML documents (HTML 3.2) into plain text (ISO 8859-1).

Each HTML  document is loaded from  a location indicated by  an URI or
read from  standard input, and formatted  into a stream of  plain text
characters that is written to  standard output or into an output-file.
The input-URI may  specify a remote site, from that  the documents are
loaded with  the Hypertext  Transfer Protocol  (HTTP). The  program is
even  able to  preserve the  original  positions of  table fields  and
accepts also syntactically incorrect input, attempting to interpret it
"reasonably".  The rendering  is  largely customisable  through an  RC
file.

WWW: http://userpage.fu-berlin.de/~mbayer/tools/html2text.html

- Simon 'corecode' Schubert