blob: 930d340e557c5513c8d02187e9b75fd53dce8553 (
plain) (
blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
|
html2text is a command line utility, written in C++, that converts
HTML documents (HTML 3.2) into plain text (ISO 8859-1).
Each HTML document is loaded from a location indicated by an URI or
read from standard input, and formatted into a stream of plain text
characters that is written to standard output or into an output-file.
The input-URI may specify a remote site, from that the documents are
loaded with the Hypertext Transfer Protocol (HTTP). The program is
even able to preserve the original positions of table fields and
accepts also syntactically incorrect input, attempting to interpret it
"reasonably". The rendering is largely customisable through an RC
file.
WWW: http://userpage.fu-berlin.de/~mbayer/tools/html2text.html
- Simon 'corecode' Schubert
|