what's the easiest way to de-html-ize files?

Chuck Swiger cswiger at mac.com
Mon May 14 19:09:09 UTC 2007


On May 12, 2007, at 12:54 PM, Gary Kline wrote:
> This is for those of us who appreciate ASCII or straight
> 	ISO_8859-15 rather than marked up files.  I have slapped together
> 	a crude C program that does scotch (or *cleanse*) text of
> 	<B></B> and so on.   Still... is there some standalone converter
> 	that gets rids of markup more elegantly?   Something where i
> 	can say
>
> 	% cmd file_1.html ... file_N.html and output file_1.text ...
> 	file_N.text?

Perhaps:

   lynx -dump file1.html ... > file.text

...?

-- 
-Chuck



More information about the freebsd-questions mailing list