any shortcuts to doc to ascii?

Polytropon freebsd at edvax.de
Thu May 27 03:03:06 UTC 2010


On Wed, 26 May 2010 18:38:47 -0700, Gary Kline <kline at thought.org> wrote:
> 
> 
> guys,
> 
> is there anything that can take these hex triplets such as
> 
> We Don\xe2\x80\x99t
> 
> and render them back to the ascii or keyboard equivalents?
> in this case, the \x99 would be an apostrophe.
> thus:
> 
> 
> We Don't
> 
> tia,
> 
> gsry
> 
> ps: even lynx -dump messes up, i believe.  i'm trying to go from
> DOC  back to typewriter.... 


Yes, even a typewriter is better than DOC. :-)

To convert DOC files to ASCII, there are several ways, of
varying complexity:

The most complex: Use OpenOffice or Abiword, open the file and
save it as plain text. Any "special characters" it contained
should then be in regular ASCII representation.

Better: Use catdoc or antiword (both available from ports).

I'm not sure to what extent conflicting codepages are involved.
It is known that "Windows" does have problems supporting standards,
and this applies to character sets and language variations, too.
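
For text that already contains escapes like the ones in the question,
here is a minimal Python sketch. It assumes the input holds literal
\xNN escapes that stand for UTF-8 bytes. Under that assumption, the
three bytes e2 80 99 together encode a single character, U+2019 (right
single quotation mark), so it is the whole triplet that maps to the
apostrophe, not \x99 alone. The function name escapes_to_ascii and the
replacement table are illustrative assumptions, not part of catdoc,
antiword, or any other tool mentioned here.

```python
import re

# Typographic characters that often appear in text exported from DOC files,
# mapped to plain ASCII stand-ins. Illustrative only, not exhaustive.
ASCII_EQUIVALENTS = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark (the apostrophe in "Don't")
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # horizontal ellipsis
    "\u00a0": " ",    # no-break space
}

# One or more consecutive literal escapes such as \xe2\x80\x99
_ESCAPE_RUN = re.compile(r"(?:\\x[0-9a-fA-F]{2})+")


def _decode_run(match):
    """Turn a run of literal \\xNN escapes into bytes and decode as UTF-8."""
    hex_digits = match.group(0).replace("\\x", "")
    raw = bytes.fromhex(hex_digits)
    # Invalid UTF-8 becomes U+FFFD instead of raising an exception.
    return raw.decode("utf-8", errors="replace")


def escapes_to_ascii(text):
    """Decode literal \\xNN escapes, then reduce the result to plain ASCII."""
    decoded = _ESCAPE_RUN.sub(_decode_run, text)
    for fancy, plain in ASCII_EQUIVALENTS.items():
        decoded = decoded.replace(fancy, plain)
    # Anything still outside ASCII is replaced with "?".
    return decoded.encode("ascii", errors="replace").decode("ascii")


print(escapes_to_ascii(r"We Don\xe2\x80\x99t"))
```

Characters missing from the table come out as "?", which at least makes
them easy to spot and add. If the escapes turn out to represent a
Windows codepage rather than UTF-8, the decode step would need a
different codec.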



-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...

