pdf edit again.
Gary Kline
kline at tao.thought.org
Sat Nov 3 18:55:20 PDT 2007
On Sun, Nov 04, 2007 at 02:39:14AM +0100, cpghost wrote:
> On Sat, 3 Nov 2007 16:38:55 -0800
> Gary Kline <kline at tao.thought.org> wrote:
>
> > A couple weeks ago I skimmed thru the postings on editing PDF
> > files. Wasn't entirely clear what the answer it because I
> > never thought I would need to edit a GUI file. I just found a book
> > from 1883 in pdf format. I would like a text/ASCII/ISO_8859-1
> > version. Tried pfdtotext, but it doesn't work. Nutshell: is
> > there something I can use to edit/look-at this book and get
> > rid of whateveriit is that's causing pdftotext to fail. (sorry for
> > the grammar.... )
>
> Old books in PDF are normally scanned bitmaps. There are no characters
> or whatever therein; just pixels (EPS files). If you want to convert
> that to ASCII, you'd need to extract the EPS files (use something like
> pdfimages from the xpdf port), turn them into some bitmap format, and
> run some kind of OCR software on that. It's a slow, unreliable,
> error-prone and painful process though.
>
> Good luck!
"Arrrgh" (Charlie Brown). If it's that tortured, I'll forget
it; thanks for the clue. Pretty sure this *was* just phot'd and
scanned in.
(Much be how amazon.com has thir zillions of boooks online.
OCR'ing is serious work; I know that first hand.)
gary
>
> -cpghost.
>
> --
> Cordula's Web. http://www.cordula.ws/
> _______________________________________________
> freebsd-questions at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to "freebsd-questions-unsubscribe at freebsd.org"
--
Gary Kline kline at thought.org www.thought.org Public Service Unix
http://jottings.thought.org http://transfinite.thought.org
More information about the freebsd-questions
mailing list