pdf edit again.

cpghost cpghost at cordula.ws
Sat Nov 3 18:39:37 PDT 2007

On Sat, 3 Nov 2007 16:38:55 -0800
Gary Kline <kline at tao.thought.org> wrote:

> 	A couple weeks ago I skimmed thru the postings on editing PDF
> 	files.  Wasn't entirely clear what the answer it because I
> never thought I would need to edit a GUI file.  I just found a book 
> 	from 1883 in pdf format.  I would like a text/ASCII/ISO_8859-1
> 	version.  Tried pfdtotext, but it doesn't work.   Nutshell: is
> 	there something I can use  to edit/look-at this book and get
> rid of whateveriit is that's causing pdftotext to fail.  (sorry for
> 	the grammar.... )

Old books in PDF are normally scanned bitmaps. There are no characters
or whatever therein; just pixels (EPS files). If you want to convert
that to ASCII, you'd need to extract the EPS files (use something like
pdfimages from the xpdf port), turn them into some bitmap format, and
run some kind of OCR software on that. It's a slow, unreliable,
error-prone and painful process though.

Good luck!


Cordula's Web. http://www.cordula.ws/

More information about the freebsd-questions mailing list