[Web4lib] Extracting images from PDF?
Binkley, Peter
Peter.Binkley at ualberta.ca
Wed Dec 14 19:05:24 EST 2005
I've used pdfimage.exe, part of the open-source xpdf package, available
here: http://gnuwin32.sourceforge.net/packages/xpdf.htm. It gives you
the images in pbm format, which you can batch-convert to tiff or
whatever using a standard graphic tool such as IrfanView. I used it on
pdfs containing just black-and-white images. It did the job.
Peter
Peter Binkley
Digital Initiatives Technology Librarian
Information Technology Services
4-30 Cameron Library
University of Alberta Libraries
Edmonton, Alberta
Canada T6G 2J8
Phone: (780) 492-3743
Fax: (780) 492-9243
e-mail: peter.binkley at ualberta.ca
> -----Original Message-----
> From: web4lib-bounces at webjunction.org
> [mailto:web4lib-bounces at webjunction.org] On Behalf Of Robert Sullivan
> Sent: Wednesday, December 14, 2005 01:48 PM
> To: web4lib at webjunction.org; genealib at lists.acomp.usf.edu
> Subject: [Web4lib] Extracting images from PDF?
>
> A local labor organization had some old newsletters scanned
> and presented us with 4 CDs of PDFs, with each issue a separate file.
>
> This would be great, except that they were scanned from a bound volume
> 2 pages at a time, so any given file will contain the last
> page of the previous issue and be missing the last page of
> the issue named. This portends some amount of patron confusion.
>
> I'm considering trying to take these images apart and
> reassemble them in a more useful way. We have Acrobat 7, but
> our graphics staff uses it at a fairly low level so they
> can't help me. I have found references to software which
> will let you save images form PDFs as TIFFs, but I was hoping
> for some real world experience.
>
> Thanks for any advice on the least painful way to handle this,
>
> --
> Bob Sullivan
> Schenectady Digital History Archive
> <http://www.schenectadyhistory.org/>
> Schenectady County (NY) Public Library
> _______________________________________________
> Web4lib mailing list
> Web4lib at webjunction.org
> http://lists.webjunction.org/web4lib/
>
More information about the Web4lib
mailing list