[WEB4LIB] Re:Text extraction from pdf files.

Craighton Hippenhammer CHHammer at olivet.edu
Thu Sep 6 10:29:55 EDT 2001


Redwing from Datawatch will pull text from pdf files.  See:

http://www.datawatch.com/docs/products/redwing/index.html





Craighton Hippenhammer
Information Technology Librarian
Olivet Nazarene University
chhammer at olivet.edu

>>> Tony Barry <me at Tony-Barry.emu.id.au> 09/06/01 02:23AM >>>
At 7:06 PM -0700 5/9/01, Tony Parsons wrote:
>Does anyone know how to extract plain text from a pdf file?

http://www.pdfzone.com/news/bclconversionservice.html 

and

http://access.adobe.com/simple_form.html 

will convert PDF to HTML. After that a conversion to text is trivial.

Tony

-- 
phone  +61 2 6241 7659
mailto:me at Tony-Barry.emu.id.au 
http://purl.oclc.org/NET/Tony.Barry



More information about the Web4lib mailing list