[WEB4LIB] Re: Text extraction from PDF files.
Craighton Hippenhammer
CHHammer at olivet.edu
Thu Sep 6 10:29:55 EDT 2001
Redwing from Datawatch will pull text from PDF files. See:
http://www.datawatch.com/docs/products/redwing/index.html
Craighton Hippenhammer
Information Technology Librarian
Olivet Nazarene University
chhammer at olivet.edu
>>> Tony Barry <me at Tony-Barry.emu.id.au> 09/06/01 02:23AM >>>
At 7:06 PM -0700 5/9/01, Tony Parsons wrote:
>Does anyone know how to extract plain text from a pdf file?
http://www.pdfzone.com/news/bclconversionservice.html
and
http://access.adobe.com/simple_form.html
will convert PDF to HTML. After that, converting the HTML to plain text is trivial.
Tony
--
phone +61 2 6241 7659
mailto:me at Tony-Barry.emu.id.au
http://purl.oclc.org/NET/Tony.Barry
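
The HTML-to-text step mentioned above might look something like the following. This is only a rough sketch, assuming the HTML returned by one of the conversion services has already been saved or fetched as a string. It uses Python's standard-library html.parser module, which is not something the thread itself names; the class TextExtractor and the function html_to_text are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes, skipping script/style content."""

    SKIP = {"script", "style"}
    # Tags treated as line breaks so paragraphs do not run together.
    BLOCK = {"p", "br", "div", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a script or style element.
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string, one block per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.parts)
    # Collapse runs of whitespace within each line and drop empty lines.
    lines = (" ".join(line.split()) for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><p>Hello &amp; <b>world</b></p></body></html>"
    print(html_to_text(sample))
```

Real converter output may contain tables, positioned layout, or broken markup that needs more care than this; the sketch only strips tags and decodes character entities.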