[Web4lib] pdf summary

Thomas Dowling tdowling at ohiolink.edu
Thu May 19 08:57:16 EDT 2005


Dudart Stéphane wrote:
> Dear all,
> 
> I manage a private library where I have developed an intranet from which
> users can find information through web applications. One of these
> applications is a full-text database that uses Microsoft's Indexing
> Service. The purpose of Indexing Service is to allow full-text
> searching of one or more folders on the web server via a web interface.
> The indexed files are in PDF format. I use the metadata of each PDF file
> (you can find it from the Adobe writer under File/Document
> Properties/Document Summary) to improve searching with structured
> indexing. The problem is that the list of metadata fields is fixed. For
> example, I cannot add other fields such as date of publication or
> editor, and so on. Do you know a way to customize this list of fields?


In File/Doc Properties/Description/Additional Metadata/Advanced, there's
a list of Dublin Core properties.  It looks like you can create an RDF
file that contains other DC properties and import it.  Adobe calls this
their "Extensible Metadata Platform" (XMP) and has some information at
<http://www.adobe.com/products/xmp/main.html>.  You may also find help
at <http://www.adobe.com/products/xmp/pdfs/xmpspec.pdf> and
<http://www.xml.com/pub/a/2004/09/22/xmp.html>.
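
As a rough illustration of what such an RDF file might contain, here is
a minimal Python sketch that writes an XMP-style packet with two extra
Dublin Core properties.  Several things here are assumptions rather than
anything tested against Acrobat or Indexing Service: that "date of
publication" maps to dc:date and "editor" to dc:publisher, and the
helper name build_xmp and the sample values, which are made up for the
example.

```python
import xml.etree.ElementTree as ET

# Namespace URIs used by XMP, RDF, and Dublin Core.
NS = {
    "x": "adobe:ns:meta/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}
for prefix, uri in NS.items():
    ET.register_namespace(prefix, uri)


def q(prefix, name):
    """Return the fully qualified tag name ElementTree expects."""
    return "{%s}%s" % (NS[prefix], name)


def build_xmp(publication_date, publisher):
    """Hypothetical helper: build an XMP/RDF document as a string.

    Assumption: publication date goes in dc:date (an ordered rdf:Seq)
    and the editor/publisher goes in dc:publisher (an rdf:Bag).
    """
    xmpmeta = ET.Element(q("x", "xmpmeta"))
    rdf = ET.SubElement(xmpmeta, q("rdf", "RDF"))
    desc = ET.SubElement(rdf, q("rdf", "Description"), {q("rdf", "about"): ""})

    date = ET.SubElement(desc, q("dc", "date"))
    seq = ET.SubElement(date, q("rdf", "Seq"))
    ET.SubElement(seq, q("rdf", "li")).text = publication_date

    pub = ET.SubElement(desc, q("dc", "publisher"))
    bag = ET.SubElement(pub, q("rdf", "Bag"))
    ET.SubElement(bag, q("rdf", "li")).text = publisher

    return ET.tostring(xmpmeta, encoding="unicode")


if __name__ == "__main__":
    # Sample values are invented for illustration only.
    print(build_xmp("2005-05-19", "Example Press"))
```

Whether Acrobat accepts the result on import, and whether the indexer
then exposes the new fields for searching, would still need to be
checked against the XMP specification and your own setup.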

-- 
Thomas Dowling
tdowling at ohiolink.edu

