PDF metadata indexing.

Post Reply
polarbear128
Posts: 5
Joined: Thu Nov 01, 2007 3:46 pm

PDF metadata indexing.

Post by polarbear128 »

Hi,
We're about to purchase the PDF/Doc plugin for Webinator, but I need to know if the plugin indexes all metadata in the PDF, including custom metadata and XMP metadata, or if it just indexes the usual suspects.

Cheers
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

PDF metadata indexing.

Post by mark »

The meta data fields available from PDF documents are:

Author CreationDate ModDate Creator Producer Title Subject Keywords X-Print X-Change X-Copy X-Addnotes X-Linear X-Encrypted X-Pages X-PDF-Version X-Tagged X-Filter-Version
polarbear128
Posts: 5
Joined: Thu Nov 01, 2007 3:46 pm

PDF metadata indexing.

Post by polarbear128 »

Thanks for the reply.
So, just to confirm, Webinator doesn't index custom metadata at all. Has anyone worked out a way around this limitation?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

PDF metadata indexing.

Post by mark »

If you provide a sample file and what you expect to get out of it someone could take a look into it.
polarbear128
Posts: 5
Joined: Thu Nov 01, 2007 3:46 pm

PDF metadata indexing.

Post by polarbear128 »

I have one here...how do i attach it to a post?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

PDF metadata indexing.

Post by mark »

You can't attach to a post. You can paste the link to it if it's on a public web server. Otherwise you'll have to open a tech support ticket so you can email it.
polarbear128
Posts: 5
Joined: Thu Nov 01, 2007 3:46 pm

PDF metadata indexing.

Post by polarbear128 »

I've uploaded here:

http://download.yousendit.com/92E1BDCC29499A14

in the pdf, if you open in a text editor and search for mytest, you can see the xml node of the custom property...this seems to be in the same format as the ModifyDate etc nodes...although with a different namespace prefix.
What we're aiming to do is to have things like categories specified in the custom metadata, and filter searches based on that (after modifying the search script of course)..but of course to do that we need webinator to index that arbitrary metadata.
Perhaps a change to the dowalk script would be needed? The implementation isn't really a concern (apart from time), but we just need to know whether it is possible to do.
polarbear128
Posts: 5
Joined: Thu Nov 01, 2007 3:46 pm

PDF metadata indexing.

Post by polarbear128 »

Great!
thanks for your help.
Post Reply