Extracting Title Metadata from MSWord

Post Reply
joerg.hans.pasch
Posts: 13
Joined: Mon Jul 22, 2002 9:37 am

Extracting Title Metadata from MSWord

Post by joerg.hans.pasch »

Hi all,

I could only find rather old threads on this topic; hence this new one from myself now:

By default the dowalk script from Webinator 5.1 does not extract the title metadata (within Word: Menu File > Properties > Summary > Title etc.).

Instead the result list shows only placeholders like:
MSWordDocument (123k)

How can I tell the dowalk script/anytotx plugin to extract that data and display it in the result list?

Thanks a lot for your help.
Kind regards
Joerg
User avatar
John
Site Admin
Posts: 2622
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH
Contact:

Extracting Title Metadata from MSWord

Post by John »

The current version of Webinator should extract the title correctly from word documents. If you can send a copy or link to a word document that doesn't work to support then they can take a look at it.
John Turnbull
Thunderstone Software
joerg.hans.pasch
Posts: 13
Joined: Mon Jul 22, 2002 9:37 am

Extracting Title Metadata from MSWord

Post by joerg.hans.pasch »

Unfortunatley documents are confidential and on Intranet only. But I have seen that they were protected using the Word command "Tools > Protect Document > Protect Document for Forms".

Yet, within Word, I can still look at and read the metadata; they are just "greyed out", that means I could not change them anymore. (Important: opening the Word document is not password protected! It is just editing the document that is prohibited.)

Has Anytotx problems with extracting metadata from a protected Word document?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Extracting Title Metadata from MSWord

Post by mark »

anytotx is able to get titles etc from protected documents. Perhaps there's something else wrong with those documents. If you can reproduce the problem in a non-confidential file please submit it to tech support.
Post Reply