Indexing Word Docs

Post Reply
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Indexing Word Docs

Post by Thunderstone »



We are testing the gw walker against a directory full of MS Word Docs. As
the gw runs, we can see in the log that all the word docs are being
retrieved. For example, we see entries like "Retrieving
http://localhost/Flex_Content_Test/Status030198.doc"

Yet, no search of the database ever finds any of these documents. We
included "doc" in the list of extensions for the index run, but nothing
shows. We do see the non-Word (HTML in this case) docs in the searches,
but that's it.

What can we look at in the files or database to see if the MS Word docs (or
their URL's) have been indexed?

Is there something else we have to do to get webinator to recognize and
index a MS Word doc?


Thanks,


Scott Cochran
Online Development



------- Distributed Internet Applications --------
Online Development www.ondev.com
13555 Automobile Blvd #350
Clearwater, FL 33762 (813) 556-0120


User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Indexing Word Docs

Post by Thunderstone »




You need the PDF and Word Processor plugin which is only available to
commercial Webinator users. See
http://www.thunderstone.com/webinator/
for pricing.

You might be able to search for some of the words in the documents, but
without the word processor filter the stored Word documents will be largely
unusable.


User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Indexing Word Docs

Post by Thunderstone »



You dont have the word-processor/PDF plugin.

Its a $600 add-on to to the Commercial Webinator package.




Post Reply