gw spider and re-index

puttakanda
Posts: 29
Joined: Wed Sep 19, 2001 5:47 pm

gw spider and re-index

Post by puttakanda »

We have a licensed version of Texis and I wish to make use of gw to spider the site and create index.
We are planning to have docs (docs, pdf) stored in docroot in a predefined structure based on region(US, Europe etc) and language.

Two questions:
1. I need to spider and create index for the site (if possible in a way identifying the language and region)
2. I need to re-index the site each night (I had problems telling that the url already existed)
puttakanda
Posts: 29
Joined: Wed Sep 19, 2001 5:47 pm

gw spider and re-index

Post by puttakanda »

The above is on a Win2K box..
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

gw spider and re-index

Post by mark »

1) Use the -n option to specify the plugin for the doc and pdf files. See http://www.thunderstone.com/site/gw25man/node52.html

You might want to place each language into it's own database (-d) so english users don't get german hits by accident. You will probably need to use the -j option to keep the walk under the desired language/region.

2) Use -rewalk