
Rewalk - what does this really do?

Posted: Wed Mar 31, 1999 3:45 pm
by Thunderstone



Hi -

It's not clear to me what the -rewalk function does. There are two
specific cases I'm interested in:

Say I have a file called /FAQ/index.html whose content gets rotated.
If I tell Webinator to rewalk that document (or, more likely, the
whole document tree), is it going to refetch that document and reindex
all of its contents? Or is it going to look at the URL and assume that
it already has that document?

Also, what happens if a document disappears? Say, I have
/FAQ/marchindex.html, and now that March is (almost) over, I replace
that file with /FAQ/aprilindex.html. Does /FAQ/marchindex.html disappear
from the index if it's not found with a rewalk?

Thanks in advance,
Susan


Susan Alderman Susan_Alderman@brown.edu
Box 1885 vox: 401-863-9466
CIS, Brown University fax: 401-863-7566
Providence, RI 02912



Rewalk - what does this really do?

Posted: Wed Mar 31, 1999 4:26 pm
by Thunderstone


-rewalk does fairly literally what its name says. It takes the options
and starting URLs that you previously specified to gw. It then creates a
new, empty database and does a complete walk using those extracted options
and starting URLs. You may not specify any other options or URLs when
using -rewalk.

When the new walk completes successfully, the old database is replaced with
the new one.

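The upshot of all of this is that all pages get refetched, and missing
ones will not be in the resultant database. In your two cases,
/FAQ/index.html is reindexed from its current contents, and
/FAQ/marchindex.html drops out of the index once it can no longer be
found.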
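
To make the behavior described above concrete, here is a rough sketch in Python. It is only a toy model of the idea (fresh database, full walk, swap on success), not how gw is actually implemented. The rewalk() function, the fetch callable, and the dictionary "databases" are all made up for illustration, and keeping the old database when the walk fails is an assumption drawn from the "completes successfully" wording.

```python
from collections import deque


def rewalk(start_urls, fetch, old_db):
    """Toy model of a rewalk: build a fresh index, then swap it in.

    fetch(url) is a hypothetical callable that returns (text, links),
    or None if the page no longer exists.
    """
    new_db = {}                      # start from a new, empty database
    queue = deque(start_urls)
    seen = set(start_urls)
    try:
        while queue:
            url = queue.popleft()
            page = fetch(url)        # every page is refetched, never assumed
            if page is None:
                continue             # missing pages never enter the new database
            text, links = page
            new_db[url] = text
            for link in links:
                if link not in seen:
                    seen.add(link)
                    queue.append(link)
    except Exception:
        return old_db                # assumption: a failed walk keeps the old database
    return new_db                    # walk succeeded: the new database replaces the old


# The site as of April: index.html has rotated, marchindex.html is gone.
site = {
    "/FAQ/index.html": ("April FAQ", ["/FAQ/aprilindex.html"]),
    "/FAQ/aprilindex.html": ("April index", []),
}
# What the previous walk stored.
old_db = {
    "/FAQ/index.html": "March FAQ",
    "/FAQ/marchindex.html": "March index",
}

db = rewalk(["/FAQ/index.html"], site.get, old_db)
print(sorted(db))  # ['/FAQ/aprilindex.html', '/FAQ/index.html']
```

In this model the rotated /FAQ/index.html ends up with its April contents, and /FAQ/marchindex.html is absent simply because nothing was carried over from the old database.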