rewalking options

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

rewalking options

Post by Thunderstone »

When a gw rewalk command is given, does gw use the same command line as the
initial run or are the options stored otherwise? Meaning, I use many
options and many URLS so I use the command

gw -moptions "&URLS"

If I change any of the options in the config file or add/substract a URL,
will gw pick up on this when it rewalks or do I need to start from a new db?

Kevin

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Kevin Ward, SSAI
MODIS Digital Library Manager
NASA Goddard Space Flight Center
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

rewalking options

Post by Thunderstone »



-rewalk uses all of the Urls that have ever been specified on the gw
command line (or in a list file) since the last -wipe or -create. It
uses the same options as the last run of gw. It will not reuse your
list file. So urls added or deleted from that will not be noticed. In
that case you will need to walk to a new database (not using -rewalk),
then make that one live when finished.



sduggan
Posts: 84
Joined: Tue Mar 20, 2001 9:16 am

rewalking options

Post by sduggan »

I've index my site using this command line:

gw -fasp -n"application/pdf,pdf,pdftotx" (site name)

then later on, I successfully indexed another site to the database and the search engine is showing entries from both, so far so good.

I would like to know if I set a scheduled rewalk will it reindex both sites and use the original options(specifically the pdf plugin) I specified on the first one.

Thanks
Shawn
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

rewalking options

Post by mark »

It will use the walk options specified in the latest walk you did. So if you did not specify -n when you added the second site neither will -rewalk. If you always use the same set of options when adding sites then -rewalk will work as desired.