If these sites are virtual hosts on one IP that require "delete from
options where Name='URL'" between runs or are walked with different
option sets, you will not be able to use -rewalk. -rewalk only works
where the entire database can be built using one gw command line with all
of the urls and options.
You should use the manual procedure. What you describe would
go something like this:
#!/bin/sh
cd .../webinator
bin/gw -dnewdb -create
bin/gw -dnewdb -noindex ...options... http://url1
bin/gw -dnewdb "delete from options where Name='URL'"
bin/gw -dnewdb -noindex ...options... http://url2
bin/gw -dnewdb "delete from options where Name='URL'"
...
../../cgi-bin/texis removeunwanted/main.txt
bin/gw -dnewdb -index
mv db olddb
mv newdb db
#rm -f olddb # if all went well
Place all of the above, without leading spaces and adjusting paths to reflect
your installation, into a file and make it executable with:
chmod +x manualrewalk
Then use that script as your cron job.