rewalk

Post Reply
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

rewalk

Post by Thunderstone »




If these sites are virtual hosts on one IP that require "delete from
options where Name='URL'" between runs or are walked with different
option sets, you will not be able to use -rewalk. -rewalk only works
where the entire database can be built using one gw command line with all
of the urls and options.

You should use the manual procedure. What you describe would
go something like this:
#!/bin/sh
cd .../webinator
bin/gw -dnewdb -create
bin/gw -dnewdb -noindex ...options... http://url1
bin/gw -dnewdb "delete from options where Name='URL'"
bin/gw -dnewdb -noindex ...options... http://url2
bin/gw -dnewdb "delete from options where Name='URL'"
...
../../cgi-bin/texis removeunwanted/main.txt
bin/gw -dnewdb -index
mv db olddb
mv newdb db
#rm -f olddb # if all went well

Place all of the above, without leading spaces and adjusting paths to reflect
your installation, into a file and make it executable with:
chmod +x manualrewalk
Then use that script as your cron job.

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

rewalk

Post by Thunderstone »




Dear webinator@thunderstone.com

I am attempting to rewalk a database and recall some options.

Having previously run ...

"gw -mwalkoptions -save=options" (see attached walkoptions file)

I then attempted ...

"gw -rewalk -recall=options"

but got the following error ...

Rewalking database
002 Can't make up tmp names from /u5/liz/proof1/webinator/db
002 Cleanup old databases in /u5/liz/proof1/webinator starting with _ or O

What I want to do is rewalk the site but using a bunch of options (mainly
URL exclusions).

Any advice ?

Thanks,

Simon

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

rewalk

Post by Thunderstone »



That should work. The problem is that you've run many rewalks already
that have not completed. Either because you stopped them or there was a
problem walking. In those cases, the "new" database is left with it's
temporary name so you can examine or use it if desired. If you don't
want the temporary database, you should delete it. The new temporary
database will be the same name as the master except it will have an "_"
prepended to the name. The old database will have an "O" prepended to
the name. Up to 8 of these will be prepended. When that limit is
reached you get the message:

002 Can't make up tmp names from /u5/liz/proof1/webinator/db
002 Cleanup old databases in /u5/liz/proof1/webinator starting with _ or O

You need to follow the instructions in the message and delete the old
databases beginning with "_" or "O". Then your -rewalk should work normally.




Post Reply