Todo list

Post Reply
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Todo list

Post by Thunderstone »



Greetings,

Is it possible to dump the todo list without wiping the database? I've
fed a URL list to webinator and pointed it at an alternate database. I
came back after the weekend to discover that Webinator was fetching many
more pages than I'd anticipated. I used Ctrl-C to break the fetching
batch and then indexed. Is there a way to now purge the todo list, keep
the current database holdings, and start over?

While searching this database, I've noticed a slew of entries that I did
not intend. I assume that this comes from using a url list (eg.
&urllist.lst). I'd like to use the -j switch for all of the pages that I
fetch. Is this possible with an option list such as one uses with the -m
switch?

Server stats:
OS: NT4 sp3
Mem: 256MB
Webinator version: 2.1.874085834

Thanks in advance,

Mark

| Mark J. Weixel | "To new shores! Despite the tempest, the |
| University of Pittsburgh | shoals and the hidden rocks... |
| 4G Forbes Quadrangle | To new shores!" |
| weixel+@pitt.edu | --Musorgskij |



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Todo list

Post by Thunderstone »




You can see what's in todo with:
gw -st "select * from todo"
You can purge todo with:
gw -st "delete from todo"
but that might zap ones you want as well as ones you don't.
You can be selective by adding a where clause. Something like
this might do what you want:
gw -st "delete from todo where Url matches 'www.somesite.com/%'

You can resume by running gw with the same options you used before.


You need to specify a -j... option for each url path you want to restrict.
You can specify them, as any other option, in an option file using -m... .


Post Reply