Happy Holidays!
It's nice to see that so many people are using the Webinator software.
We've noticed that some people are having trouble with some of the
command-line options for the gw program. Here's a quick refresher:
1: Walk your site:
gw -d/tmp/webdb http://www.mysite.com/
2: Create the index:
gw -d/tmp/webdb -index
3: Add more sites to the database:
gw -d/tmp/webdb http://www.more-sites.com/
4: Update the index
gw -d/tmp/webdb -index
5: Re-walk the sites
gw -d/tmp/webdb -e1996-12-31
6: Update the index
gw -d/tmp/webdb -index
7: Review the URL list
gw -d/tmp/webdb -st "select Url from html"
8: Delete a page
gw -d/tmp/webdb -st "delete from html where Url like 'www.mysite.com/testpage.html'"
9: Clear the database
gw -d/tmp/webdb -wipe
Notes:
A: GW will not re-walk a URL once it has been fetched unless you specify the
-e (expire) option.
B: _Please_ don't forget to create or update your indexes once you have
modified the database. The software will not be able to search as quickly
without an up-to-date index.
C: Relevance ranking and more-like-this will not work without an index.
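One way to make sure Note B is never forgotten is to wrap gw in a small
shell function that refreshes the index after every command that changes
the database. This is only a sketch: gw_update, GW, and DB are names made
up for this example, not part of Webinator, and it assumes gw returns a
nonzero exit status on failure, which is worth checking on your system.

```shell
#!/bin/sh
# Sketch only: GW, DB and gw_update are illustrative names, not Webinator features.
GW="${GW:-gw}"            # path to the gw program
DB="${DB:-/tmp/webdb}"    # database directory, as in the examples above

# gw_update ARGS...
# Run gw against $DB with the given arguments, then rebuild the index,
# but only if the first command succeeded (assumes a nonzero exit on failure).
gw_update() {
    "$GW" "-d$DB" "$@" && "$GW" "-d$DB" -index
}

# Examples (commented out so that sourcing this file runs nothing):
# gw_update http://www.mysite.com/      # steps 1 and 2
# gw_update -e1996-12-31                # steps 5 and 6
```

With this in place, each walk or re-walk is followed by an index update
automatically, so searches always run against a current index.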
If you have trouble with Webinator, or questions about it, don't hesitate to
post to this list!
Thanks,
Bart Richards
bart@thunderstone.com