Hi,
I have four problems with Webinator 2.0 that maybe someone can help me with.
ADDING MULTIPLE PAGES
I am having trouble adding multiple pages to an existing index. I am
using -g (without the -a) to get a page and all of its links.
gw -d/www/doc/webinator/newindex -g http://www.itc.virginia.edu/progress/97/toc.html
However, gw only indexes the first page.
Getting http://128.143.22.53/robots.txt...Not there...Ok.
http://www.itc.virginia.edu/progress/97/toc.html
1/23
Visited 1 pages total
Indexing new pages
How do I get it to add all 23 of the links from this page to the index? In
the documentation it says if you omit the -a option it will get all the
URLs from the page. However, whenever I run it, it indexes only the
single page, not the rest of the section (everything in the /progress/97/
subdirectory that is linked from the starting page). When I run it with
the -a option I get the exact same results.
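To be clear about which pages I expect to end up in the index, here is a rough Python sketch. It is not Webinator code and says nothing about how gw works internally; the class and function names and the sample file names (intro.html, /other/x.html) are made up for illustration. It lists the links on a page that sit under the same directory as the starting URL, which for my page should be the 23 links under /progress/97/.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_in_same_directory(start_url, html):
    """Return absolute links found in html that sit under start_url's directory."""
    collector = LinkCollector()
    collector.feed(html)
    start = urlparse(start_url)
    # Directory part of the starting page, e.g. /progress/97/
    base_dir = start.path.rsplit("/", 1)[0] + "/"
    found = []
    for href in collector.hrefs:
        # Resolve relative links and drop any #fragment.
        absolute = urljoin(start_url, href).split("#")[0]
        target = urlparse(absolute)
        same_host = target.netloc == start.netloc
        if same_host and target.path.startswith(base_dir):
            if absolute != start_url and absolute not in found:
                found.append(absolute)
    return found


if __name__ == "__main__":
    # Hypothetical page content, only to show the filtering.
    sample = '<a href="intro.html">Intro</a> <a href="/other/x.html">Other</a>'
    start = "http://www.itc.virginia.edu/progress/97/toc.html"
    for url in links_in_same_directory(start, sample):
        print(url)
```

Run against the real toc.html, the list this produces is what I would expect gw to visit after the starting page.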
METADATA
It does not seem like Webinator is counting metadata when it does relevancy
ranking. I used the -meta=keywords,description,author option when creating
my index. I get all types of messages; sometimes it says "You must rebuild
the database to enable meta data storage." Does this mean I have to wipe my
index and start over? Why is that? How can I make sure the metadata is
being indexed?
REINDEXING ON A SCHEDULE
I have not had any luck with this option (gw -rewalk="every day at 1am").
How can I make sure that when my index is updated each night, the same
rules apply as when I first created the index? I created the index with at
least 10 options and then added some individual pages after that. When the
update happens each night, will it still follow the indexing behavior I
specified when I originally created the index?
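What I am after is roughly the following: keep the options in one place so every run uses the same set. This is only a Python sketch of the idea, using nothing but the options quoted in this post. The names INDEX_OPTIONS and build_gw_command are my own, and I do not know whether re-running gw this way gives the same result as its own rewalk scheduling. It only prints the command line and does not run gw.

```python
import shlex

# Options copied from this post. The real index was built with more options
# than these; they would all be listed here once.
INDEX_OPTIONS = [
    "-d/www/doc/webinator/newindex",
    "-meta=keywords,description,author",
]


def build_gw_command(extra_args):
    """Return the gw argument list: the shared options, then per-run arguments."""
    return ["gw"] + INDEX_OPTIONS + list(extra_args)


if __name__ == "__main__":
    nightly = build_gw_command(
        ["-g", "http://www.itc.virginia.edu/progress/97/toc.html"]
    )
    # Print the command so it can be checked by eye before being scheduled.
    print(shlex.join(nightly))
```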
MEMORY FAULT
When I add too many options to the gw command (more than 8), I usually get a
memory fault error. When I tried to specify the options in a separate
file using the -m option, it did not work: I have a list of -j
URLs, and gw quits and says the URL is not allowed (it seems to be
confusing -j with -x). Since I cannot use the -m option and I get a
memory fault when adding the list of -j URLs on the command line, I am not
able to create the index I want.
Thanks,
Lara