i think something's broken

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

i think something's broken

Post by Thunderstone »



I'm trying to index a directory on our server to have a database of its own
so i can just search this particular directory. Here is what happens:

{28} (~/html/webinator/bin) [www@shell] 12:00 > gw
-d/home/www/html/webinator/intra http://www.athenet.net/~helpdesk/index.html
Getting http://209.103.198.2/robots.txt...Not there...Ok.
Adding todo: http://www.athenet.net/~helpdesk/
http://www.athenet.net/~helpdesk/
0/0
Visited 1 pages total
Indexing new pages

Why is this not starting at the index.html file and following it through
that directory as happened when I initially indexed another directory using
the default database?

thanks-
mike



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

i think something's broken

Post by Thunderstone »




It is removing the "index.html" because it's an alias for the directory name
itself. See the -I option.

I notice that that page is blocked from world access. Does the machine where
you're running gw have permission to fetch that page? Do you get any error
messages? Try turning your verbosity (-v5) up to see what's happening.
Try:
geturl "http://www.athenet.net/~helpdesk/"
to see what your web server is returning.