Bad Connection; Site Not Completely Walked

Post Reply
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Bad Connection; Site Not Completely Walked

Post by Thunderstone »




1) Suddenly, gw has stopped working! Every command emits
"timed out: bad handshake". What can I do to correct this?

2) Before this occurred, I had added some new directories
under http://www.mccc.edu They are not being indexed. My
"walk" command is

$WWW/webinator/bin/gw -d$WWW/webinator/db -xhttp://www.mccc.edu/njcate -L -N -z300000 http://www.mccc.edu
$WWW/webinator/bin/gw -d$WWW/webinator/db -L -N -z300000 http://www.mccc.edu/SCHED
$WWW/webinator/bin/gw -d$WWW/webinator/db-uh -L -N -z300000 -jhttp://www.mccc.edu/unixhelp http://www.mccc.edu/unixhelp
$WWW/webinator/bin/gw -d$WWW/webinator/db-njcate -L -N -z300000 -jhttp://www.mccc.edu/njcate http://www.mccc.edu/njcate

3) Come to think of it, I'm not sure what the above
actually does! The first two lines refer to a database
called "db". In line 1, I excluded the "njcate" directory
hierarchy, but didn't exclude the "unixhelp" one. Yet the
"unixhelp" doesn't get indexed.

Looks like I had to add line 2 to get the "SCHED" directory
hierarchy indexed. Why wasn't SCHED included by the command
in 1?

Lines 3 and 4 work fine.

Thanks,
Pete



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Bad Connection; Site Not Completely Walked

Post by Thunderstone »



You have lost connection to the license server ls.thunderstone.com . Free
webinator must be able to talk to ls.thunderstone.com on port 80 (no proxies)
and in a timely manner. Did you just put up a firewall or something?



Are unixhelp and SCHED linked into the site somewhere that a web user
would find them while surfing your site? If not, webinator will not
find them either.



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Bad Connection; Site Not Completely Walked

Post by Thunderstone »



On Wed, 9 Dec 1998, Mark Willson wrote:


Nope. Must have been a problem on the 'Net; it's OK now.


SCHED is but unixhelp is not.

I ran into another problem. I tried to execute the


with -Inewindex.html inserted before the last argument. It didn't work.

What did work was changing the last argument to http://www.mccc.edu/newindex.html

Is that a bug or did I do something stupid?

Thanks,
Pete



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Bad Connection; Site Not Completely Walked

Post by Thunderstone »




gw walks what you tell it to. It doesn't append anything to the url.
-Inewindex.html tells it that "/newindex.html" is that same as "/" to
help it avoid fetching redundant pages.


Post Reply