Running gw from the web server

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »



When I run gw from my web server, it doesn't walk pages on that server. It
pulls the page I point it to and stops. When I point it at other servers,
or even virtual servers, it walks them. How do I get it to walk the pages
on the same server?


______________________________________________________________
Carl Dickson cdickson@govsolutions.com
Government Solutions Phone: 703-847-3601
http://www.govsolutions.com Fax: 703-760-7899
8000 Towers Crescent Dr., Suite 1350, Vienna, VA 22182

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »




Perhaps it thinks the pages referenced from the home page are on a different
machine that the url you started with for some reason. Is this a multi-homed
machine?

Use the -v4 option to get more info about what it thinks about your links.

When you report problems about "I can't walk a site" and the like, a
Url will go a long way toward helping us determine the issue.
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »



At 03:38 PM 12/13/96 EST, you wrote:

It is a multi-homed machine. The other virtual-home worked.

-v4 showed the links as "Off site".

Here's the relevant portion of the log file:


What's supposed to be in the "hosts" table?


______________________________________________________________
Carl Dickson cdickson@govsolutions.com
Government Solutions Phone: 703-847-3601
http://www.govsolutions.com Fax: 703-760-7899
8000 Towers Crescent Dr., Suite 1350, Vienna, VA 22182

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »




The hosts table is only used when -dnscache is enabled. You see the warning
because verbosity was turned up. Don't worry about it.

Perhaps you have a hosts/nameserver issue of some kind that makes gw think
the subsequent pages are offsite. When I start a walk on that site from here
it works fine. Try -v8 to see if that's more illustrative.
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »





The server hosts 2 virtual roots. It reports that it's chasing 10 links,
but the urls are all for the wrong virtual root. It then reports that each
one is off-site, which is correct. How do I get it back on track?


______________________________________________________________
Carl Dickson cdickson@govsolutions.com
Government Solutions Phone: 703-847-3601
http://www.govsolutions.com Fax: 703-760-7899
8000 Towers Crescent Dr., Suite 1350, Vienna, VA 22182

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »




Could this (below) happen if the html files are using relative URLs
rather than absolute URLs?

On Fri, 13 Dec 1996, Carl Dickson wrote:


Regards
Anthony,
:~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:
:J. Anthony Waldron : Anthony.Waldron@innosoft.com :
:~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:
: Innosoft International, Inc : Telephone: +1.818.919.3600 :
: 1050 East Garvey Avenue South : FAX: +1.818.919.3614 :
: West Covina, California 91790 : URL: http://www.innosoft.com :
:~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:


User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Running gw from the web server

Post by Thunderstone »




Please supply some of the the "wrong" urls (and the initial one).
Also send the output of gw -h .