gw limited to httpd port by default?
Posted: Fri Mar 31, 2000 4:09 pm
Webinator crowd ...
I apologize if this topic has been covered here or is in the FAQ, but I
couldn't find it in the documentation.
I'm looking for assurance that gw will exclude pages that are on the same
machine, but are only served by a webserver listening on a different port.
Documentation for the -o option states:
"Allow grabbing of individual off-site pages. By default gw will not
retrieve pages that are not on the same machine as the initial URL. With
this option pages not on the initial machine will be retrieved, but none of
the pages that they reference will be."
I have two docroots served by two webservers listening on separate ports.
Will gw, by default, also ignore links referencing the other webserver on
the same machine? I have specified the URL http://url:4440/ in the gw
command, but pages served by this server probably contain links that omit
the port (e.g. http://url/). Will these be included because they are on the
same machine, or is gw smart enough to check the port too?
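To make the question concrete, here is a minimal Python sketch of the two
behaviours I'm asking about. It is not gw's actual logic, which I don't
know; the function names (host_and_port, same_host, same_host_and_port) are
made up for illustration, and the default-port table is my assumption about
how a URL without an explicit port would be treated.

```python
from urllib.parse import urlsplit

# Assumed default ports for URLs that do not list one explicitly.
DEFAULT_PORTS = {"http": 80, "https": 443}

def host_and_port(url):
    """Return (hostname, port), filling in the scheme's default port."""
    parts = urlsplit(url)
    port = parts.port if parts.port is not None else DEFAULT_PORTS.get(parts.scheme)
    return parts.hostname, port

def same_host(a, b):
    """Hypothetical check 1: compare the machine name only."""
    return host_and_port(a)[0] == host_and_port(b)[0]

def same_host_and_port(a, b):
    """Hypothetical check 2: compare the machine name and the port."""
    return host_and_port(a) == host_and_port(b)

start = "http://url:4440/"   # the URL given to gw
link = "http://url/"         # a link that omits the port

print(same_host(start, link))           # True: would be followed
print(same_host_and_port(start, link))  # False: would be skipped
```

If gw behaves like the first check, links to http://url/ would be walked
along with the :4440 pages; if it behaves like the second, they would be
treated as off-site.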
Thanks for any help I receive.
Steven Vorhees
svorhees@creativepro.com