Page 1 of 1

exclusions

Posted: Thu Oct 29, 1998 5:19 pm
by Thunderstone



I want to walk a site http://www.site.com and exclude all the URLs on this
site that begin with http://www.otherlinks.com/many_files/listed/not_wanted
I have tried using the command gw -xhttp://www.otherlink.com/ but to no
avail. It still rewalks all the URLs beginning with
http://www.otherlinks.com/ Please note I am walking another site, not my
own, that I have created a separate database for. The database is reading
URLs on a file that I have named music.dat The actual command I am using is
this

gw -rewalk -D0 -fshtml -fdat -xhttp://www.otherlinks.com -d/the_way/to
_my/webinator_directory/music_database "&music.dat"

everything seems to go find except the exclusion.
Any suggestions on how I might be able to exclude these files beginning with
http://www.otherlinks.com/ without excluding the whole page that they are
on listed in my music.dat file.

Thanks, Steve

------------------------------------------------


Attachment: http://thunder/texis/intra/attach/at.bin/+/BAAExDlY_
text/html




exclusions

Posted: Thu Oct 29, 1998 5:51 pm
by Thunderstone



For starters, -rewalk ignores all other options and uses those from
the last walk. If you're changing options, you need to do a new walk,
not use -rewalk.

Also, gw will not walk to http://www.otherlink.com unless you specify it
in one of the urls in music.dat . So make sure it's not in there if you
don't want it.