Page 1 of 1

dowalk_beta 'query' support

Posted: Mon Nov 26, 2001 7:55 am
by neil.munro
Hi,
I am using the dowalk_beta script that I downloaded from your web site.
At the moment, it works Ok except that it doesn't include urls that have a parameter at the end of it. For eg, we have some news items that have a URL something like:
http://www.mywebsite.com/news/news-arch ... rparm=6789
But at the moment, the dowalk script doesn't index it or even look at it.
So I think some section in the settings has to be changed.. maybe acceptsufs.... but I am not sure and if it is, then I don't know the correct syntax.
Similarly, how would I specify a particular query to NOT be accepted..like the -exquery parameter in the normal gw?

Any help/areas of the manual (besides REX because I don't understand it fully anyway) would be great...

Regards,

dowalk_beta 'query' support

Posted: Mon Nov 26, 2001 10:41 am
by mark
Urls with /cgi-bin/ & ? + or ~ in them are ignored by default. See $rejects to change that (eg: remove "\&", "\?", and maybe "\+").

You can add a REX expression to $rejects to specify a specific query to reject. Maybe something like:
">>\?=!SKIPME*SKIPME"