Page 1 of 1

-x not working for asp

Posted: Tue Apr 30, 2002 8:54 am
by sduggan
Hello, when indexing our intranet I use the -x option to exclude a section of the site using an ASP application, but it doesn't seem to work. Here is the command:
gw -xhttp://(sitename)/depts/hr/leaders/ http://(sitename)/ ... then a bunch of other options
but when it starts to index the site I see it indexing:
http://(sitename)/Depts/HR/Leaders/calendar.asp?date=4/30/02 and it ends up going in a loop through all the dates in the calendar and then craps out eventually. The only way I found thus far to exclude that directory is to rename it while I'm indexing and then name it back. Are there any particular issues with excluding asp pages.

Here is the complete command line I use:

gw -dd:\inetpub2\wwwroot\webinator\db2 -n"application\pdf,pdf,pdfttotx" -n"application\msword\doc,pdftotx" -k"\alnum={2,30}" -k"\alnum=[\alnum/]{1,29}" -fasp -fmsw -xhttp://sitename/depts/hr/leaders/ http://(sitename)/
Thanks
Shawn

-x not working for asp

Posted: Tue Apr 30, 2002 9:13 am
by John
The "-x" command is case sensitive as you have used it, so you should make sure it matches the URLs on the site. You might also try "-x/" to exclude based on expression.

-x not working for asp

Posted: Tue Apr 30, 2002 9:35 am
by sduggan
I'm not sure what you mean by "try -x/" to exclude based on expression." Could you give me an example?

-x not working for asp

Posted: Tue Apr 30, 2002 10:26 am
by mark