It seems that Webinator 4.3 is ignoring robots.txt files when it discovers links from one site pointing to an area of a different site and that area is supposed to be off limits due to the robots.txt file.
Example: site1.com has the following statement in the robots.txt file
Disallow: /softcopy_keys/asp/mob_ground
When I did a pattern search after a walk, I get several pages in the index that has the above pattern.
http://site1.com/softcopy_keys/asp/mob_ ... .asp?id=14
The one thing that appears to be in common with all of the fetched pages is that they all have parents coming from a different site. This leads me to believe that Webinator is not following the robots.txt file in this type of case.
Example: site1.com has the following statement in the robots.txt file
Disallow: /softcopy_keys/asp/mob_ground
When I did a pattern search after a walk, I get several pages in the index that has the above pattern.
http://site1.com/softcopy_keys/asp/mob_ ... .asp?id=14
The one thing that appears to be in common with all of the fetched pages is that they all have parents coming from a different site. This leads me to believe that Webinator is not following the robots.txt file in this type of case.