index problem

rkruger
Posts: 1
Joined: Fri Jun 08, 2001 4:27 am

index problem

Post by rkruger »

bart
Posts: 251
Joined: Wed Apr 26, 2000 12:42 am

index problem

Post by bart »

The problems are different at each site. www.schnigge.de has a large number of pages that are not URL driven. Many of of its links are generated by client side javascript . The crawler will not follow links that require it to execute client side javascript.

www.javamagazin.de does not have any content of its own. All of its content really resides at http://entwickler.com/ . There is only one page actually located at www.javamagazin.de .
User avatar
Kai
Site Admin
Posts: 1272
Joined: Tue Apr 25, 2000 1:27 pm

index problem

Post by Kai »

Also, www.schnigge.de has an off-site <IFRAME>; since it is the only <IFRAME> that page fails. You could fetch just that page with -o and -g, then continue the walk without -o and -g to get the rest of the site.