Ignoring robots.txt file

sunnedaze
Posts: 22
Joined: Mon Jul 28, 2003 2:07 pm

Ignoring robots.txt file

Post by sunnedaze »

Nothing like that. Just what I copied above
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Ignoring robots.txt file

Post by mark »

sunnedaze
Posts: 22
Joined: Mon Jul 28, 2003 2:07 pm

Ignoring robots.txt file

Post by sunnedaze »

I put /wlc050403 in the exclusion prefix section, did a new walk & yes, the site came up when I did a search.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Ignoring robots.txt file

Post by mark »

In the "Exclusion Prefix" box you have to enter the entire url prefix, like I gave it. You can enter just
/wlc050403
in the plain "Exclusions" box though.
sunnedaze
Posts: 22
Joined: Mon Jul 28, 2003 2:07 pm

Ignoring robots.txt file

Post by sunnedaze »

Sorry...I entered it in the exclusions box /wlc050403. I don't see the "exclusions prefix" box (looking in the basic walk settings for that profile)?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Ignoring robots.txt file

Post by mark »

It's under "all walk settings" as mentioned previously.
sunnedaze
Posts: 22
Joined: Mon Jul 28, 2003 2:07 pm

Ignoring robots.txt file

Post by sunnedaze »

Yes, that worked. The site did not come up in the search. What next?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Ignoring robots.txt file

Post by mark »

Wasn't not coming up the desired goal?

If the exclusion works in "Exclusion Prefix" there must be something odd about your robots.txt file since that's where the robots.txt rules are placed during the walk. Without being able to fetch it from here it's hard to say what.

In your installation directory there should be a program called "geturl.exe". From a command prompt run
geturl http://cleohsenet01.napa.ad.etn.com/robots.txt
and paste the full results here.
sunnedaze
Posts: 22
Joined: Mon Jul 28, 2003 2:07 pm

Ignoring robots.txt file

Post by sunnedaze »

I don't seem to have access to a prompt & when I run this at the Windows2000 'run' command (off the start menu), the display flashes by too quickly to be read.
F:/Thunderstone Software/Webinator/geturl.exe http://cleohsenet01.napa.ad.etn.com. Have tried typing it several ways. This way doen't give me the 'can't find components' message, but no output either.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Ignoring robots.txt file

Post by mark »

find and click the MSDOS icon to get a prompt.
Post Reply