Query re robots tags

Post Reply
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Query re robots tags

Post by legedza.henry »

Hi there,

We are hitting the limits of our current license (20,000) and are trying to find ways to reduce the number of pages being indexed.

An example of such a page is this:

http://www.sacereview.sa.gov.au/a8_publ ... avgrp=1003

When you view it you'll notice it is only a menu bar. Ecah item then links to specific content

It has a robots tag of noindex/Follow yet it is stored within the DB.

Is this what should be happening?
Is there anyway to avoid these pages being stored?

Thanks
User avatar
John
Site Admin
Posts: 2622
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH
Contact:

Query re robots tags

Post by John »

Yes, you can change the dowalk script to not keep those by changing the line:

<$SSc_metarobotsplaceholder=Y> <!-- keep url for "noindex" pages -->

to set the variable to "N". That will lose the ability to walk through parent pages in the List/Edit urls.
John Turnbull
Thunderstone Software
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Query re robots tags

Post by legedza.henry »

Thanks for the tip. My only other question is how does one find out about these particular functionsd of dowalk.

I've had a look through the docs and the website and can't find any mention of this function (metarobotsplaceholder). is there anywhere that I can find these things documented?

Thanks
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Query re robots tags

Post by mark »

I think that may be the only internal option (not on admin interface). It's very rarely used and breaks the parent link surfing so it's not mentioned. Looking at the "defaults" function in dowalk can provide some insight.
Post Reply