Another Stay Under Query

Post Reply
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Another Stay Under Query

Post by legedza.henry »

Hi there,

We have a site which comprises of numerous (approx 80)small websites. The addresses range from www.ddd.aa.gov.au/websitename to those with completely different domains: www.aaa.edu.au - but they all belong to us.

I would like to simply index these 80 sites without the index wandering off externally or to other sites within our larger entity.

I have entered all of the separate URLs and then selected Stay Under.

For some reason it seems to index some correctly but for others returns unwanted prefix for sub-directories.

For example: all of the following URLs have been entered in the BASE URL but return Unwanted Prefix errors.

2006-03-15 10:56:56 http://www.decs.sa.gov.au/deptinit Unwanted prefix
2006-03-15 10:56:56 http://www.decs.sa.gov.au/mediacentre Unwanted prefix
2006-03-15 10:56:56 http://www.decs.sa.gov.au/custserve Unwanted prefix
2006-03-15 10:56:56 http://www.decs.sa.gov.au/ministers Unwanted prefix

Likewise this one: http://www.decs.sa.gov.au/audit/a8_publ ... navgrp=139 Unwanted prefix

www.decs.sa.gov.au/audit/ is in the base url.

Am I doing something obviously incorrect?

Thanks
Henry
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Another Stay Under Query

Post by mark »

That's correct. You started at www.decs.sa.gov.au/audit/ so it stays under the audit directory. You can turn off stay under if you want it to wander about anywhere on the site. Or give the top of the site as the base url.

FYI, Webinator will not leave the specified sites unless told to do so with options like other base urls, extra domains or networks, enterprise, etc. Stay under applies to subdirectories of an individual site and is not needed to keep Webinator from leaving the specified sites.
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Another Stay Under Query

Post by legedza.henry »

I must be misinterpreting what Stay Under does.

I have www.decs.sa.gov.au/audit/ listed in the BASE URL but it won;t index the page telling me:

http://www.decs.sa.gov.au/audit/a8_publ ... asp?navgrp =139 Unwanted prefix

a8_publish is a subdirectory of audit and as such should be indexed, am I correct in that assumption?

In addition each of these sites are listed as separate entries in the BASE URL field but are ignored.

http://www.decs.sa.gov.au/deptinit Unwanted prefix
http://www.decs.sa.gov.au/mediacentre Unwanted prefix
http://www.decs.sa.gov.au/custserve Unwanted prefix
http://www.decs.sa.gov.au/ministers Unwanted prefix

If I have 80 sites listed in the BASE URLs and STAY Under selected should all of the 80 sites (and their subdirectories) be indexed?

If not, what settings would I need for this to happen?

Thanks
Henry
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Another Stay Under Query

Post by mark »

Is your base url "http://www.decs.sa.gov.au/deptinit" or "http://www.decs.sa.gov.au/deptinit/" (trailing slash)? If the base has the trailing slash and there's an href without it it will be considered unwanted because the prefix doesn't match the base url. But that's ok since it already has that page.

For your other one, I'm not able to replicate anything like that. Make sure you have the latest scripts from the website.
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Another Stay Under Query

Post by legedza.henry »

The base urls all of have the trailing slash.

I'm still confused...I have another example:

i have 2 urls entered in the base url:

http://www.decs.sa.gov.au/animalethics/
http://www.decs.sa.gov.au/accountability/

I have stay under on.

When I run a reindex, animalethics indexes perfectly but nothing in accountability is indexed eg.

http://www.decs.sa.gov.au/accountabilit ... cg0001005/ (Unwanted prefix )
http://www.decs.sa.gov.au/accountabilit ... emography/ (Unwanted prefix )

My question is why is wasn't indexed.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Another Stay Under Query

Post by mark »

I don't know. What version of scripts do you have?
If you download and install the latest scripts does the problem persist?
legedza.henry
Posts: 142
Joined: Wed Jul 24, 2002 11:52 pm

Another Stay Under Query

Post by legedza.henry »

We are running V5.0.2 - scripts were last updated late 2004
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Another Stay Under Query

Post by mark »

Post Reply