Strange indexing behaviour

Post Reply
joseph.gresham
Posts: 28
Joined: Thu Oct 20, 2005 9:05 am

Strange indexing behaviour

Post by joseph.gresham »

Hi

We are experiencing some strange behaviour with the indexing of binary documents on our site. Documents placed in the non-secure area of the site seem to be indexed correctly by Webinator. However documents placed in the secure section of the site are not indexed correctly. Although the documents can be found by using the 'List/Edit URLs' option, viewing the contents of the document reveals that the body has not been indexed (displays -NONE-).

Another observation is that the results that have no body content have a Charset of 'ISO-8859-1' and the documents that have been indexed correctly have a Charset of 'UTF-8', is this significant? Any help you could provide would be greatly appreciated.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Strange indexing behaviour

Post by mark »

Is the charset declared for those pages? If not, what's your default source charset set to? And your storage charset?

What does list/edit detail say about errors?

Also make sure the page has no meta robots telling it to noindex and that you don't haven't setup Exclude by Field that would match those pages.
Post Reply