Page 1 of 1

Strange indexing behaviour

Posted: Fri Oct 20, 2006 4:36 am
by joseph.gresham
Hi

We are experiencing some strange behaviour with the indexing of binary documents on our site. Documents placed in the non-secure area of the site seem to be indexed correctly by Webinator. However documents placed in the secure section of the site are not indexed correctly. Although the documents can be found by using the 'List/Edit URLs' option, viewing the contents of the document reveals that the body has not been indexed (displays -NONE-).

Another observation is that the results that have no body content have a Charset of 'ISO-8859-1' and the documents that have been indexed correctly have a Charset of 'UTF-8', is this significant? Any help you could provide would be greatly appreciated.

Strange indexing behaviour

Posted: Fri Oct 20, 2006 11:16 am
by mark
Is the charset declared for those pages? If not, what's your default source charset set to? And your storage charset?

What does list/edit detail say about errors?

Also make sure the page has no meta robots telling it to noindex and that you don't haven't setup Exclude by Field that would match those pages.