Preventing duplicates

Post Reply
dietric
Posts: 100
Joined: Fri May 20, 2005 10:57 am

Preventing duplicates

Post by dietric »

I have three URL's in an index that reference the same piece of content, but are not seen as duplicates:
http://www.rnweb.com/rnweb/Assessment/W ... oryId=6042
http://www.rnweb.com/rnweb/Assessment/W ... oryId=6063
http://www.rnweb.com/rnweb/article/arti ... ?id=109931

The duplicate fields in the search settings are set to "Duplicate Check Fields". I did a diff on the actual body stored in the appliance, and it seems to be identical. However, the hash key of the three indexes is different. What else could cause a difference?
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

Preventing duplicates

Post by mark »

Did you happen to change the duplicate check field settings between when those pages were walked?
dietric
Posts: 100
Joined: Fri May 20, 2005 10:57 am

Preventing duplicates

Post by dietric »

No, they were always set to "Body".

-ds
dietric
Posts: 100
Joined: Fri May 20, 2005 10:57 am

Preventing duplicates

Post by dietric »

Do you have any more insight into this?

Thanks
-ds
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

Preventing duplicates

Post by mark »

Not really. Maybe we could spot something if we had access to the appliance. If you want us to look open a ticket with appliance access information (hostname, login, password, profile).
Post Reply