Duplicate PDF documents
Posted: Mon Apr 08, 2002 6:49 pm
We have some scanned PDF documents that are showing up in the walk results as duplicates - e.g.,
The link: http://countynet/procure/purchguide/boardapprmsa.pdf
Referenced by : http://countynet/procure/purchguide/
Is a duplicate of: http://countynet/misc/boscalendar.pdf
(these are on our Intranet, so you won't be able to reach the site.)
I realize that scanned PDF's don't have any body text that can be indexed, but we did give these PDF's unique PDF titles, subjects, and descriptions & then reindexed, and they still showed up as duplicates. Is there anything else we can try?
The link: http://countynet/procure/purchguide/boardapprmsa.pdf
Referenced by : http://countynet/procure/purchguide/
Is a duplicate of: http://countynet/misc/boscalendar.pdf
(these are on our Intranet, so you won't be able to reach the site.)
I realize that scanned PDF's don't have any body text that can be indexed, but we did give these PDF's unique PDF titles, subjects, and descriptions & then reindexed, and they still showed up as duplicates. Is there anything else we can try?