Not Duplicate but appliance says they are.

jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

Post by jgdoke »

mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Post by mark »

Go to list/edit URLs and check what text was extracted from each page. It's probably the same for both, which would explain why they are flagged as duplicates.
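
To illustrate why two different-looking pages can be reported as duplicates, here is a minimal sketch in Python. It is not the appliance's actual code, and the thread does not describe how the appliance compares pages; the sketch simply assumes that duplicates are detected by fingerprinting the extracted text. The names `TextExtractor`, `extract_text` and `text_fingerprint` are hypothetical.

```python
import hashlib
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def extract_text(html):
    """Return the page text with runs of whitespace collapsed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def text_fingerprint(html):
    """Hash of the extracted text (assumed basis for duplicate detection)."""
    return hashlib.sha256(extract_text(html).encode("utf-8")).hexdigest()
```

Under that assumption, any two pages whose extracted text is empty get the same fingerprint, no matter how different their HTML is.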

Post by jgdoke »


Post by mark »

I tried a crawl of just those two pages with duplicate detection off. Both come up with no body text. I'm not sure why yet; it will require more study of the HTML on those pages.

Post by jgdoke »

You are correct. List URLs shows zero bytes of text from each page.

ABjournal is one of our high-traffic areas, so please let me know an answer ASAP.

Thank you
John

Post by mark »

Those pages return different content based on the user-agent. Adjust your user-agent to something the web server likes. Something like this may make it behave:

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.0.3705; .NET CLR 1.1.4322)
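
If you want to confirm the user-agent behavior outside the appliance, a rough Python sketch like the one below may help. It only uses the standard library. The helper names are hypothetical, and you would need to supply your own page URL and the user-agent string your crawl is currently configured with, since neither appears in this thread.

```python
import urllib.request

# Browser-style user-agent string suggested above.
BROWSER_UA = ("Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; "
              ".NET CLR 1.0.3705; .NET CLR 1.1.4322)")


def build_request(url, user_agent):
    """Build a GET request that sends the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_size(url, user_agent, timeout=10):
    """Fetch the URL and return the response body size in bytes."""
    with urllib.request.urlopen(build_request(url, user_agent),
                                timeout=timeout) as resp:
        return len(resp.read())


def compare_user_agents(url, agents):
    """Map each user-agent string to the body size the server returns."""
    return {ua: fetch_size(url, ua) for ua in agents}


# Example call (both arguments are placeholders to replace):
# compare_user_agents("http://www.example.com/page.html",
#                     ["ExampleCrawler/1.0", BROWSER_UA])
```

If the sizes differ sharply between the two user-agents, the server is most likely varying its output by user-agent.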