I tried a crawl of just those 2 pages with dups off. Both seem to come up with no body text. Not sure why. Will require more study of the html on those pages.
Those pages are returning different content based on user-agent. Adjust your user-agent to something the webserver likes. Maybe something like this will make it behave
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.0.3705; .NET CLR 1.1.4322)
Not sure I what I was looking at before, but looking at this again it would appear that the problem is not client related, but is that both of those pages have no text content, only . The appliance can find the links to the desired pages, http://www.ab.com/abjournal/nov2004/index.html and http://www.ab.com/networks/ethernet/index.html, but won't try to follow any links on the duplicate (empty) page.