Page 1 of 2

Bad characters in pdf

Posted: Wed Mar 01, 2006 12:04 pm
by nick107
http://icscsearch.icsc.org/texis/search ... mit=Submit

On many of the pdfs on the site, the Summary and Match Info results of the pdf are coming back as a bunch of garbage charaters. Any ideas?

I searched through the forum but couldn't find exactly what I was looking for.

Bad characters in pdf

Posted: Wed Mar 01, 2006 2:35 pm
by nick107
I also have all updates applied on the search appliance.

Bad characters in pdf

Posted: Wed Mar 01, 2006 5:55 pm
by mark
Looks like some issue with font/charset mapping. We're checking into it...

Bad characters in pdf

Posted: Thu Mar 02, 2006 1:03 pm
by Kai
We've found the issue; a fix (actually a workaround for lack of Unicode info in the PDF) should be available as an update sometime next week. Will let you know.

Bad characters in pdf

Posted: Wed Mar 22, 2006 4:09 pm
by nick107
I am getting garbage characters again on the following search result:

http://icscsearch.icsc.org/texis/search ... &prox=page

It looks like strange characters are the periods on the index page in between the section title and the page number

Bad characters in pdf

Posted: Wed Mar 22, 2006 5:25 pm
by John
What do you have Display Charset set to? If in the browser I tell it to display UTF-8 it seems correct.

Bad characters in pdf

Posted: Wed Mar 22, 2006 7:03 pm
by nick107
Both the display character and storage character set are set to blank, so it should be using UTF-8 right?

Bad characters in pdf

Posted: Wed Mar 22, 2006 9:00 pm
by John
Correct. That means in your look and feel you want to make sure you specify the charset as UTF-8 in the <head>, e.g.

<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

Bad characters in pdf

Posted: Wed Mar 22, 2006 10:36 pm
by nick107
Thanks, that solved that issue.

Bad characters in pdf

Posted: Wed Mar 29, 2006 10:02 am
by nick107
Ok, I am getting still getting some bad characters in search results, even with the charset set to UTF-8. On the following page:

http://icscsearch.icsc.org/texis/search ... review&cq=

the second and third result are good examples, among others.

Any ideas?