Page 1 of 2

HTML encoded characters

Posted: Mon Oct 17, 2005 10:16 am
by tmarino
Just installed Webinator 5.1. It looks like Texis is having trouble with HTML encoded characters. One result has an n-dash character – displaying as –. If you click the link in Texis to view the actual page, the character is displaying properly.

Any ideas?

HTML encoded characters

Posted: Mon Oct 17, 2005 10:22 am
by mark
Can you provide the url for the problem page?

HTML encoded characters

Posted: Mon Oct 17, 2005 10:50 am
by tmarino

HTML encoded characters

Posted: Mon Oct 17, 2005 11:50 am
by mark
That's not a public server and I was looking for the link to the page with the &ndash on it. Open a tech support ticket and attach the full output of
geturl http://pagewithndash

HTML encoded characters

Posted: Mon Oct 17, 2005 12:49 pm
by Kai
Note that – is a multi-byte Unicode character (U+2013) that cannot be converted to a monobyte character set. If your Display Charset is ISO-8859-1 or another monobyte charset, such characters will be translated to `?' because they are out of range.

HTML encoded characters

Posted: Mon Oct 17, 2005 1:19 pm
by tmarino

HTML encoded characters

Posted: Mon Oct 17, 2005 2:28 pm
by mark
I just indexed that page with 5.1.24 and default settings and all looks well. What version was your data indexed with?

HTML encoded characters

Posted: Mon Oct 17, 2005 3:26 pm
by tmarino
Webinator 5.1.0-Unix-w/plugin

HTML encoded characters

Posted: Mon Oct 17, 2005 3:34 pm
by mark
Download the latest scripts and try indexing (mode new) again.

HTML encoded characters

Posted: Mon Oct 17, 2005 5:40 pm
by tmarino
Can you post the URL of where to download the latest scripts?