UTF-8 Byte Order Marks and timport xml

Post Reply
sroth
Posts: 44
Joined: Mon Jul 23, 2007 11:21 am

UTF-8 Byte Order Marks and timport xml

Post by sroth »

When I attempt to use tmiport on a UTF-8 XML file that includes byte order marks (EF BB BF), I get the following error:
000 XML parsing error reported at byte offset 0.

Does texis read the byte order marks or rely on the xml declaration to know the encoding? If I remove the BOM, it appears that timport doesn't interpret the multi byte character correctly based on the fist line XML declaration.
User avatar
jason112
Site Admin
Posts: 347
Joined: Tue Oct 26, 2004 5:35 pm

UTF-8 Byte Order Marks and timport xml

Post by jason112 »

timport does not understand BOMs, but it wouldn't need th em for UTF-8.

What problems are you having with the multi-byte sequences?
Post Reply