I do get the following error when fetching the Chinese page:
<!-- 018 test:25: Cannot completely convert charset gb2312 to UTF-8 via converter "C:\MORPH3\etc\iconv" -f gb2312 -t UTF-8 -c: returned exit code
1 in the function httransbuf -->
Could this possibly be the cause of the errors in the xml?
Also, is there way to check if what has been fetched is already UTF-8 encoded? I'm thinking that I would need to add this check, before I use strfmt %hhV when outputing to xml.
<!-- 018 test:25: Cannot completely convert charset gb2312 to UTF-8 via converter "C:\MORPH3\etc\iconv" -f gb2312 -t UTF-8 -c: returned exit code
1 in the function httransbuf -->
Could this possibly be the cause of the errors in the xml?
Also, is there way to check if what has been fetched is already UTF-8 encoded? I'm thinking that I would need to add this check, before I use strfmt %hhV when outputing to xml.