On Mon, 2009-04-27 at 18:28 +0100, Chris Young wrote:
> On Mon, 27 Apr 2009 00:49:19 +0100, John-Mark Bell wrote:
> 
> > In which case, either the iconv filter in libparserutils is doing
> > something odd that doesn't work with the iconv() implementation you're
> > using, or the iconv() implementation itself is broken in some way.
> > 
> > It's almost impossible to tell with the information available, I'm
> > afraid.
> 
> I'm testing with it disabled, if it doesn't noticeably break any sites
> then I'm not particulary bothered about investigating further.

It will break sites. It rather depends on how many users you have that
require the ability to read Russian, Middle-Eastern, and CJK languages.

> > Does the csdetect test still fail? That's certainly odd, but likely
> > unrelated (as, unless I've forgotten how that works, iconv() isn't
> > involved)
> 
> Yes, it does:
> 
> 1: Detected charset windows-1252 (2252) Source 1 Expected  (0)
> FAIL - mibenum == parserutils_charset_mibenum_from_name( expected, 
> strlen(expected)) at line 133

OK. There's two problems here. Firstly, it's detected Windows-1252 and
not UTF-8. Secondly, it appears that there's no expected value. Is the
test data in the correct format?

It should be:

#data
<data goes here>
#encoding
UTF-8


J.


Reply via email to