Package: iso-codes
Version: 1.5-1
Severity: normal
Tags: l10n

The file iso_639_3.xml is encoded in ISO 8859-1, despite being declared as
UTF-8:

% head -1 /usr/share/xml/iso-codes/iso_639_3.xml 
<?xml version="1.0" encoding="UTF-8" ?>

For example:

% od -Ax -tx1a /usr/share/xml/iso-codes/iso_639_3.xml | grep -A1 ^024ba0
024ba0 46 72 61 6e 63 6f 2d 50 72 6f 76 65 6e e7 61 6c
         F   r   a   n   c   o   -   P   r   o   v   e   n   g   a   l

The character 'e7' is a "c with cedilla" encoded in ISO 8859-1.

Best regards,

-Christian

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 
'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.23.1-mooch.1 (PREEMPT)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) (ignored: LC_ALL 
set to en_US.UTF8)
Shell: /bin/sh linked to /bin/bash

-- no debconf information



-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Reply via email to