Package: iso-codes Version: 1.5-1 Severity: normal Tags: l10n The file iso_639_3.xml is encoded in ISO 8859-1, despite being declared as UTF-8:
% head -1 /usr/share/xml/iso-codes/iso_639_3.xml <?xml version="1.0" encoding="UTF-8" ?> For example: % od -Ax -tx1a /usr/share/xml/iso-codes/iso_639_3.xml | grep -A1 ^024ba0 024ba0 46 72 61 6e 63 6f 2d 50 72 6f 76 65 6e e7 61 6c F r a n c o - P r o v e n g a l The character 'e7' is a "c with cedilla" encoded in ISO 8859-1. Best regards, -Christian -- System Information: Debian Release: lenny/sid APT prefers unstable APT policy: (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 'experimental') Architecture: i386 (i686) Kernel: Linux 2.6.23.1-mooch.1 (PREEMPT) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) (ignored: LC_ALL set to en_US.UTF8) Shell: /bin/sh linked to /bin/bash -- no debconf information -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]