Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-10 Thread Bjørn Steensrud
 Fredag 9. april 2010 23.21.45 skrev Raphael Geissert :
 [should probably drop some CCs as this is becoming partially
 off-topic; adding CC to eglibc]

Other NO people may be better qualified to reply than I am, but here goes:

 Locale 'no' is not supported
 Locale 'no_NO' is not supported 
 Locale 'no_NY' is not supported


Good!We have been trying to get rid of those locales for quite some time. 
The two we use are nb_NO and nn_NO, for Bomål and Nynorsk, respectively. 
I know of only one other active locale for _NO:  se_NO,  Northern Sámi in 
Norway.

Bjørn, who translates KDE to nb and occasionally has time to do a debconf 
translation.


--
To UNSUBSCRIBE, email to debian-qt-kde-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/201004100907.26005.bjor...@skogkatt.homelinux.org



Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-09 Thread Christian PERRIER
Jumping in that thread (you would have guessed I would, right? :-))

 Mark Purcell wrote:
 
  On Friday 09 April 2010 08:30:14 Albert Astals Cid wrote:
  hm? eh? what? why would we remove a perfectly valid translation to a
  language spoken by 11 million people?
  
  Albert,
  
  I don't think the issue is the removal of the translations, rather the
  fact that the locale-code hne isn't defined in ISO 639-1  ISO 639-2.
  
  My reading of Chhattisgarhi_language[1] shows that the ISO 639-3 code is
  hne, whilst the ISO 639-2 code for this family is inc.
  
  I am raising this as an inconsistency. If we are accepting ISO 639-3
  locales then that is fine too and we will fix lintian.

We are:

bubu...@mykerinos:~/src/debian/iso-codes/git/iso_639_3 ls -l 
/usr/share/i18n/locales/hne_IN
-rw-r--r-- 1 root root 5393  7 févr. 18:26 /usr/share/i18n/locales/hne_IN

We also have several other locales with ISO-639-3 codes:

/usr/share/i18n/locales/ast_ES: Asturian (Spain)
/usr/share/i18n/locales/byn_ER: Bilin (Eritrea)
/usr/share/i18n/locales/crh_UA: Crimean Turkish (Ukraine)
/usr/share/i18n/locales/csb_PL: Kashubian (Poland)
/usr/share/i18n/locales/fil_PH: Filipino (Philippines)
/usr/share/i18n/locales/fur_IT: Friulian( Italy)
/usr/share/i18n/locales/gez_ER: Geez (Eritrea) 
/usr/share/i18n/locales/gez...@abegede
/usr/share/i18n/locales/gez_ET: Geez (Ethiopia) 
/usr/share/i18n/locales/gez...@abegede
/usr/share/i18n/locales/hne_IN: Chhattisgarhi (India)
/usr/share/i18n/locales/hsb_DE: Upper Sorbian
/usr/share/i18n/locales/mai_IN: Maithili (India)
/usr/share/i18n/locales/nan...@latin: Min Nan Chinese (Taiwan)
/usr/share/i18n/locales/nds_DE: Lower Saxon (Germany)
/usr/share/i18n/locales/nds_NL: Lower Saxon (Netherlands)
/usr/share/i18n/locales/nso_ZA: Northern Sotho (South Africa)
/usr/share/i18n/locales/pap_AN: Papiamento (Netherlands Antilles)
/usr/share/i18n/locales/shs_CA: Shuswap (Canada)
/usr/share/i18n/locales/sid_ET: Sidamo (Ethiopia)
/usr/share/i18n/locales/tig_ER: Tigre (Eritrea)
/usr/share/i18n/locales/wal_ET: Wolayttia (Ethiopia)

There are even some locales that correspond to *no* ISO-639-3 code
/usr/share/i18n/locales/ber_DZ: Berber (Algeria)
/usr/share/i18n/locales/ber_MA: Berber (Morocco)

(here I think this is a bug. The code to use should be the code for
Tamazight...but indeed the Berber/Tamazight language uses two scripts,
the fact that Berber and Tamazight are different or not is debated
between Algeria and Morocco and all this is a highly sensitive
political issue  and that may explain this inconsistency...which
is, still, done the wrong way)


All of them are here (and come from glibc upstream) because the said
languages have no ISO-639-2 code.

  
 
 Right, but the first faulty package is isoquery. The isoquery program is not 
 able to display the ISO 639-3 codes:
  TODO for Isoquery
  =
  
  - allow use of ISO 639-3
 
 The reason I assume is that there's no (given the nature of the ISO 639-3 
 standard) translation table between ISO 639-3 and ISO 639-2 codes.
 
 So I guess the most we can do is just inject the 639-3 codes and hope for 
 the best (i.e. hope people is going to use the best and most appropriate ISO 
 639-1..3 code.)

That's correct about isoquery's capabilities that should be enhanced.

Maybe, if possible, check that a locale exists for the said language.

We can quite safely assume that, if a locale exists with a given
ISO-639-3 code, then it has been carefully though by glibc maintainers
(either upstream or Debian/Ubuntu maintainers).

Indeed, if a translation file exists for a locale that doesn't exist,
there are chances that it isn't used.particularly because users
will have more problems in setting this locale in their environment.

So, maybe lintian could check that, if foo_BAR.po file exists in a
package, there is at least one foo_* file in /usr/share/i18n/locales

And, subliminal message to Chhattisgarhi translators: I would very
much welcome a work on localization for Debian Installer that would
help in adding Yet Another Indic Language to the already quite long
list of supported Indic languages in Debian and Ubuntu installers..:-)





signature.asc
Description: Digital signature


Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-09 Thread Raphael Geissert
On 9 April 2010 00:16, Christian PERRIER bubu...@debian.org wrote:
 Jumping in that thread (you would have guessed I would, right? :-))


:-)

 There are even some locales that correspond to *no* ISO-639-3 code
 /usr/share/i18n/locales/ber_DZ: Berber (Algeria)
 /usr/share/i18n/locales/ber_MA: Berber (Morocco)

(for lintian's current purposes, the country code is ignored; FWIW)

 Maybe, if possible, check that a locale exists for the said language.

 We can quite safely assume that, if a locale exists with a given
 ISO-639-3 code, then it has been carefully though by glibc maintainers
 (either upstream or Debian/Ubuntu maintainers).

I considered using /usr/share/i18n/SUPPORTED as the source of
information but it is far from being complete.

 So, maybe lintian could check that, if foo_BAR.po file exists in a
 package, there is at least one foo_* file in /usr/share/i18n/locales

Am I missing something or wouldn't adding the 639-3 codes to the list
of known and valid codes be enough?

Cheers,
-- 
Raphael Geissert - Debian Developer
www.debian.org - get.debian.net


-- 
To UNSUBSCRIBE, email to debian-qt-kde-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/y2he9fb436d1004082325m81f94ec3qf0a5d78f65bc6...@mail.gmail.com



Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-09 Thread Christian PERRIER

  So, maybe lintian could check that, if foo_BAR.po file exists in a
  package, there is at least one foo_* file in /usr/share/i18n/locales
 
 Am I missing something or wouldn't adding the 639-3 codes to the list
 of known and valid codes be enough?


Actually, if a locale doesn't exist in the glibc, translations for
that locale won't be used, so checking against
/usr/share/i18n/SUPPORTED seems to be the right thing to do.


I'm not sure, though, that it is really a good idea to check PO files
against supported locales, for upstream translations (ie non-debconf
ones) with lintian. The only case where it's relevant to lintian,
imho, is the case where the package is native (so upstream is
Debian).




signature.asc
Description: Digital signature


Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-09 Thread Raphael Geissert
[should probably drop some CCs as this is becoming partially
off-topic; adding CC to eglibc]

On 9 April 2010 12:59, Christian PERRIER bubu...@debian.org wrote:
 Actually, if a locale doesn't exist in the glibc, translations for
 that locale won't be used, so checking against
 /usr/share/i18n/SUPPORTED seems to be the right thing to do.

If that's true, then (after playing with Contents-i386.gz), by
comparing /usr/share/locale/*/ with the locales in SUPPORTED:
Locale 'ab' is not supported
Locale 'ace' is not supported
Locale 'ak' is not supported
Locale 'ang' is not supported
Locale 'ara' is not supported
Locale 'ar_PS' is not supported
Locale 'ay' is not supported
Locale 'az_IR' is not supported
Locale 'bal' is not supported
Locale 'b...@latin' is not supported
Locale 'bem' is not supported
Locale 'be-tarask' is not supported
Locale 'bi' is not supported
Locale 'bist' is not supported
Locale 'C' is not supported
Locale 'cat' is not supported
Locale 'c...@valencia' is not supported
Locale 'ckb' is not supported
Locale 'cn' is not supported
Locale 'cn_ZH' is not supported
Locale 'co' is not supported
Locale 'cpf' is not supported
Locale 'cpp' is not supported
Locale 'cs_CZ.utf8' is not supported
Locale 'cs-windows-1250' is not supported
Locale 'cz' is not supported
Locale 'de_DE.utf8' is not supported
Locale 'dk' is not supported
Locale 'dv' is not supported
Locale 'e...@boldquot' is not supported
Locale 'en_IGID' is not supported
Locale 'e...@quot' is not supported
Locale 'e...@shaw' is not supported
Locale 'en_UK' is not supported
Locale 'en...@piglatin' is not supported
Locale 'en_US.utf8' is not supported
Locale 'es_ES.utf8' is not supported
Locale 'fa_AF' is not supported
Locale 'ff' is not supported
Locale 'fi_FI.utf8' is not supported
Locale 'fr_FR.utf8' is not supported
Locale 'fr_FX' is not supported
Locale 'frp' is not supported
Locale 'gn' is not supported
Locale 'gos' is not supported
Locale 'gr' is not supported
Locale 'haw' is not supported
Locale 'io' is not supported
Locale 'it_IT.utf8' is not supported
Locale 'jbo' is not supported
Locale 'jp' is not supported
Locale 'jv' is not supported
Locale 'kab' is not supported
Locale 'kok' is not supported
Locale 'kos' is not supported
Locale 'la' is not supported
Locale 'lb' is not supported
Locale 'mal' is not supported
Locale 'malkovich' is not supported
Locale 'md' is not supported
Locale 'mus' is not supported
Locale 'mvo' is not supported
Locale 'my' is not supported
Locale 'my_MM' is not supported
Locale 'nap' is not supported
Locale 'nl_NL.utf8' is not supported
Locale 'no' is not supported
Locale 'no_NO' is not supported
Locale 'no_NY' is not supported
Locale 'oj' is not supported
Locale 'pis' is not supported
Locale 'pms' is not supported
Locale 'ps' is not supported
Locale 'pseudo' is not supported
Locale 'pt-br' is not supported
Locale 'qu' is not supported
Locale 'racv' is not supported
Locale 'rm' is not supported
Locale 'ru_RU.cp1251' is not supported
Locale 'ru_RU.koi8r' is not supported
Locale 'ru_RU.utf8' is not supported
Locale 'sco' is not supported
Locale 'son' is not supported
Locale 'sp' is not supported
Locale 's...@ije' is not supported
Locale 's...@ijekavian' is not supported
Locale 's...@ijekavianlatin' is not supported
Locale 's...@latin' is not supported
Locale 's...@latn' is not supported
Locale 'sw' is not supported
Locale 'tet' is not supported
Locale 'tlh' is not supported
Locale 'tpi' is not supported
Locale 'tvl' is not supported
Locale 'tw' is not supported
Locale 'twi' is not supported
Locale 'tzo' is not supported
Locale 'ua' is not supported
Locale 'urd' is not supported
Locale 'u...@cyrillic' is not supported
Locale 'wal' is not supported
Locale 'x-test' is not supported
Locale 'xx' is not supported
Locale 'zam' is not supported
Locale 'zh_cn' is not supported
Locale 'zh_TW.Big5' is not supported
Total: 106
Total excluding anything after [...@_.-]: 67

However, if I compare that list with the codes of ISO 639-1..3
(excluding anything after [...@_.-] -- similar to lintian's check):

Locale 'bist' is not supported
Locale 'C' is not supported
Locale 'cn' is not supported
Locale 'cz' is not supported
Locale 'dk' is not supported
Locale 'gr' is not supported
Locale 'jp' is not supported
Locale 'malkovich' is not supported
Locale 'md' is not supported
Locale 'pseudo' is not supported
Locale 'racv' is not supported
Locale 'sh' is not supported
Locale 'sp' is not supported
Locale 'ua' is not supported
Locale 'x' is not supported
Locale 'xx' is not supported
Total: 16

So, we have 67 - 16 = 51 locales that are not supported by glibc but
that we ship translations for. Great.

Something needs to be done.

 I'm not sure, though, that it is really a good idea to check PO files
 against supported locales, for upstream translations (ie non-debconf
 ones) with lintian. The only case where it's relevant to lintian,
 imho, is the case where the package is native (so upstream is
 Debian).

In fact, what lintian checks is the 

Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-08 Thread Mark Purcell
On Friday 09 April 2010 08:30:14 Albert Astals Cid wrote:
 hm? eh? what? why would we remove a perfectly valid translation to a
 language spoken by 11 million people?

Albert,

I don't think the issue is the removal of the translations, rather the fact 
that the locale-code hne isn't defined in ISO 639-1  ISO 639-2.

My reading of Chhattisgarhi_language[1] shows that the ISO 639-3 code is 
hne, whilst the ISO 639-2 code for this family is inc.

I am raising this as an inconsistency. If we are accepting ISO 639-3 locales 
then that is fine too and we will fix lintian.

Mark

W: kipi-plugins: unknown-locale-code hne
N: 
N:The package appears to ship locales for a language but uses an unknown
N:locale code as a subdirectory of /usr/share/locale. This usually results
N:in users of the intended target language not finding the locale. The
N:language codes used in the locale directories are those from the ISO
N:639-1 and ISO 639-2 standards, not those usually used as TLDs (which are
N:from the ISO 3166 standard).
N:
N:It is possible that the language code was mistyped or incorrectly
N:guessed from the language's or country's name.
N:
N:Refer to http://www.loc.gov/standards/iso639-2/php/code_list.php for
N:details.
N:
N:Severity: normal, Certainty: certain
N:
http://lintian.debian.org/tags/unknown-locale-code.html

[1] http://en.wikipedia.org/wiki/Chhattisgarhi_language


signature.asc
Description: This is a digitally signed message part.


Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-08 Thread Russ Allbery
Mark Purcell m...@debian.org writes:

 I don't think the issue is the removal of the translations, rather the
 fact that the locale-code hne isn't defined in ISO 639-1  ISO 639-2.

 My reading of Chhattisgarhi_language[1] shows that the ISO 639-3 code is
 hne, whilst the ISO 639-2 code for this family is inc.

 I am raising this as an inconsistency. If we are accepting ISO 639-3
 locales then that is fine too and we will fix lintian.

I suspect that we should be accepting ISO 639-3 codes as well as -1 and -2
codes.  inc looks like a catch-all that wouldn't be very useful for the
end user in language selection.

-- 
Russ Allbery (r...@debian.org)   http://www.eyrie.org/~eagle/


-- 
To UNSUBSCRIBE, email to debian-qt-kde-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/87ochtv56d@windlord.stanford.edu



Re: [Bug 233674] kipi-plugins: unknown-locale-code hne i8n

2010-04-08 Thread Raphael Geissert
Mark Purcell wrote:

 On Friday 09 April 2010 08:30:14 Albert Astals Cid wrote:
 hm? eh? what? why would we remove a perfectly valid translation to a
 language spoken by 11 million people?
 
 Albert,
 
 I don't think the issue is the removal of the translations, rather the
 fact that the locale-code hne isn't defined in ISO 639-1  ISO 639-2.
 
 My reading of Chhattisgarhi_language[1] shows that the ISO 639-3 code is
 hne, whilst the ISO 639-2 code for this family is inc.
 
 I am raising this as an inconsistency. If we are accepting ISO 639-3
 locales then that is fine too and we will fix lintian.
 

Right, but the first faulty package is isoquery. The isoquery program is not 
able to display the ISO 639-3 codes:
 TODO for Isoquery
 =
 
 - allow use of ISO 639-3

The reason I assume is that there's no (given the nature of the ISO 639-3 
standard) translation table between ISO 639-3 and ISO 639-2 codes.

So I guess the most we can do is just inject the 639-3 codes and hope for 
the best (i.e. hope people is going to use the best and most appropriate ISO 
639-1..3 code.)

Sorry for not noticing ISO 639-3 codes were being used back when I 
implemented the checks. I verified some of the cases that would trigger the 
warning but they were all true positives.

And like Russ said, switching to the inc code would be incorrect.

Cheers,
-- 
Raphael Geissert - Debian Developer
www.debian.org - get.debian.net


-- 
To UNSUBSCRIBE, email to debian-qt-kde-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/4bbe9167.9f15f10a.4bd2.2...@mx.google.com