> The UTF-8 is valid,

Strictly speaking, I think the 3-byte representation of a glyph (ñ) that takes two bytes in UTF-8 is not so much a matter of UTF-8 validity as of Unicode NFKD, isn't it?

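To make the byte counts concrete, here is a minimal sketch of how ñ encodes under each normalization form. It uses Python's standard `unicodedata` module rather than Perl and MARC::Charset, and the helper name `utf8_hex` is hypothetical, chosen only for this illustration.

```python
import unicodedata

# U+00F1 LATIN SMALL LETTER N WITH TILDE (the precomposed form of ñ)
COMPOSED = "\u00f1"


def utf8_hex(text: str, form: str) -> str:
    """Normalize `text` to `form`, then return its UTF-8 bytes as spaced hex."""
    normalized = unicodedata.normalize(form, text)
    return " ".join(f"{byte:02x}" for byte in normalized.encode("utf-8"))


if __name__ == "__main__":
    for form in ("NFC", "NFD", "NFKC", "NFKD"):
        print(form, utf8_hex(COMPOSED, form))
    # NFC  c3 b1     (2 bytes: precomposed U+00F1)
    # NFD  6e cc 83  (3 bytes: 'n' followed by U+0303 COMBINING TILDE)
    # NFKC c3 b1
    # NFKD 6e cc 83
```

Both byte sequences are well-formed UTF-8; they differ only in normalization form. For this particular character, NFD and NFKD give the same result, as do NFC and NFKC.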
> it just may not be in the ideal normalization
> form. The strings that MARC::Charset produces when it converts from
> MARC-8 are in a decomposed Unicode normalization form, either NFD or
> NFKD. Some web browsers can render NFD strings without any
> difficulty, while other ones seem to work better if NFC is used.
> Right now Koha passes UTF-8 strings to the browser without
> renormalizing them, but perhaps we should be automatically converting
> them to NFC?

Does it have to be maintained in NFKD for compatibility with...? In that case, shouldn't we cover all the possibilities for a wider audience, offering MARC-8 to UTF-8 conversion with Unicode normalization in NFD, NFKD and NFKC, in addition to the preferred option, NFC?

Ignacio Javier Gómez Rodríguez
Analyst - Programmer
Tel: 902905590 - Fax: 981571425
[EMAIL PROTECTED]
www.coremain.com

>
> Regards,
>
> Galen
> --
> Galen Charlton
> Koha Application Developer
> LibLime
> [EMAIL PROTECTED]
> p: 1-888-564-2457 x709

_______________________________________________
Koha-devel mailing list
Koha-devel@nongnu.org
http://lists.nongnu.org/mailman/listinfo/koha-devel