DO NOT REPLY [Bug 15787] - IANA charset names list inefficient; useful?

bugzilla Mon, 27 Jan 2003 14:35:56 -0800

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15787>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.


http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15787

IANA charset names list inefficient; useful?





------- Additional Comments From [EMAIL PROTECTED]  2003-01-27 22:37 -------
> So does ICU4C 2.2+ have an API that I can query which standard that a charset
name belongs to?

ICU4C 2.4 has ucnv_getCanonicalName() which can do this. For example:

bool isIANAEncoding(char* encodingName) const {
    UErrorCode errorCode=U_ZERO_ERROR;
    return
        ucnv_getCanonicalName(encodingName, "IANA", &errorCode)
        !=NULL;
}

See http://oss.software.ibm.com/icu/apiref/ucnv_8h.html#a95

Note that the IANA charset list contains some entries that are not charsets, and
that many entries are not well-defined. See in my original bug posting the
paragraph "I also wonder whether..."

ICU 2.4 does not have all possible IANA names in its convrtrs.txt, but you can
add them to it for your ICU distribution - and you can help us get a complete
list into ICU. For the latter please contact us. (icu list ->
http://oss.software.ibm.com/icu/archives/)

Best regards,
markus

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

DO NOT REPLY [Bug 15787] - IANA charset names list inefficient; useful?

Reply via email to