The serializer's Encodings.properties file is out of date
---------------------------------------------------------

         Key: XALANJ-2198
         URL: http://issues.apache.org/jira/browse/XALANJ-2198
     Project: XalanJ2
        Type: Improvement
    Versions: 2.7    
    Reporter: Brian Minchau


The serializer's Encodings.properties file is out of date.

Each line in this properties file is of this form:
javaName   mimeName1,mimeName2,...,mimeNameN  0xabdc

The mime names are the names that may appear as the encoding value in a 
stylesheet; all of them are aliases for the same encoding. 

The javaName is the name used internally by our XSLT implementation when 
opening the writer. Because the implementation is in Java, this must be a name 
that the JRE understands in a call like this:
        new OutputStreamWriter(OutputStream, javaName);
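
As a minimal sketch of that call, the following tries to open a writer for a 
given name and reports whether the JRE accepted it. The class name WriterProbe 
and the method encode are hypothetical and are not part of the serializer.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.UncheckedIOException;
import java.io.UnsupportedEncodingException;
import java.io.Writer;

// Hypothetical helper, not part of the serializer.
final class WriterProbe {

    /**
     * Encodes text through an OutputStreamWriter opened with javaName.
     * Returns null if the JRE does not recognize javaName.
     */
    static byte[] encode(String text, String javaName) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(bytes, javaName)) {
            writer.write(text);
        } catch (UnsupportedEncodingException e) {
            return null; // the JRE rejected this name
        } catch (IOException e) {
            // Not expected for an in-memory stream.
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        for (String name : new String[] {"ISO-8859-1", "UTF-8", "no-such-encoding-xyz"}) {
            byte[] out = encode("\u00e9", name);
            System.out.println(name + ": "
                    + (out == null ? "not supported" : out.length + " byte(s)"));
        }
    }
}
```

A name that fails here cannot serve as a javaName on that JRE, whatever the 
properties file says.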

The last field on the line is the hex value of the largest code point in the 
encoding. This value is no longer used and should be dropped.
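
The line format described above could be read roughly as follows. This is only 
a sketch: it assumes the three fields are separated by whitespace exactly as 
shown, and it is not the serializer's actual loading code. The class 
EncodingLine and the sample line in main are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical holder for one line of the properties file.
final class EncodingLine {
    final String javaName;
    final List<String> mimeNames;

    EncodingLine(String javaName, List<String> mimeNames) {
        this.javaName = javaName;
        this.mimeNames = mimeNames;
    }

    /**
     * Parses "javaName mimeName1,...,mimeNameN [0xNNNN]".
     * The trailing hex field is optional and ignored, since the
     * largest code point value is no longer used.
     */
    static EncodingLine parse(String line) {
        String[] fields = line.trim().split("\\s+");
        if (fields.length < 2) {
            throw new IllegalArgumentException(
                    "expected a javaName and a list of mime names: " + line);
        }
        return new EncodingLine(fields[0], Arrays.asList(fields[1].split(",")));
    }

    public static void main(String[] args) {
        // Illustrative line only, not copied from the real file.
        EncodingLine e = parse("ISO8859_1   ISO-8859-1,latin1  0x00FF");
        System.out.println(e.javaName + " -> " + e.mimeNames);
    }
}
```

Because the parser never looks at the third field, lines with and without the 
hex value are handled the same way, which would allow the value to be dropped 
gradually.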


Many encodings are missing when you compare the file to 
http://www.iana.org/assignments/character-sets. 
For example, consider this encoding from that web page:
Name: IBM275                                              [RFC1345,KXS2]
MIBenum: 2032
Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
Alias: EBCDIC-BR
Alias: cp275
Alias: csIBM275

The "Name:" value is the primary name of the encoding.
It is unclear to me if the "Name:" field has meaning for us. Is it ever the 
javaName or a mime name?
Which one of the aliases is the javaName?  In most cases we do consider one of 
the aliases to be the javaName, but which one?
Are all aliases, other than perhaps the javaName, mime names that can appear in 
stylesheets?
What about the "MIBenum:" field?  I've found that on my JRE, for a large number 
of encodings, it too can be used as a javaName to create the 
OutputStreamWriter.  In this case the string "2032" could be used.  Does that 
make it a mime name?
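
One way to gather evidence for these questions on a particular JRE is to ask 
java.nio.charset.Charset which names it accepts and which aliases it reports. 
The java.nio.charset.Charset documentation says that a supported charset listed 
in the IANA registry uses its registry name (the MIME-preferred one, where the 
registry marks one) as its canonical name, with the other registry names as 
aliases. Whether the serializer's javaName should follow that rule is a 
separate question. The sketch below only reports what the running JRE says, and 
the class name CharsetProbe is hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;
import java.nio.charset.UnsupportedCharsetException;
import java.util.Collections;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical probe, not part of the serializer.
final class CharsetProbe {

    /**
     * Returns the canonical name plus all aliases the JRE reports for name,
     * or an empty set if the JRE does not accept the name.
     */
    static Set<String> namesFor(String name) {
        try {
            Charset cs = Charset.forName(name);
            Set<String> names = new TreeSet<>(cs.aliases());
            names.add(cs.name());
            return names;
        } catch (IllegalCharsetNameException | UnsupportedCharsetException e) {
            return Collections.emptySet();
        }
    }

    public static void main(String[] args) {
        // Names from the IBM275 registry entry, plus its MIBenum as a string.
        String[] candidates = {"IBM275", "EBCDIC-BR", "cp275", "csIBM275", "2032"};
        for (String candidate : candidates) {
            System.out.println(candidate + " -> " + namesFor(candidate));
        }
    }
}
```

The output depends on the JRE and its installed charset providers, so a name 
accepted on one installation is not necessarily portable to another.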

If anyone can shed light on how to extract information from 
http://www.iana.org/assignments/character-sets or other sources, in order to 
get this properties file correct, and current, it would be appreciated.




