DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4000>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4000

class CharInfo - Loading of entities file assumes default platform encoding

           Summary: class CharInfo - Loading of entities file assumes
                    default platform encoding
           Product: XalanJ2
           Version: 2.2.x
          Platform: Other
        OS/Version: Other
            Status: UNCONFIRMED
          Severity: Normal
          Priority: Other
         Component: org.apache.xalan.serialize
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]
                CC: [EMAIL PROTECTED]


org.apache.xalan.serialize.CharInfo, line 159:

      reader = new BufferedReader(new InputStreamReader(is));

This bug reports sounds like a continuation to the ongoing discussion about bug 
#2291; yet, here we're not talking about any probably-standardized file type.

XMLEntities.res, HTMLEntities.res etc. are simple text files, encoded in ASCII. 
Using the above line to construct a bufferred reader to read this file, causes 
garbage to be read in non-ASCII platforms.

No standard is set on "user supplied" text files inside JAR files. Therefore, 
we cannot depend on the platform-specific encoding. If this file is released 
from Apache as ASCII, then CharInfo must explicitly treat it as an ASCII file.

This line should be changed to:

reader = new BufferedReader(new InputStreamReader(is,"ISO8859-1"));


   - Isaac

Reply via email to