We are getting the following error:
java.io.CharConversionException: Not an ISO 8859-1 character: <EF><BF><83>.
It is not just <83>. Sorry I missed those last time. 
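
To see which character those three values stand for, here is a minimal sketch. It assumes that <EF><BF><83> in the log are the raw UTF-8 bytes of a single character (that is a guess about how the log was rendered, not something the log states). The class and method names are made up for illustration.

```java
import java.nio.charset.Charset;

class DecodeLoggedBytes {
    // Decode the given bytes as UTF-8 and return the first code point.
    static int firstCodePoint(byte[] utf8Bytes) {
        String decoded = new String(utf8Bytes, Charset.forName("UTF-8"));
        return decoded.codePointAt(0);
    }

    public static void main(String[] args) {
        // Assumption: the log shows three raw UTF-8 bytes, EF BF 83.
        byte[] logged = {(byte) 0xEF, (byte) 0xBF, (byte) 0x83};
        int cp = firstCodePoint(logged);
        // ISO-8859-1 only covers code points 0x00..0xFF.
        System.out.printf("U+%04X, fits in ISO-8859-1: %b%n", cp, cp <= 0xFF);
    }
}
```

Under that assumption the bytes decode to U+FFC3, which is above 0xFF and so has no ISO-8859-1 representation.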

We are working with Java 6. With Tomcat 5.5.23 it works, but we
would like to use Tomcat 6.


awarnier wrote:
> 
> pramodpm wrote:
>> We are facing an encoding issue in apache-tomcat-6.0.20. This is working
>> in tomcat 5.5.23.  We are trying to make a GET request to an external site.
>> The page contains some utf-8 characters. 
> 
> No.  The page probably contains Unicode characters, all encoded in the 
> UTF-8 encoding.  What you probably mean is that some of these characters 
> have a Unicode codepoint above 127 decimal, and are thus represented by 
> 2 or more bytes in UTF-8.
> 
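
As a small illustration of the point about codepoints above 127, the sketch below counts how many bytes a few sample characters take in UTF-8. The sample characters and the helper name `utf8Length` are chosen for illustration only; they are not taken from the page in question.

```java
import java.nio.charset.Charset;

class Utf8Lengths {
    // Number of bytes the string occupies once encoded as UTF-8.
    static int utf8Length(String s) {
        return s.getBytes(Charset.forName("UTF-8")).length;
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source file independent of its own encoding.
        String[] samples = {"A", "\u00e9", "\u20ac"}; // U+0041, U+00E9, U+20AC
        for (String s : samples) {
            System.out.printf("U+%04X -> %d byte(s)%n", s.codePointAt(0), utf8Length(s));
        }
    }
}
```

U+0041 is below 128 and takes one byte, while U+00E9 and U+20AC take two and three bytes respectively.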
>> When we access the page from the
>> application we are getting the following error. 
>> Can you please help us to resolve this issue. Any help is appreciated.
>> 
>  From your log below, it does not look like you have problems when 
> accessing the external page.  Reading the page is fine, and the content 
> of the page is being properly translated, from its original UTF-8 
> encoding, into a Unicode string in Java (in your servlet).
> 
> However, what happens next is that your servlet is trying to output this 
> string to the servlet output stream, which is specified as having the 
> ISO-8859-1 charset/encoding.  And at least one of these internal Unicode 
> characters does not have a valid representation in ISO-8859-1.  So Java 
> complains at the moment you are trying to write out this character, 
> because it cannot translate it from the internal Unicode to the 
> desired external ISO-8859-1: that particular character does not 
> exist in ISO-8859-1, which contains only the 256 characters of the 
> Latin-1 set and covers only some Western European languages.
> 
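
The failure described above can be reproduced outside a servlet container. This is a rough sketch, not the actual javax.servlet code: `printLatin1` is a hypothetical stand-in for a byte-oriented print that writes one byte per char and rejects anything above 0xFF, and `printUtf8` is a hypothetical alternative that encodes the string as UTF-8 before writing.

```java
import java.io.ByteArrayOutputStream;
import java.io.CharConversionException;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.Charset;

class Latin1Print {
    // Hypothetical stand-in: one byte per char, chars above 0xFF are rejected.
    static void printLatin1(OutputStream out, String s) throws IOException {
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c > 0xFF) {
                throw new CharConversionException(
                        "Not an ISO 8859-1 character: U+" + Integer.toHexString(c));
            }
            out.write(c);
        }
    }

    // Hypothetical alternative: encode the whole string as UTF-8 bytes first.
    static void printUtf8(OutputStream out, String s) throws IOException {
        out.write(s.getBytes(Charset.forName("UTF-8")));
    }

    public static void main(String[] args) throws IOException {
        String text = "caf\u00e9 \uffc3"; // U+00E9 fits in Latin-1, U+FFC3 does not
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try {
            printLatin1(out, text);
        } catch (CharConversionException e) {
            System.out.println("Latin-1 path failed: " + e.getMessage());
        }
        out.reset();
        printUtf8(out, text);
        System.out.println("UTF-8 path wrote " + out.size() + " bytes");
    }
}
```

In a real servlet the corresponding idea would be to declare the response as UTF-8 (for example with `response.setCharacterEncoding("UTF-8")`) and write through `response.getWriter()` rather than printing strings to the `ServletOutputStream`. Whether that fits this application is an assumption, since the handler code is not shown.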
> Now, having written all that, I am still a bit uneasy if the <83> below 
> represents the hexadecimal Unicode codepoint of this character, because 
> 0083 is a character known as "NBH", which looks like some kind of 
> control character. So where would that one come from, in an HTML page?
> 
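
One way to check that unease with the standard library: the sketch below asks whether U+0083 is a control character and whether Java's own ISO-8859-1 encoder can encode it. The class and method names are made up for illustration.

```java
import java.nio.charset.Charset;

class C1ControlCheck {
    // True for the C0 and C1 control ranges (U+0000..U+001F, U+007F..U+009F).
    static boolean isControl(char c) {
        return Character.isISOControl(c);
    }

    // True if Java's ISO-8859-1 encoder has a byte for this char.
    static boolean latin1CanEncode(char c) {
        return Charset.forName("ISO-8859-1").newEncoder().canEncode(c);
    }

    public static void main(String[] args) {
        char nbh = '\u0083';
        System.out.println("control character: " + isControl(nbh));
        System.out.println("encodable in ISO-8859-1: " + latin1CanEncode(nbh));
    }
}
```

Both checks come out true: U+0083 is a control character, but it still fits in a single ISO-8859-1 byte. So <83> on its own would not be expected to trigger this error, which is consistent with the fuller <EF><BF><83> reported at the top of this message.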
>> 
>> WARNING: Handler caused Not an ISO 8859-1 character: <83>
>> 
>> java.io.CharConversionException: Not an ISO 8859-1 character: <83>
>> at javax.servlet.ServletOutputStream.print(ServletOutputStream.java:89)
>> at 
>> at java.io.OutputStream.write(OutputStream.java:99)
>> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
> For additional commands, e-mail: users-h...@tomcat.apache.org
> 
> 
> 

-- 
View this message in context: 
http://old.nabble.com/Tomcat-6-encoding-issue-tp26313150p26315285.html
Sent from the Tomcat - User mailing list archive at Nabble.com.

