[ 
https://issues.apache.org/jira/browse/SOLR-4265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13544121#comment-13544121
 ] 

Dawid Weiss commented on SOLR-4265:
-----------------------------------

This isn't the right approach. The double conversion from ISO-8859-1 to UTF-8 is 
simply wrong -- it assumes the conversion back to bytes is an identity, but the 
characters may already have been corrupted by that point.
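
To illustrate the point, here is a minimal sketch (not the code from the patch; 
the class and method names are made up for this example). It applies the 
ISO-8859-1 to UTF-8 round trip to a string the container decoded as ISO-8859-1, 
and then to one the container had already decoded correctly as UTF-8:

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Solr.
class RecodeDemo {
    // The workaround under discussion: turn the string back into bytes as
    // ISO-8859-1, then decode those bytes as UTF-8.
    static String recode(String s) {
        return new String(s.getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String expected = "m\u00EAme"; // the word from the issue report
        byte[] wire = expected.getBytes(StandardCharsets.UTF_8); // bytes sent by the browser

        // Case 1: the container decoded the bytes as ISO-8859-1.
        // The round trip recovers the original text.
        String asLatin1 = new String(wire, StandardCharsets.ISO_8859_1);
        System.out.println(recode(asLatin1).equals(expected)); // prints true

        // Case 2: the container already decoded the bytes as UTF-8.
        // The same round trip now damages a string that was correct.
        String asUtf8 = new String(wire, StandardCharsets.UTF_8);
        System.out.println(recode(asUtf8).equals(expected)); // prints false
    }
}
```

The round trip only helps when the container happened to use ISO-8859-1; with 
any other setting it corrupts the parameter.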

This is a container configuration issue. Jetty (or Tomcat, or whatever else) 
should be configured to decode servlet parameters using UTF-8. For Tomcat the 
instructions are here:

http://wiki.apache.org/tomcat/FAQ/CharacterEncoding#Q8

If there is a similar configuration setting for Jetty, it should be applied to 
the default distribution.
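
As a rough illustration of why the container's decoding charset matters, the 
sketch below uses java.net.URLDecoder as a stand-in for the container's 
parameter decoding (this is an assumption for demonstration only; it is not 
Jetty's or Tomcat's actual code path, and the class name is made up):

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

// Hypothetical demo class; URLDecoder stands in for the servlet container.
class ParamDecodeDemo {
    // Decode a percent-encoded parameter value using the named charset.
    static String decode(String rawValue, String charsetName) {
        try {
            return URLDecoder.decode(rawValue, charsetName);
        } catch (UnsupportedEncodingException e) {
            throw new IllegalArgumentException("unknown charset: " + charsetName, e);
        }
    }

    public static void main(String[] args) {
        // The word from the issue report, percent-encoded as UTF-8 bytes.
        String raw = "m%C3%AAme";

        // Decoding with UTF-8 yields the intended four-character word.
        System.out.println(decode(raw, "UTF-8").length());      // prints 4

        // Decoding with ISO-8859-1 turns each byte into its own character,
        // so the accented letter becomes two characters.
        System.out.println(decode(raw, "ISO-8859-1").length()); // prints 5
    }
}
```

The same bytes produce different strings depending on the charset chosen at 
decode time, which is why the setting belongs in the container configuration 
rather than in a later re-conversion.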
                
> Encoding problem from test console
> ----------------------------------
>
>                 Key: SOLR-4265
>                 URL: https://issues.apache.org/jira/browse/SOLR-4265
>             Project: Solr
>          Issue Type: Bug
>          Components: web gui
>    Affects Versions: 4.0
>         Environment: Windows but, environment independent
>            Reporter: Alex Rocher
>            Priority: Blocker
>
> When you type an accented character (in French, for example) in the console 
> query tester, no charset conversion is applied (servlet request charset 
> conversion).
> E.g.: "même" is converted into its ISO-8859-1 representation ==> fail
> The reason: getCharacterEncoding from HTTPRequest is not tested. If it's 
> null, the code should assume UTF-8 and convert accordingly.
