[ 
https://issues.apache.org/jira/browse/SOLR-214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491926
 ] 

Toru Matsuzawa commented on SOLR-214:
-------------------------------------

This problem can be reproduced with Tomcat 5.5.23.

It already occurred with "/update" before the SOLR-197 fix.
stream.getReader() returns an org.apache.catalina.connector.CoyoteReader.

CoyoteReader uses org.apache.catalina.connector.InputBuffer#realReadBytes(),
which reads the data byte by byte.
As a result, the characters in the index are garbled.
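
The kind of garbling described here can be illustrated outside Tomcat. The
following is only a minimal sketch, assuming the POSTed body is UTF-8 and is
decoded with a single-byte charset (ISO-8859-1) instead; the class name
CharsetGarbleDemo and the decode() helper are hypothetical and are not part of
Solr or Tomcat.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class CharsetGarbleDemo {

    // Hypothetical helper: decode the whole body through an InputStreamReader
    // that uses the given charset.
    static String decode(byte[] body, Charset charset) {
        try (Reader reader = new InputStreamReader(new ByteArrayInputStream(body), charset)) {
            StringBuilder sb = new StringBuilder();
            char[] buf = new char[256];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Three Japanese characters, written as escapes so the source file
        // encoding does not matter.
        String original = "\u65e5\u672c\u8a9e";
        byte[] body = original.getBytes(StandardCharsets.UTF_8);

        // Decoding with the charset the client used restores the text.
        String good = decode(body, StandardCharsets.UTF_8);
        // Decoding byte by byte turns every byte into its own character.
        String garbled = decode(body, StandardCharsets.ISO_8859_1);

        System.out.println(good.equals(original));    // true
        System.out.println(garbled.equals(original)); // false
        System.out.println(garbled.length());         // 9, one char per UTF-8 byte
    }
}
```

Whether CoyoteReader fails in exactly this way is an assumption; the sketch
only shows why a reader that ignores the request charset produces garbled
multi-byte text.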


> deficit of InputStreamReader support in anonymous class of ContentStream
> ------------------------------------------------------------------------
>
>                 Key: SOLR-214
>                 URL: https://issues.apache.org/jira/browse/SOLR-214
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Koji Sekiguchi
>         Attachments: UseInputStreamReader.patch
>
>
> After SOLR-197 is applied, POSTed Japanese XML contents turn into garbled 
> characters in the index.
> I can see the garbled characters through Luke. The issue was never seen 
> before SOLR-197.
> The cause of this problem is the lack of InputStreamReader support in 
> the anonymous ContentStream class in the 
> SolrRequestParsers.parseParamsAndFillStreams() method.
> Before SOLR-197, InputStreamReader was used in the 
> XmlUpdateRequestHandler.handleRequestBody() method:
>     // Cycle through each stream
>     for( ContentStream stream : req.getContentStreams() ) {
>       String charset = getCharsetFromContentType( stream.getContentType() );
>       Reader reader = null;
>       if( charset == null ) {
>         reader = new InputStreamReader( stream.getStream() );
>       }
>       else {
>         reader = new InputStreamReader( stream.getStream(), charset );
>       }
>       rsp.add( "update", this.update( reader ) );
>       
>       // Make sure it's closed
>       try { reader.close(); } catch( Exception ex ){}
>     }
> The patch applies the same handling to SolrRequestParsers.
> regards,
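
The quoted loop relies on getCharsetFromContentType() to choose the reader's
charset. The following is a minimal sketch of that idea, not Solr's actual
implementation: the class ContentTypeCharsetSketch and both of its methods are
hypothetical, and the parsing covers only the simple "type; charset=value"
form.

```java
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UnsupportedEncodingException;
import java.util.Locale;

class ContentTypeCharsetSketch {

    // Hypothetical helper: return the value of the charset parameter of a
    // Content-Type header, or null when the header carries none.
    static String charsetFromContentType(String contentType) {
        if (contentType == null) {
            return null;
        }
        String[] parts = contentType.split(";");
        // parts[0] is the media type itself; parameters start at index 1.
        for (int i = 1; i < parts.length; i++) {
            String param = parts[i].trim();
            if (param.toLowerCase(Locale.ROOT).startsWith("charset=")) {
                String value = param.substring("charset=".length()).trim();
                // Strip optional surrounding double quotes.
                if (value.length() >= 2 && value.startsWith("\"") && value.endsWith("\"")) {
                    value = value.substring(1, value.length() - 1);
                }
                return value.isEmpty() ? null : value;
            }
        }
        return null;
    }

    // Mirror the quoted loop: use the declared charset when there is one,
    // otherwise fall back to the platform default.
    static Reader openReader(InputStream in, String contentType)
            throws UnsupportedEncodingException {
        String charset = charsetFromContentType(contentType);
        return (charset == null)
                ? new InputStreamReader(in)
                : new InputStreamReader(in, charset);
    }

    public static void main(String[] args) {
        System.out.println(charsetFromContentType("text/xml; charset=UTF-8")); // UTF-8
        System.out.println(charsetFromContentType("text/xml"));                // null
    }
}
```

Falling back to the platform default when no charset is declared matches the
quoted code, but it means the result depends on the JVM's settings, so clients
posting non-ASCII XML are better off declaring the charset explicitly.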

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.