[ 
https://issues.apache.org/jira/browse/SOLR-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated SOLR-2381:
--------------------------------

    Attachment: SOLR-ServletOutputWriter.patch

Robert and me discussed about the Jetty OutputWriter and found out:

- It's much more broken, as it would even not support writing half surrogates 
in write(char[], ofset, length), which may also fail for other 
ResponseWriters!!!
- Jettys implementation is SLOOOOOOOOOOOW!

The attached patch now uses no Writer supplied by Jetty or any other servlet 
container at all - it just handles HTTP as it is: a binary protocol using byte 
streams. Like for UpdateReqHandler it uses its own mapper inside Solr (on the 
input side ContentStream is used for that).

As most output in solr is done using UTF-8 (the default), it uses a pre-looked 
up NIO Charset for that.

> The included jetty server does not support UTF-8
> ------------------------------------------------
>
>                 Key: SOLR-2381
>                 URL: https://issues.apache.org/jira/browse/SOLR-2381
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Blocker
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-2381.patch, SOLR-2381_xmltest.patch, 
> SOLR-ServletOutputWriter.patch, SOLR-ServletOutputWriter.patch, 
> jetty-6.1.26-patched-JETTY-1340.jar, jetty-6.1.26-patched-SOLR-2381.jar, 
> jetty-util-6.1.26-patched-JETTY-1340.jar, 
> jetty-util-6.1.26-patched-SOLR-2381.jar, post_utf8enhanced.sh, 
> utf8enhanced.xml
>
>
> Some background here: 
> http://www.lucidimagination.com/search/document/6babe83bd4a98b64/which_unicode_version_is_supported_with_lucene
> Some possible solutions:
> * wait and see if we get resolution on 
> http://jira.codehaus.org/browse/JETTY-1340. To be honest, I am not even sure 
> where jetty is being maintained (there is a separate jetty project at 
> eclipse.org with another bugtracker, but the older releases are at codehaus).
> * include a patched version of jetty with correct utf-8, using that patch.
> * remove jetty and include a different container instead.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to