[
https://issues.apache.org/jira/browse/SOLR-231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12494551
]
Yonik Seeley commented on SOLR-231:
-----------------------------------
> Solr should assume UTF-8 encoding unless the contentType says otherwise.
In general yes (when Solr is asked for a Reader).
For XML, we should probably give the parser an InputStream.
http://www.nabble.com/double-curl-calls-in-post.sh--tf2287469.html#a6369448
> By default, use UTF-8 for posted content streams
> ------------------------------------------------
>
> Key: SOLR-231
> URL: https://issues.apache.org/jira/browse/SOLR-231
> Project: Solr
> Issue Type: Improvement
> Reporter: Ryan McKinley
> Fix For: 1.2
>
> Attachments: SOLR-231-ContentType-UTF8.patch,
> SOLR-231-ContentType-UTF8.patch
>
>
> Solr should assume UTF-8 encoding unless the contentType says otherwise. To
> change the contentType and encoding set the header value with contentType
> ="text/xml; charset=utf-8"
> likewise, with stream.body=xxxx, will default to UTF-8 unless the
> stream.contentType says otherwise.
>
> For previous discussion, see:
> http://www.nabble.com/resin-and-UTF-8-in-URLs-tf3152910.html
> http://www.nabble.com/charset-in-POST-from-browser-tf3153057.html
> http://www.nabble.com/Re%3A-svn-commit%3A-r536048----lucene-solr-trunk-src-webapp-src-org-apache-solr-servlet-SolrRequestParsers.java-tf3712816.html
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.