Tomcat has to be configured to use UTF-8.

http://wiki.apache.org/solr/SolrTomcat?highlight=%28tomcat%29#URI_Charset_Config

On Fri, Mar 25, 2011 at 6:58 PM, kushti <sandyl...@gmail.com> wrote:
>
> Grijesh wrote:
>>
>> Try to send HTML data using format CDATA .
>>
> Doesn't work with
>
>
>> $content = "";
>>
>
> And my goal is not to avoid extraction, but have no problems with
> non-english chars
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/SOLR-problems-with-non-english-symbols-when-extracting-HTML-tp2729126p2733858.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Lance Norskog
goks...@gmail.com

Reply via email to