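Tomcat has to be configured to decode request URIs as UTF-8; otherwise non-English characters in request parameters can get mangled before Solr sees them. See the "URI Charset Config" section of the Solr wiki:
http://wiki.apache.org/solr/SolrTomcat?highlight=%28tomcat%29#URI_Charset_Config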
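
For reference, a minimal sketch of what that setting usually looks like in Tomcat's conf/server.xml. The URIEncoding attribute is the relevant part; the port, protocol, and timeout values below are just the stock defaults and are assumptions about your install, so keep whatever your existing Connector already has:

```xml
<!-- conf/server.xml: add URIEncoding to the existing HTTP Connector. -->
<!-- port, protocol, connectionTimeout and redirectPort are assumed defaults; -->
<!-- only URIEncoding="UTF-8" is the change being suggested. -->
<Connector port="8080" protocol="HTTP/1.1"
           connectionTimeout="20000"
           redirectPort="8443"
           URIEncoding="UTF-8"/>
```

Tomcat needs a restart to pick this up. As far as I know, older Tomcat versions decode URIs as ISO-8859-1 unless told otherwise, which would explain the broken characters, but I have not checked this against your setup.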

On Fri, Mar 25, 2011 at 6:58 PM, kushti <sandyl...@gmail.com> wrote:
>
> Grijesh wrote:
>>
>> Try to send HTML data using format CDATA .
>>
> Doesn't work with
>
>> $content = "";
>>
>
> And my goal is not to avoid extraction, but have no problems with
> non-english chars
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/SOLR-problems-with-non-english-symbols-when-extracting-HTML-tp2729126p2733858.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

--
Lance Norskog
goks...@gmail.com