Hi,

What happens when you set the default encoder to solr.highlight.DefaultEncoder in solrconfig.xml?
<!-- Configure the standard encoder -->
<encoder name="html" default="true" class="solr.highlight.DefaultEncoder"/>

----- Original Message -----
From: Pavel Kukačka <pavel.kuka...@seznam.cz>
To: solr-user@lucene.apache.org
Cc:
Sent: Saturday, April 30, 2011 1:31 PM
Subject: Re: Highlighting words with non-ascii chars

Hello,

I've hit a (probably trivial) roadblock with Solr 3.1 that I don't know how to overcome.

I have a document with common fields (title, keywords, content) and I'm trying to use highlighting. With queries that use only ASCII characters there is no problem; it works smoothly. However, when I search for a Czech word containing non-ASCII characters (for example "slovíčko":
http://localhost:8983/solr/select/?q=slov%C3%AD%C4%8Dko&version=2.2&start=0&rows=10&indent=on&hl=on&hl.fl=*),
the document is found, but the response doesn't contain the highlighted snippet in the highlighting node. There is only an empty node, like this:

******************
. . .
<lst name="highlighting">
  <lst name="2009"/>
</lst>
************************

When searching for the other keyword
(http://localhost:8983/solr/select/?q=slovo&version=2.2&start=0&rows=10&indent=on&hl=on&hl.fl=*),
the resulting response is fine, like this:

************************************
<lst name="highlighting">
  <lst name="2009">
    <arr name="user_keywords">
      <str>slov&#237;&#269;ko <em id="highlighting">slovo</em></str>
    </arr>
  </lst>
</lst>
************************************

Has anyone come across this problem?

Cheers,
Pavel
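
For anyone reproducing this, it may help to first rule out an encoding problem on the request side. The following is a minimal sketch using only the Python 3 standard library. The helper name build_select_url is hypothetical, and the host, port, and parameters are simply copied from the URLs quoted above. It only checks that the query term is percent-encoded as UTF-8 and that the numeric character references in the working response decode to the same word. It does not query Solr and does not diagnose the highlighter itself.

```python
from html import unescape
from urllib.parse import unquote, urlencode

# Base URL copied from the message above; adjust for your own instance.
SOLR_SELECT = "http://localhost:8983/solr/select/"


def build_select_url(term: str, base: str = SOLR_SELECT) -> str:
    """Build the highlighting query URL quoted in the message.

    build_select_url is a hypothetical helper for illustration only.
    urlencode percent-encodes non-ASCII characters as UTF-8 by default;
    safe="*" keeps the literal "*" in hl.fl, as in the original URL.
    """
    params = {
        "q": term,
        "version": "2.2",
        "start": 0,
        "rows": 10,
        "indent": "on",
        "hl": "on",
        "hl.fl": "*",
    }
    return base + "?" + urlencode(params, safe="*")


def decode_response_text(xml_text: str) -> str:
    """Decode numeric character references such as &#237; into characters."""
    return unescape(xml_text)


if __name__ == "__main__":
    # The percent-encoded term in the URL should round-trip to the Czech word.
    print(build_select_url("slovíčko"))
    print(unquote("slov%C3%AD%C4%8Dko"))
    # The stored value in the working response is the same word.
    print(decode_response_text("slov&#237;&#269;ko"))
```

If the generated URL matches the one you are sending and both decoded values print the same word, the request encoding is probably not the culprit, and the encoder or analyzer configuration is the next thing to look at.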