Hi, thanks for pointing me to the encoder config - this change alone didn't solve it, though - it just leaves normal characters without HTML entities - like this in the non-problematic case: ****************************** <lst name="highlighting"> <lst name="2009"> <arr name="user_keywords"> <str>slovíčko <em id="highlighting">slovo</em></str> </arr> </lst> </lst> ****************************** Searching for non-ascii stuff isn't changed. I've went through the wiki & guides but haven't found anything related to this.
Thanks though, Pavel Ahmet Arslan píše v So 30. 04. 2011 v 14:10 +0200: > Hi, > > What happens when you set the default encoder to > solr.highlight.DefaultEncoder in solrconfig.xml? > > <!-- Configure the standard encoder --> > <encoder name="html" default="true" class="solr.highlight.DefaultEncoder"/> > > > > > > ----- Original Message ----- > From: Pavel Kukačka <pavel.kuka...@seznam.cz> > To: solr-user@lucene.apache.org > Cc: > Sent: Saturday, April 30, 2011 1:31 PM > Subject: Re: Highlighting words with non-ascii chars > > Hello, > > I've hit a (probably trivial) roadblock I don't know how to overcome with > Solr 3.1: > I have a document with common fields (title, keywords, content) and I'm > trying to use highlighting. > With queries using ASCII characters there is no problem; it works > smoothly. However, > when I search using a czech word including non-ascii chars (like "slovíčko" > for example - > http://localhost:8983/solr/select/?q=slov%C3%AD%C4%8Dko&version=2.2&start=0&rows=10&indent=on&hl=on&hl.fl=*), > the document is found, but > the response doesn't contain the highlighted snippet in the highlighting node > - there is only an > empty node - like this: > ****************** > . > . > . > <lst name="highlighting"> > <lst name="2009"/> > </lst> > ************************ > > > When searching for the other keyword ( > http://localhost:8983/solr/select/?q=slovo&version=2.2&start=0&rows=10&indent=on&hl=on&hl.fl=*), > the resulting response is fine - like this: > ************************************ > <lst name="highlighting"> > <lst name="2009"> > <arr name="user_keywords"> > <str>slov&#237;&#269;ko <em > id="highlighting">slovo</em></str> > </arr> > </lst> > </lst> > > ************************************ > > Did anyone come accross this problem? > Cheers, > Pavel