Hi

Sorry about doubble messaging, it was from mistake

I am working on an application that searches documents with lucene. The documents are taken from a database and the basic field is in HTML. The index and searching works fine with Greek characters. I am using the lucene to take the ID and then the to get the corresponding record from database. My problem is that: i want to highlight the field with HTML query string that using on searching.

Is it possible to  do  with  Hightlighter or  i must  write my own  class?




Grant Ingersoll wrote:
Because there are no matches? Have you checked your index, etc? Do you get matches for that query normally in Greek against your index (nevermind highlighting)? Are your analyzers the same? Are your English Fields stored and the Greek ones not? Does field "f" contain Greek? It could be a whole host of reasons and that snippet of code is not enough to go from. Check your index with Luke, etc. See the FAQ for more help, too.

Also, please don't repost the same exact question 45 minutes later b/c you didn't get an immediate response at 3:48 AM East Coast US time. It just clutters email and makes it less likely that you will get a response.

-Grant

On Jun 26, 2008, at 3:48 AM, [EMAIL PROTECTED] wrote:

Hello

i am having the following code to highlight a text

public String highlight(String text, String query ) throws IOException {
TermQuery query = new TermQuery(new Term("f", query));
   QueryScorer scorer = new QueryScorer(query);
   SimpleHTMLFormatter formatter =
       new SimpleHTMLFormatter("<span class=\"highlight\">",
           "</span>");
   Highlighter highlighter = new Highlighter(formatter, scorer);
   Fragmenter fragmenter = new SimpleFragmenter(50);
   highlighter.setTextFragmenter(fragmenter);

   TokenStream tokenStream = new GreekAnalyzer()
       .tokenStream("f", new StringReader(text));

String result = highlighter.getBestFragments(tokenStream, text, 5, "...");

 return result;
}

if i use a text with english characters works fine if i try a text with Greek Character i am getting en empty string as result.
Why?

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


--------------------------
Grant Ingersoll
http://www.lucidimagination.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ








---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
------------------------------------------------------------------------


No virus found in this incoming message.
Checked by AVG. Version: 8.0.101 / Virus Database: 270.4.3/1525 - Release Date: 29/6/2008 3:09 ìì






---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to