500 is the max size of text fragments to be returned by highlight. It shouldn't 
be the problem here, as far as I understand highlight.

Gong li, how is defined the field "contents" ? Is it the only field on which 
the search is made ?

Pierre

-----Message d'origine-----
De : Ian Lea [mailto:ian....@gmail.com] 
Envoyé : mercredi 16 mars 2011 22:29
À : java-user@lucene.apache.org
Objet : Re: About highlighter

I know nothing about highlighting but that 500 looks like a good place
to start investigating.


--
Ian.


On Tue, Mar 15, 2011 at 8:47 PM, Cescy <ee07b...@gmail.com> wrote:
> Hi,
>
>
> My highlight code is shown as following:
>
>
>  QueryScorer scorer = new QueryScorer(query);
>  Highlighter highlighter = new Highlighter(simpleHTMLFormatter, scorer);
>  highlighter.setTextFragmenter(new SimpleSpanFragmenter(scorer, 500));
>  String contents = doc.get("contents");
>  TokenStream tokenStream = 
> TokenSources.getAnyTokenStream(searcher.getIndexReader(), 
> topDocs.scoreDocs[i].doc, "contents", doc, analyzer);
>  String[] snippet = highlighter.getBestFragments(tokenStream, contents, 10);
>
>
>
> snippet is the result contexts and then I will print out them on the screen.
> But If I may search for a keyword at the last few paragraph and the essay is 
> too long (1000-2000 words), it will return "document found" and 
> snippet.length=0 (i.e. document is found but context is NOT found). Why???
>
>
> How could I fix the problem?

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to