Thanks Jörn for the answer,

I use post tool to index html documents, so the html tags are stripped when 
indexed and stored. The remaining text is mapped to the field content by 
default. 

hl.fragsize=0 works perfect for the indexed document, but I can only display 
highlighted text-only version of html document because the html tags are 
stripped.

So is it possible to index and store the html document without stripping the 
html tags, so that when the document is displayed with hl.fragsize=0 parameter, 
it is displayed as original html document?

Or

Is it possible to give a whole html document as a parameter to the Unified 
highlighter so that output is also a highlighted html document?

Or 

Do you have a better idea to highlight the keywords of the whole html document? 

 Thanks,
 
 Serkan

-----Original Message-----
From: Jörn Franke [mailto:jornfra...@gmail.com] 
Sent: Sunday, May 24, 2020 1:22 PM
To: solr-user@lucene.apache.org
Subject: Re: highlighting a whole html document using Unified highlighter

hl.fragsize=0

https://lucene.apache.org/solr/guide/8_5/highlighting.html



> Am 24.05.2020 um 11:49 schrieb Serkan KAZANCI <ser...@kazanci.com.tr>:
> 
> Hi,
> 
> 
> 
> I use solr to search over a million html documents, when a document is
> searched and displayed, I want to highlight the keywords that are used to
> find and access the document.
> 
> 
> 
> Unified highlighter is fast, accurate and supports different languages but
> only highlights passages with given parameters.
> 
> 
> 
> How can I highlight a whole html document using Unified highlighter? I have
> written a php code but it cannot do the complex word stemming functions.
> 
> 
> 
> 
> 
> Thanks,
> 
> 
> 
> Serkan
> 

Reply via email to