How to write a 'fq' by joining two different indexes.

2020-05-24 Thread Siva ReddySM
Hi, We have a requirement to get a list of products based on product attribute and inventory. Problem, use case and test data is given below. InentoryIndex is a sub index to ProductIndex. Both ProductIndex and InventoryIndex share common node called "catentry_id". Example of ProductIndex is

Re: highlighting a whole html document using Unified highlighter

2020-05-24 Thread Jörn Franke
Hmm maybe more insights on the use case would be useful. It looks like what David says about metadata could make sense in your scenario depending on the requirements... > Am 24.05.2020 um 13:20 schrieb Serkan KAZANCI : > > Thanks Jörn for the answer, > > I use post tool to index html

Re: highlighting a whole html document using Unified highlighter

2020-05-24 Thread Serkan KAZANCI
All clear. Thanks David, > On 24 May 2020, at 18:57, David Smiley wrote: > > These strategies are not mutually exclusive. Yes I do suggest having the > HTML in whole go into one searchable field to satisfy your highlighting > use-case. But I can imagine you will also want some document

Re: highlighting a whole html document using Unified highlighter

2020-05-24 Thread David Smiley
These strategies are not mutually exclusive. Yes I do suggest having the HTML in whole go into one searchable field to satisfy your highlighting use-case. But I can imagine you will also want some document metadata in separate fields. It's up to you to parse that out somehow and add it. You

RE: highlighting a whole html document using Unified highlighter

2020-05-24 Thread Serkan KAZANCI
Hi David, I have many meta-tags in html documents like which matches the field descriptions in schema file. As I understand, you propose to index the whole html document as one text file and map it to a search field (do you?) . That would take care of the html highlight issue, however I

Re: highlighting a whole html document using Unified highlighter

2020-05-24 Thread David Smiley
Instead of stripping the HTML for the stored value, leave it be and remove it during the analysis stage with solr.HTMLStripCharFilterFactory This means the searchable text

keyword match statistics information

2020-05-24 Thread Serkan KAZANCI
Hi, Related to my previous question. How is it possible to retrieve keyword match statistics information for a matched document? Information like which variations of keyword has matched (as is/stemmed versions/wildcard versions) and frequency of those matches. Let’s say we searched

RE: highlighting a whole html document using Unified highlighter

2020-05-24 Thread Serkan KAZANCI
Thanks Jörn for the answer, I use post tool to index html documents, so the html tags are stripped when indexed and stored. The remaining text is mapped to the field content by default. hl.fragsize=0 works perfect for the indexed document, but I can only display highlighted text-only version

Re: highlighting a whole html document using Unified highlighter

2020-05-24 Thread Jörn Franke
hl.fragsize=0 https://lucene.apache.org/solr/guide/8_5/highlighting.html > Am 24.05.2020 um 11:49 schrieb Serkan KAZANCI : > > Hi, > > > > I use solr to search over a million html documents, when a document is > searched and displayed, I want to highlight the keywords that are used to >

highlighting a whole html document using Unified highlighter

2020-05-24 Thread Serkan KAZANCI
Hi, I use solr to search over a million html documents, when a document is searched and displayed, I want to highlight the keywords that are used to find and access the document. Unified highlighter is fast, accurate and supports different languages but only highlights passages with given