How many documents are in the collection, how many groups are there, and how
long does the search take with grouping versus without grouping?

Also, if you remove the custom sort, is it still slow?

From: java-user@lucene.apache.org At: 10/09/20 12:27:25 To: Diego Ceccarelli
(BLOOMBERG/ LONDON), java-user@lucene.apache.org
Subject: Re: Deduplication of search result with custom with custom sort

Yes, it is

Fri, 9 Oct 2020 at 14:25, Diego Ceccarelli (BLOOMBERG/ LONDON) <
dceccarel...@bloomberg.net>:

> Is the field that you are using to dedupe stored as a docvalue?
>
> From: java-user@lucene.apache.org At: 10/09/20 12:18:04 To:
> java-user@lucene.apache.org
> Subject: Deduplication of search result with custom with custom sort
>
> Hi,
> I need to deduplicate search results by a specific field, and I have no idea
> how to implement this properly.
> I have tried grouping with setGroupDocsLimit(1), and it gives me the expected
> results, but its performance is not very good.
> I think that I need something like DiversifiedTopDocsCollector, but
> suitable for collecting TopFieldDocs.
> Is there any way to achieve deduplication with existing Lucene
> components, or do I need to implement my own DiversifiedTopFieldsCollector?
>
>
>
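
For readers following the thread, the result the original question asks for (one hit per value of the dedupe field, in custom sort order) can be sketched in plain Java. This is only an illustration of the expected output, not Lucene code and not a fix for the performance problem: DedupSketch, Hit, dedupeKey, sortValue and dedupe are hypothetical names, and the sketch assumes all hits fit in memory.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Lucene.
final class DedupSketch {

    /** A search hit reduced to a doc id, the dedupe field value, and one sort value. */
    static final class Hit {
        final int docId;
        final String dedupeKey;
        final long sortValue;

        Hit(int docId, String dedupeKey, long sortValue) {
            this.docId = docId;
            this.dedupeKey = dedupeKey;
            this.sortValue = sortValue;
        }
    }

    /** Returns at most topN hits in sort order, keeping only the best-ranked hit per dedupe key. */
    static List<Hit> dedupe(List<Hit> hits, Comparator<Hit> sort, int topN) {
        List<Hit> sorted = new ArrayList<>(hits);
        sorted.sort(sort); // apply the custom sort first

        Set<String> seenKeys = new HashSet<>();
        List<Hit> result = new ArrayList<>();
        for (Hit hit : sorted) {
            if (result.size() >= topN) {
                break;
            }
            // Set.add returns false when the key was already seen, so later duplicates are skipped.
            if (seenKeys.add(hit.dedupeKey)) {
                result.add(hit);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Hit> hits = List.of(
                new Hit(1, "a", 30),
                new Hit(2, "b", 10),
                new Hit(3, "a", 20),
                new Hit(4, "c", 40));
        Comparator<Hit> bySortValue = Comparator.comparingLong(h -> h.sortValue);
        for (Hit h : dedupe(hits, bySortValue, 10)) {
            System.out.println(h.docId + " " + h.dedupeKey + " " + h.sortValue);
        }
    }
}
```

Grouping with a group docs limit of 1 is expected to return the same kind of result: the top document of each group, with groups ordered by the sort.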

