How many documents in the collection, how many groups, and how long is it taking to do the grouping vs no grouping?
Also, if you remove the custom sort is it still slow? From: [email protected] At: 10/09/20 12:27:25To: Diego Ceccarelli (BLOOMBERG/ LONDON ) , [email protected] Subject: Re: Deduplication of search result with custom with custom sort Yes, it is пт, 9 окт. 2020 г. в 14:25, Diego Ceccarelli (BLOOMBERG/ LONDON) < [email protected]>: > Is the field that you are using to dedupe stored as a docvalue? > > From: [email protected] At: 10/09/20 12:18:04To: > [email protected] > Subject: Deduplication of search result with custom with custom sort > > Hi, > I need to deduplicate search results by specific field and I have no idea > how to implement this properly. > I have tried grouping with setGroupDocsLimit(1) and it gives me expected > results, but has not very good performance. > I think that I need something like DiversifiedTopDocsCollector, but > suitable for collecting TopFieldDocs. > Is there any possibility to achieve deduplication with existing lucene > components, or do I need to implement my own DiversifiedTopFieldsCollector? > > >
