On Sat, Sep 17, 2011 at 12:56 AM, Marvin Humphrey <[email protected]> wrote: > On Fri, Sep 16, 2011 at 03:00:21PM +0200, goran kent wrote: >> Any support for collapsing duplicate documents based on a field? > > I wrote a DedupingSearcher class for KinoSearch a while ago that did exactly > this, and I'd be happy to contribute it to the ASF. It will take some > modernizing to get it compatible with Lucy, though.
Any possibility of squeezing that into your schedule? > >> Such a thing possible? > > The algorithm is to rerun the search if there is not sufficient diversity in > the search results, adding exclusions to the query each time to suppress the > unwanted hits. ouch, that doesn't sound good for performance. Am I right?
