Lucene used to have a DuplicateFilter to do this, but we removed it recently ... see https://issues.apache.org/jira/browse/LUCENE-6633 for some discussion as to why.
Mike McCandless http://blog.mikemccandless.com On Thu, Dec 1, 2016 at 2:39 PM, Matt Hicks <[email protected]> wrote: > I'm trying to write a LeafCollector that filters out duplicates for a > specific field. However, looking at the JavaDoc for `collect` it says not > to call `IndexSearch.doc` or `IndexReader.document`. How am I supposed to > determine the value of a field and then exclude it? --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
