[ https://issues.apache.org/jira/browse/LUCENE-3129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated LUCENE-3129:
---------------------------------------

    Attachment: LUCENE-3129.patch

Patch.

I ran a quick perf test: single pass was ~18% faster than two-pass (using cache). Not as much as I expected... but every bit counts!

> Single-pass grouping collector based on doc blocks
> --------------------------------------------------
>
>                 Key: LUCENE-3129
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3129
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/grouping
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: 3.2, 4.0
>
>         Attachments: LUCENE-3129.patch
>
>
> LUCENE-3112 enables adding/updating a contiguous block of documents to
> the index, guaranteed (yet, experimental!) to retain adjacent docID
> assignment through the full life of the index as long as the app doesn't
> delete individual docs from the block.
> When an app does this, it can enable neat features like LUCENE-2454
> (nested documents) and post-group facet counting (LUCENE-3097).
> It also makes single-pass grouping possible, when you group by
> the "identifier" field shared by the doc block, since we know we will
> see a given group only once, with all of its docs within one block.
> This should be faster than the fully general two-pass collectors we
> already have.
> I'm working on a patch but not quite there yet...

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
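
To illustrate why doc blocks make a single pass sufficient, here is a minimal sketch in plain Java. It is not the code in LUCENE-3129.patch and uses no Lucene classes: Hit, Group, and groupSinglePass are hypothetical names invented for this example. It assumes hits arrive in increasing docID order and that every group was indexed as one contiguous block, so a change in group key means the previous group is complete and will never be seen again.

```java
import java.util.ArrayList;
import java.util.List;

public class BlockGroupingSketch {

    /** One matching document (hypothetical type, not a Lucene class). */
    public static final class Hit {
        public final int docId;
        public final String groupKey;
        public final float score;

        public Hit(int docId, String groupKey, float score) {
            this.docId = docId;
            this.groupKey = groupKey;
            this.score = score;
        }
    }

    /** One finished group: its key, its hits in docID order, and its best score. */
    public static final class Group {
        public final String key;
        public final List<Hit> hits = new ArrayList<>();
        public float maxScore = Float.NEGATIVE_INFINITY;

        public Group(String key) {
            this.key = key;
        }
    }

    /**
     * Groups hits in a single pass.
     * Assumption: hits are in increasing docID order and each group occupies one
     * contiguous doc block, so a new group key closes the previous group for good.
     */
    public static List<Group> groupSinglePass(List<Hit> hitsInDocIdOrder) {
        List<Group> groups = new ArrayList<>();
        Group current = null;
        for (Hit hit : hitsInDocIdOrder) {
            // A key change marks a block boundary: start a new group.
            if (current == null || !current.key.equals(hit.groupKey)) {
                current = new Group(hit.groupKey);
                groups.add(current);
            }
            current.hits.add(hit);
            current.maxScore = Math.max(current.maxScore, hit.score);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<Hit> hits = new ArrayList<>();
        hits.add(new Hit(0, "book-1", 0.4f));
        hits.add(new Hit(2, "book-1", 1.2f)); // doc 1 did not match the query
        hits.add(new Hit(3, "book-2", 0.9f));
        hits.add(new Hit(4, "book-2", 0.3f));
        hits.add(new Hit(5, "book-2", 0.7f));
        for (Group g : groupSinglePass(hits)) {
            System.out.println(g.key + " hits=" + g.hits.size() + " maxScore=" + g.maxScore);
        }
    }
}
```

A general two-pass collector cannot do this, because without the block guarantee the docs of one group may be scattered across the docID space. The sketch also keeps every group and every hit for simplicity; a real collector would more likely retain only the top groups and the top docs within each.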