[
https://issues.apache.org/jira/browse/LUCENE-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668971#action_12668971
]
Jason Rutherglen commented on LUCENE-1476:
------------------------------------------
{quote}
Just run sortBench2.py in contrib/benchmark of trunk & patch
areas. Then run sortCollate2.py to make the Jira table (-jira) or
print a human readable output (default). You'll have to make your own
Wikipedia indices with the pctg deletes, then edit sortBench2.py &
sortCollate2.py to fix the paths.
All they do is write an alg file, run the test, and parse the output
file to gather best of 5.
{quote}
This seems like something we can port to Java and get into
contrib/benchmark. Particularly automatically creating the indexes.
> BitVector implement DocIdSet, IndexReader returns DocIdSet deleted docs
> -----------------------------------------------------------------------
>
> Key: LUCENE-1476
> URL: https://issues.apache.org/jira/browse/LUCENE-1476
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Index
> Affects Versions: 2.4
> Reporter: Jason Rutherglen
> Priority: Trivial
> Attachments: hacked-deliterator.patch, LUCENE-1476.patch,
> LUCENE-1476.patch, LUCENE-1476.patch, LUCENE-1476.patch, LUCENE-1476.patch,
> quasi_iterator_deletions.diff, quasi_iterator_deletions_r2.diff,
> quasi_iterator_deletions_r3.diff, searchdeletes.alg, sortBench2.py,
> sortCollate2.py, TestDeletesDocIdSet.java
>
> Original Estimate: 12h
> Remaining Estimate: 12h
>
> Update BitVector to implement DocIdSet. Expose deleted docs DocIdSet from
> IndexReader.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]