[
https://issues.apache.org/jira/browse/LUCENE-1539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718907#action_12718907
]
Michael McCandless commented on LUCENE-1539:
--------------------------------------------
bq. When existing deletes are over 50%, we loop through termdocs instead.
OK good, except it deletes too aggressively when > 50% deletions are already
present (it uses nextBoolean(), which is true half the time). Can you change
that to target a certain deletion rate? I.e., if you need to delete 20%, then
do random.nextDouble() < 0.20 to decide each delete? But then I guess put a
floor on that rate so that it doesn't get too slow on the "tail"? It won't be
perfectly random when it hits that tail, but I think that's OK.
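
A rough sketch of what I mean, not taken from the patch. It uses a plain BitSet in place of the real reader and termdocs loop, and the names TargetedDeletes, deleteToTarget and MIN_RATE, as well as the 0.05 floor, are made up for illustration:

```java
import java.util.BitSet;
import java.util.Random;

class TargetedDeletes {
    // Hypothetical floor on the per-document delete probability, so the
    // last few deletes (the "tail") do not need many passes.
    static final double MIN_RATE = 0.05;

    /**
     * Sets bits in 'deleted' until exactly 'targetTotal' bits are set.
     * Assumes no bit at or beyond 'maxDoc' is set on entry.
     * Returns the number of passes made over the live documents.
     */
    static int deleteToTarget(BitSet deleted, int maxDoc, int targetTotal, Random random) {
        if (targetTotal > maxDoc) {
            throw new IllegalArgumentException("targetTotal exceeds maxDoc");
        }
        int passes = 0;
        while (deleted.cardinality() < targetTotal) {
            int alreadyDeleted = deleted.cardinality();
            int live = maxDoc - alreadyDeleted;         // > 0 because targetTotal <= maxDoc
            int needed = targetTotal - alreadyDeleted;  // deletes still to make
            // Target the rate actually required, but never go below the floor.
            double rate = Math.max(MIN_RATE, (double) needed / live);
            // Visit only live docs; stop as soon as the target is reached.
            for (int doc = deleted.nextClearBit(0);
                 doc < maxDoc && needed > 0;
                 doc = deleted.nextClearBit(doc + 1)) {
                if (random.nextDouble() < rate) {
                    deleted.set(doc);
                    needed--;
                }
            }
            passes++;
        }
        return passes;
    }
}
```

Because each pass stops once the target is hit, docs early in the index are slightly favored whenever the floor kicks in, which is the "not perfectly random on the tail" trade-off.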
> Improve Benchmark
> -----------------
>
> Key: LUCENE-1539
> URL: https://issues.apache.org/jira/browse/LUCENE-1539
> Project: Lucene - Java
> Issue Type: Improvement
> Components: contrib/benchmark
> Affects Versions: 2.4
> Reporter: Jason Rutherglen
> Assignee: Michael McCandless
> Priority: Minor
> Fix For: 2.9
>
> Attachments: LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch,
> LUCENE-1539.patch, LUCENE-1539.patch, LUCENE-1539.patch, sortBench2.py,
> sortCollate2.py
>
> Original Estimate: 336h
> Remaining Estimate: 336h
>
> Benchmark can be improved by incorporating recent suggestions posted
> on java-dev. M. McCandless' Python scripts that execute multiple
> rounds of tests can either be incorporated into the codebase or
> converted to Java.