[
https://issues.apache.org/jira/browse/ACCUMULO-251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180485#comment-13180485
]
David Medinets commented on ACCUMULO-251:
-----------------------------------------
>From K. Turner on the mailing list: The flushes after each insert are there
>for a specific purpose, to
ensure the data written with different seeds ends up in different
files. This is done to show that at scan time the bloom filter will
let you skip seeking 2 of 3 files.
Part of the confusion is that I was not reading the text of the README well
enough. I missed this part "To illustrate this two
identical tables were created using the following process." While I expected
the README to walk me through the steps so that I could replicate the results,
the purpose of the README is simply to report times.
I suggest the README be expanded into a step-by-step process to replicate the
results. If y'all agree, it can be a separate ticket.
> Add wording to README.bloom about reason for flushing.
> ------------------------------------------------------
>
> Key: ACCUMULO-251
> URL: https://issues.apache.org/jira/browse/ACCUMULO-251
> Project: Accumulo
> Issue Type: Improvement
> Components: docs
> Reporter: David Medinets
> Assignee: Adam Fuchs
> Priority: Trivial
>
> The README.bloom file says this:
> * Insert 1 million entries using RandomBatchWriter with a seed of 7
> * Flush the table using the shell
> * Insert 1 million entries using RandomBatchWriter with a seed of 8
> * Flush the table using the shell
> * Insert 1 million entries using RandomBatchWriter with a seed of 9
> * Flush the table using the shell
> However, no reasons are given for why three flushes are used instead of one.
> Please explain the reasons.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira