keith-turner commented on PR #3934:
URL: https://github.com/apache/accumulo/pull/3934#issuecomment-1800218982

   Adding to @cshannon  comment, we did come up with ways to work around the 
issues identified.  However, we came to the conclusion that making merge able 
to create a single tablet with more files than can fit in memory would cause 
problems for split, compaction, scan, etc.  Realized it was not worthwhile 
pursing these changes and decided it would be best to do the following.
   
    * Add a property that limits the amount files that would be merged into a 
single tablet.  For example if the limit was 10K and if a merge tried to run on 
1000 tablets each having 100 files then it would fail before making any 
metadata changes.  If the merge ran then it would create a single tablet with 
100K files.
    * Still allow merging a large number of tablets as long as there is not a 
large number of files.  For example merging 1 million tablets with a total of 
100 file across all tablets would still work.
   
   Going to close this PR, rework this code and open a new PR that supports the 
above.  
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to