Re: Effect of region size on compaction performance

2014-03-23 Thread Kevin O'dell
Hey David, What is your write pattern? If you are bulkloading the data into HBase this gives you the ability to add more regions and control your compactions. If not, a high number of regions as Vlad indicated can cause some weird issues. How many region servers do you have? What is the curre

RE: Effect of region size on compaction performance

2014-03-23 Thread Vladimir Rodionov
How small is small and how large is large? Recommended region size is usually between 5-10GB. Too small regions results in more frequent flushes/compactions and have additional overhead in RS RAM. >>I am thinking about extending TableInputFormat to override the >>1-map-per-region default policy a