Re: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread lars hofhansl
It starts with smaller regions and then ramps up the size as the table in question is growing in size. Makes make for a more even distribution of regions. Into how many region did you presplit your table? How many region servers do you have available in this cluster? Maybe that number was less

RE: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Vladimir Rodionov
Thanks, Ted. Are there any rationals behind IncreasingToUpperBoundRegionSplitPolicy ? Why is it better than ConstantSizeRegionSplitPolicy? Best regards, Vladimir Rodionov Principal Platform Engineer Carrier IQ, www.carrieriq.com e-mail: vrodio...@carrieriq.com _

Re: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Ted Yu
In 0.94 there're several split policies available: ./src/main/java/org/apache/hadoop/hbase/regionserver/ConstantSizeRegionSplitPolicy.java ./src/main/java/org/apache/hadoop/hbase/regionserver/DelimitedKeyPrefixRegionSplitPolicy.java ./src/main/java/org/apache/hadoop/hbase/regionserver/DisabledRegi

RE: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Vladimir Rodionov
Yes, I pre-split the table Out of 109 regions only 3 are empty (wrong assumption on key distribution), If we split region on 10GB we will have > 500GB in 109 regions - not 376GB I do not understand how does region splitting algorithm work. Best regards, Vladimir Rodionov Principal Platform Engine

[jira] [Created] (HBASE-9070) Properly clean up snapshots in tearDown() method of snapshot related tests

2013-07-28 Thread Ted Yu (JIRA)
Ted Yu created HBASE-9070: - Summary: Properly clean up snapshots in tearDown() method of snapshot related tests Key: HBASE-9070 URL: https://issues.apache.org/jira/browse/HBASE-9070 Project: HBase I

Re: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Jean-Marc Spaggiari
"Yes it works, of course." It's not working for me ;) so was not sure. It's normal to have regions under the half of the MAX_FILESIZE. When a regions is more than MAX_FILESIZE it's splitted in 2. So one can be more, and the other one can be less. I will say, average 5GB will have been a good valu

Re: [UPDATE] Finishing up 0.96 --> WAS Re: 0.95 and 0.96 remaining issues

2013-07-28 Thread Andrew Purtell
On Sun, Jul 28, 2013 at 4:01 PM, Stack wrote: > I just did a review of tags. They look like they need a good bit of > work yet. > Yes that patch I put up on RB is the one that was cluster tested and profiled. Let me work with Ram about getting the latest up there. We also needed to get a sense

RE: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Vladimir Rodionov
The final stats: Total HDFS size - 376GB #regions: 109 - avg. region size ~ 3.6GB Something is wrong here. I expected fewer regions. The regions get split at sizes much lower than hbase.hregion.max.filesize and/or MAX_FILESIZE. Best regards, Vladimir Rodionov Principal Platform Engineer Carri

Re: [UPDATE] Finishing up 0.96 --> WAS Re: 0.95 and 0.96 remaining issues

2013-07-28 Thread Stack
On Sun, Jul 28, 2013 at 11:14 AM, Andrew Purtell wrote: > On Sat, Jul 27, 2013 at 2:52 PM, Stack wrote: > > > Regards the big features that are racing to make the 0.96 cutoff -- > namely > > namespaces, tags, and serialization lib -- as I see it, Francis needs > > reviews if namespaces are to mak

RE: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Vladimir Rodionov
Yes it works, of course. Its in original post - ~ 10gB hbase.hregion.max.filesize 10737418240 hbase-site.xml Best regards, Vladimir Rodionov Principal Platform Engineer Carrier IQ, www.carrieriq.com e-mail: vrodio...@carrieriq.com From: Jean-Mar

[jira] [Reopened] (HBASE-7980) TestZKInterProcessReadWriteLock fails occasionally in QA test run

2013-07-28 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HBASE-7980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack reopened HBASE-7980: -- Let me reopen this one to keep account of failings. See here: https://builds.apache.org/job/HBase-TRUNK/4306/test

[jira] [Created] (HBASE-9069) estThriftServerCmdLine.testRunThriftServer[18] fails

2013-07-28 Thread stack (JIRA)
stack created HBASE-9069: Summary: estThriftServerCmdLine.testRunThriftServer[18] fails Key: HBASE-9069 URL: https://issues.apache.org/jira/browse/HBASE-9069 Project: HBase Issue Type: Bug

Re: MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Jean-Marc Spaggiari
Hi Vladimir, Is this link working for you? http://MASTERURL:60010/conf ? If yes, what do you have for hbase.hregion.max.filesize? To make sure the property below is considerered. For the table config, did you get it from the webui? JM 2013/7/28 Vladimir Rodionov > but all regions keep getting

MAX_FILESIZE and hbase.hregion.max.filesize are both 10Gb

2013-07-28 Thread Vladimir Rodionov
but all regions keep getting split at 1Gb I have 71 regions and 70GB of data in 'usertable' despite the fact that: table config is: {NAME => 'usertable', DEFERRED_LOG_FLUSH => 'true', MAX_FILESIZE => '100', FAMILIES => [{NAME => 'cf', BLOOMFILTER => 'ROWCOL', VERSIONS => '1', COMPRESSIO

Re: [DISCUSS] Making hadoop 2 the default precommit for trunk ones we get green builds

2013-07-28 Thread Ted Yu
I didn't find one. I opened HBASE-9068 and attached a patch there. Cheers On Sat, Jul 27, 2013 at 9:30 PM, Ted Yu wrote: > Is there a JIRA to switch precommit build for trunk to run tests against > hadoop 2 ? > > Cheers > > > On Thu, Apr 25, 2013 at 8:47 PM, Andrew Purtell wrote: > >> +1 to El

Re: [UPDATE] Finishing up 0.96 --> WAS Re: 0.95 and 0.96 remaining issues

2013-07-28 Thread Andrew Purtell
On Sat, Jul 27, 2013 at 2:52 PM, Stack wrote: > Regards the big features that are racing to make the 0.96 cutoff -- namely > namespaces, tags, and serialization lib -- as I see it, Francis needs > reviews if namespaces are to make it, tags ditto, and the serialization > libs are nice-to-have auxi

[jira] [Created] (HBASE-9068) Make hadoop 2 the default precommit for trunk

2013-07-28 Thread Ted Yu (JIRA)
Ted Yu created HBASE-9068: - Summary: Make hadoop 2 the default precommit for trunk Key: HBASE-9068 URL: https://issues.apache.org/jira/browse/HBASE-9068 Project: HBase Issue Type: Test Re