[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Liu Shaohui (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu Shaohui updated HBASE-10370:


Attachment: HBASE-10370-v2.diff

Update the log as [~zjushch]'s advice

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Attachments: HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Created] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Bing Jiang (JIRA)
Bing Jiang created HBASE-10374:
--

 Summary: add jemalloc into the choice of memstore allocation 
 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor


https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead introduced 
that Netty used jemalloc to gain benefits from GC.
It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Matteo Bertozzi (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874613#comment-13874613
 ] 

Matteo Bertozzi commented on HBASE-10349:
-

[~jxiang] can you explain why the region is still in META?
the patch looks good to me to avoid any assignment transition after delete,
but I still don't see anything in the balancer/assignment manager that can add 
back the region to META.

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874615#comment-13874615
 ] 

ramkrishna.s.vasudevan commented on HBASE-10370:


Patch looks good to me.  Did not spend time to check the root cause. 

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Attachments: HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Eric Charles (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874632#comment-13874632
 ] 

Eric Charles commented on HBASE-10030:
--

finally fails also via the maven command line if I clean before compiling
$ mvn -v
Apache Maven 3.0.5 (r01de14724cdef164cd33c7c8c2fe155faf9602da; 2013-02-19 
14:51:28+0100)
Maven home: /opt/apache-maven-3.0.5
Java version: 1.8.0-ea, vendor: Oracle Corporation
Java home: /opt/jdk1.8.0-b117/jre
Default locale: en_US, platform encoding: UTF-8
OS name: linux, version: 3.12.0-4-generic, arch: amd64, family: unix


 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10336) Remove deprecated usage of Hadoop HttpServer in InfoServer

2014-01-17 Thread Eric Charles (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Charles updated HBASE-10336:
-

Attachment: HBASE-10336-5.patch

You got it [~stack]. Upon that, please welcome additional config keys 
(hbase.http.filter.initializers and hbase.http.max.threads) and a separate 
webapp context attribute (CONF_CONTEXT_ATTRIBUTE = hbase.conf) which would 
allow mixed hadoop/hbase webapp to coexist in the same JVM (quite theorical, 
but could be useful for extended embedeed minicluster test cases).

The attached patch HBASE-10336-5.patch adds in hbase-default.xml the 2 new keys 
and pulls 2 more needed test classes from hadoop.

About the pom, yes I have thought about the exclusion/inclusion exercise, but 
would really prefer to perform this when submitting 
another patch via HBASE-6581. For now, I have quite some branches and would 
prefer rebasing on trunk.

And yes, if you commit to trunk, everything compiles/works as before on the 
various hadoop versions. So to me, you can commit HBASE-10336-5.patch.


 Remove deprecated usage of Hadoop HttpServer in InfoServer
 --

 Key: HBASE-10336
 URL: https://issues.apache.org/jira/browse/HBASE-10336
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.99.0
Reporter: Eric Charles
 Attachments: HBASE-10336-1.patch, HBASE-10336-2.patch, 
 HBASE-10336-3.patch, HBASE-10336-4.patch, HBASE-10336-5.patch


 Recent changes in Hadoop HttpServer give NPE when running on hadoop 
 3.0.0-SNAPSHOT. This way we use HttpServer is deprecated and will probably be 
 not fixed (see HDFS-5760). We'd better move to the new proposed builder 
 pattern, which means we can no more use inheritance to build our nice 
 InfoServer.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874689#comment-13874689
 ] 

Hadoop QA commented on HBASE-10370:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623612/HBASE-10370-v2.diff
  against trunk revision .
  ATTACHMENT ID: 12623612

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop1.1{color}.  The patch compiles against the hadoop 
1.1 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8459//console

This message is automatically generated.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Attachments: HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store 

[jira] [Commented] (HBASE-10333) Assignments are not retained on a cluster start

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874721#comment-13874721
 ] 

Hudson commented on HBASE-10333:


FAILURE: Integrated in hbase-0.96-hadoop2 #177 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/177/])
HBASE-10333 Assignments are not retained on a cluster start (jxiang: rev 
1558965)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java


 Assignments are not retained on a cluster start
 ---

 Key: HBASE-10333
 URL: https://issues.apache.org/jira/browse/HBASE-10333
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.96.1.1
Reporter: Devaraj Das
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: hbase-10333.patch


 When a cluster is fully shutdown and then started up again with 
 hbase.master.startup.retainassign set to true, I noticed that the assignments 
 are not retained. Upon digging, it seems like HBASE-10101 made a change due 
 to which the server holding the META previously is added to dead-servers (in 
 _HMaster.assignMeta_). Later on, this makes the AssignmentManager think that 
 the master recovered from a failure as opposed to a fresh cluster start (the 
 ServerManager.deadServers list is not empty in the check within 
 _AssignmentManager.processDeadServersAndRegionsInTransition_)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10289) Avoid random port usage by default JMX Server. Create Custome JMX server

2014-01-17 Thread nijel (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

nijel updated HBASE-10289:
--

Attachment: HBASE-10289.patch

Attached initial patch with the Custom JMX server.
Please have a look. If it is ok, i will change the main classes to start this 
JMX server.

 Avoid random port usage by default JMX Server. Create Custome JMX server
 

 Key: HBASE-10289
 URL: https://issues.apache.org/jira/browse/HBASE-10289
 Project: HBase
  Issue Type: Improvement
Reporter: nijel
Priority: Minor
 Attachments: HBASE-10289.patch


 If we enable JMX MBean server for HMaster or Region server  through VM 
 arguments, the process will use one random which we cannot configure.
 This can be a problem if that random port is configured for some other 
 service.
 This issue can be avoided by supporting  a custom JMX Server.
 The ports can be configured. If there is no ports configured, it will 
 continue the same way as now.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10366) 0.94 filterRow() may be skipped in 0.96(or onwards) code

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874869#comment-13874869
 ] 

Ted Yu commented on HBASE-10366:


The javadoc in Filter.java should be revised:
{code}
   * @return True if this filter actively uses filterRow(List) or filterRow().
   */
  abstract public boolean hasFilterRow();
{code}

 0.94 filterRow() may be skipped in 0.96(or onwards) code
 

 Key: HBASE-10366
 URL: https://issues.apache.org/jira/browse/HBASE-10366
 Project: HBase
  Issue Type: Bug
  Components: Filters
Reporter: Jeffrey Zhong
Assignee: Jeffrey Zhong
Priority: Critical
 Fix For: 0.98.0, 0.96.2

 Attachments: hbase-10366.patch


 HBASE-6429 combines both filterRow  filterRow(ListKeyValue kvs) functions 
 in Filter. 
 While 0.94 code or older, it may not implement hasFilterRow as HBase-6429 
 expected because 0.94(old) hasFilterRow only returns true when 
 filterRow(ListKeyValue kvs) is overridden not the filterRow(). Therefore, 
 the filterRow() will be skipped.
 Since we don't ask 0.94 users to update their existing filter code, the issue 
 will cause scan returns unexpected keyvalues and break the backward 
 compatibility.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10366) 0.94 filterRow() may be skipped in 0.96(or onwards) code

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874886#comment-13874886
 ] 

stack commented on HBASE-10366:
---

Echo what Andrew says for 0.96

 0.94 filterRow() may be skipped in 0.96(or onwards) code
 

 Key: HBASE-10366
 URL: https://issues.apache.org/jira/browse/HBASE-10366
 Project: HBase
  Issue Type: Bug
  Components: Filters
Reporter: Jeffrey Zhong
Assignee: Jeffrey Zhong
Priority: Critical
 Fix For: 0.98.0, 0.96.2

 Attachments: hbase-10366.patch


 HBASE-6429 combines both filterRow  filterRow(ListKeyValue kvs) functions 
 in Filter. 
 While 0.94 code or older, it may not implement hasFilterRow as HBase-6429 
 expected because 0.94(old) hasFilterRow only returns true when 
 filterRow(ListKeyValue kvs) is overridden not the filterRow(). Therefore, 
 the filterRow() will be skipped.
 Since we don't ask 0.94 users to update their existing filter code, the issue 
 will cause scan returns unexpected keyvalues and break the backward 
 compatibility.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10289) Avoid random port usage by default JMX Server. Create Custome JMX server

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874889#comment-13874889
 ] 

stack commented on HBASE-10289:
---

Tell me more [~nijel]  In hbase-env.sh in trunk you can specify ports for JMX 
(IIRC).  Or are you talking about fixing the RMI madness where it picks a 
random port to respond?  With this patch in place, can we remove the talk in 
hbase-env.sh of flags for starting JMX?  What happens if someone passes the 
flags?  Will we have two JMX servers running?

Thanks for doing this.



 Avoid random port usage by default JMX Server. Create Custome JMX server
 

 Key: HBASE-10289
 URL: https://issues.apache.org/jira/browse/HBASE-10289
 Project: HBase
  Issue Type: Improvement
Reporter: nijel
Priority: Minor
 Attachments: HBASE-10289.patch


 If we enable JMX MBean server for HMaster or Region server  through VM 
 arguments, the process will use one random which we cannot configure.
 This can be a problem if that random port is configured for some other 
 service.
 This issue can be avoided by supporting  a custom JMX Server.
 The ports can be configured. If there is no ports configured, it will 
 continue the same way as now.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-10349:


Status: Open  (was: Patch Available)

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874924#comment-13874924
 ] 

Jimmy Xiang commented on HBASE-10349:
-

The test is ok locally.  Let me take a look and post a new patch.

[~mbertozzi], it is a racing between table deletion and balancer.  The balancer 
operates on a copy of the region assignments. When the balancer tries to 
balance a region, the region has been deleted (including the meta entry), and 
the table is removed (including the znode).  In assigning the deleted region, 
the region server puts the region back in the meta again with just the server 
info. That's why the region info in the meta row is missing.

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10249) Intermittent TestReplicationSyncUpTool failure

2014-01-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874932#comment-13874932
 ] 

Lars Hofhansl commented on HBASE-10249:
---

Yeah, scary stuff, especially since we have multi enabled here. +1

Should change the title to something more descriptive since this in an actual 
bug in the replication code.

The condition here is always false, so removing has no effect, right?
{code}
  if (parent.equals(rsServerNameZnode)) {
LOG.warn(Won't lock because this is us, we're dead!);
return false;
  }
{code}


 Intermittent TestReplicationSyncUpTool failure
 --

 Key: HBASE-10249
 URL: https://issues.apache.org/jira/browse/HBASE-10249
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Jean-Daniel Cryans
 Fix For: 0.98.0, 0.96.2, 0.99.0, 0.94.17

 Attachments: HBASE-10249-0.94-v0.patch, HBASE-10249-0.94-v1.patch, 
 HBASE-10249-trunk-v0.patch, HBASE-10249-trunk-v1.patch


 New issue to keep track of this.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Jonathan Hsieh (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Hsieh updated HBASE-10123:
---

Attachment: hbase-10123.v4.patch

v4 is what I'm committing to trunk only.

 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.98.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Jonathan Hsieh (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Hsieh updated HBASE-10123:
---

Release Note: 
After 0.98 hbase's default ports have changed to be outside of the ephemeral 
port range:.

hbase.master.port :  6 - 16000
hbase.master.info.port (http): 60010 - 16010
hbase.regionserver.port : 60020 - 16020
hbase.regionserver.info.port (http): 60030 - 16030
hbase.status.multicast.port : 60100 - 16100

HBase up to and including 0.98.x chose default tcp ports in the 600xx port 
range which conflicts with the standard unix/linux ephmeral port range of 
49152-65535 or 32768-61000.  This meant if any other processes chose to bind to 
a os specified port it could bind to the ports that hbase had as default.  If 
hbase processes were restarted some hbase nodes would fail to start.  The 
probability of this happening increases as the number of nodes in the cluster 
increases.

Hadoop Flags: Incompatible change,Reviewed

 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.98.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13874988#comment-13874988
 ] 

Jonathan Hsieh commented on HBASE-10123:


Commited to trunk.  Thanks for revies.  I'vd done minimal doc updates and added 
release notes. lmk if you guys think I should do more.

 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.98.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Jonathan Hsieh (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Hsieh updated HBASE-10123:
---

Release Note: 
After 0.98 hbase's default ports have changed to be outside of the ephemeral 
port range:.

hbase.master.port :  6 - 16000
hbase.master.info.port (http): 60010 - 16010
hbase.regionserver.port : 60020 - 16020
hbase.regionserver.info.port (http): 60030 - 16030
hbase.status.multicast.port : 60100 - 16100


To set the old defaults you can add this to your hbase-site.xml:

 property 
namehbase.master.port/name
value6/value
  /property
  property
namehbase.master.info.port/name
value60010/value
  /property
  property
namehbase.regionserver.port/name
value60020/value
  /property
  property
namehbase.regionserver.info.port/name
value60030/value
  /property
  

HBase up to and including 0.98.x chose default tcp ports in the 600xx port 
range which conflicts with the standard unix/linux ephmeral port range of 
49152-65535 or 32768-61000.  This meant if any other processes chose to bind to 
a os specified port it could bind to the ports that hbase had as default.  If 
hbase processes were restarted some hbase nodes would fail to start.  The 
probability of this happening increases as the number of nodes in the cluster 
increases.


  was:
After 0.98 hbase's default ports have changed to be outside of the ephemeral 
port range:.

hbase.master.port :  6 - 16000
hbase.master.info.port (http): 60010 - 16010
hbase.regionserver.port : 60020 - 16020
hbase.regionserver.info.port (http): 60030 - 16030
hbase.status.multicast.port : 60100 - 16100

HBase up to and including 0.98.x chose default tcp ports in the 600xx port 
range which conflicts with the standard unix/linux ephmeral port range of 
49152-65535 or 32768-61000.  This meant if any other processes chose to bind to 
a os specified port it could bind to the ports that hbase had as default.  If 
hbase processes were restarted some hbase nodes would fail to start.  The 
probability of this happening increases as the number of nodes in the cluster 
increases.



 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.98.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10289) Avoid random port usage by default JMX Server. Create Custome JMX server

2014-01-17 Thread nijel (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875000#comment-13875000
 ] 

nijel commented on HBASE-10289:
---

can we remove the talk in hbase-env.sh of flags for starting JMX?

 Avoid random port usage by default JMX Server. Create Custome JMX server
 

 Key: HBASE-10289
 URL: https://issues.apache.org/jira/browse/HBASE-10289
 Project: HBase
  Issue Type: Improvement
Reporter: nijel
Priority: Minor
 Attachments: HBASE-10289.patch


 If we enable JMX MBean server for HMaster or Region server  through VM 
 arguments, the process will use one random which we cannot configure.
 This can be a problem if that random port is configured for some other 
 service.
 This issue can be avoided by supporting  a custom JMX Server.
 The ports can be configured. If there is no ports configured, it will 
 continue the same way as now.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Jonathan Hsieh (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Hsieh updated HBASE-10123:
---

Fix Version/s: (was: 0.98.0)
   0.99.0

 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.99.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Created] (HBASE-10375) hbase-default.xml hbase.status.multicast.address.port does not match code

2014-01-17 Thread Jonathan Hsieh (JIRA)
Jonathan Hsieh created HBASE-10375:
--

 Summary: hbase-default.xml hbase.status.multicast.address.port 
does not match code
 Key: HBASE-10375
 URL: https://issues.apache.org/jira/browse/HBASE-10375
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1, 0.98.0, 0.99.0
Reporter: Jonathan Hsieh


In hbase-default.xml
{code}
+  property
+namehbase.status.multicast.address.port/name
+value6100/value
+description
+  Multicast port to use for the status publication by multicast.
+/description
+  /property
{code}

In HConstants it was 60100.
{code}
  public static final String STATUS_MULTICAST_PORT = 
hbase.status.multicast.port;
  public static final int DEFAULT_STATUS_MULTICAST_PORT = 16100;
{code}

(it was 60100 in the code for 0.96 and 0.98.)

I lean towards going with the code as opposed to the config file.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-10349:


Attachment: hbase-10349_v2.patch

Attached v2 that fixed the test failures.

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-10349_v2.patch, 
 hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875022#comment-13875022
 ] 

stack commented on HBASE-10374:
---

[~jianbginglover] What you thinking?  We pull in netty4, redo our core on top 
of it (smile), then as a side benefit, use their jemalloc implementation for 
block cache?  Poking around, I don't see it here 
http://netty.io/4.0/api/index.html  Thanks.

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Vandana Ayyalasomayajula (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875021#comment-13875021
 ] 

Vandana Ayyalasomayajula commented on HBASE-10346:
--

[~ndimiduk] Could you review this patch ? Thanks!

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-10349:


Status: Patch Available  (was: Open)

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-10349_v2.patch, 
 hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10289) Avoid random port usage by default JMX Server. Create Custome JMX server

2014-01-17 Thread nijel (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875024#comment-13875024
 ] 

nijel commented on HBASE-10289:
---

pls ignore previous comment.. 

bq. Tell me more nijel In hbase-env.sh in trunk you can specify ports for JMX 
(IIRC).
- Even if we specify the port one of the port will be random. 

bq. can we remove the talk in hbase-env.sh of flags for starting JMX? 
- Yes. After this patch we need not specify the flags in hbase-env.sh

bq. What happens if someone passes the flags? Will we have two JMX servers 
running?
 - Yes, 2 JMX servers will be running. In both the servers beans will be 
registered.

The idea behind this patch is to give a option to user to control the ports 
used by JMX server. 

 Avoid random port usage by default JMX Server. Create Custome JMX server
 

 Key: HBASE-10289
 URL: https://issues.apache.org/jira/browse/HBASE-10289
 Project: HBase
  Issue Type: Improvement
Reporter: nijel
Priority: Minor
 Attachments: HBASE-10289.patch


 If we enable JMX MBean server for HMaster or Region server  through VM 
 arguments, the process will use one random which we cannot configure.
 This can be a problem if that random port is configured for some other 
 service.
 This issue can be avoided by supporting  a custom JMX Server.
 The ports can be configured. If there is no ports configured, it will 
 continue the same way as now.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-8410) Basic quota support for namespaces

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875038#comment-13875038
 ] 

Ted Yu commented on HBASE-8410:
---

From 
https://builds.apache.org/job/PreCommit-HBASE-Build/8456//testReport/org.apache.hadoop.hbase.namespace/TestNamespaceAuditor/testRegionOperations/
 :
{code}
2014-01-17 05:27:16,370 INFO  [pool-1-thread-1] 
namespace.TestNamespaceAuditor(163): 
org.apache.hadoop.hbase.DoNotRetryIOException: 
org.apache.hadoop.hbase.DoNotRetryIOException: The table 
TestNamespaceAuditor_regiontest:table1 is not allowed to have 7 regions. The 
total number of regions permitted is only 6, while current region count is 0. 
This may be transient, please retry later if there are any ongoing split 
operations in the namespace.
at 
org.apache.hadoop.hbase.namespace.NamespaceAuditor.checkNamespaceTableCount(NamespaceAuditor.java:322)
at 
org.apache.hadoop.hbase.namespace.NamespaceAuditor.preCreateTable(NamespaceAuditor.java:82)
at 
org.apache.hadoop.hbase.master.MasterCoprocessorHost.preCreateTable(MasterCoprocessorHost.java:235)
at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1733)
at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1766)
at 
org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:38221)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2008)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:92)
at 
org.apache.hadoop.hbase.ipc.FifoRpcScheduler$1.run(FifoRpcScheduler.java:73)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)

2014-01-17 05:27:16,371 DEBUG [pool-1-thread-1-EventThread] 
hbase.ZKNamespaceStateManager(205): Updating namespace state cache from node 
path /hbase/namespace-state/TestNamespaceAuditor_regiontest with data: 
NamespaceTableAndRegionInfo [name=TestNamespaceAuditor_regiontest, 
tableRegionInfoMap={}]
{code}
ZKNamespaceStateManager received notification 1 millisecond later than the 
receipt of DoNotRetryIOException

Please handle this scenario accordingly.

 Basic quota support for namespaces
 --

 Key: HBASE-8410
 URL: https://issues.apache.org/jira/browse/HBASE-8410
 Project: HBase
  Issue Type: Sub-task
Reporter: Francis Liu
Assignee: Vandana Ayyalasomayajula
 Attachments: HBASE-8410_trunk_10.patch, HBASE-8410_trunk_10.patch, 
 HBASE-8410_trunk_11.patch, HBASE-8410_trunk_12.patch, 
 HBASE-8410_trunk_2.patch, HBASE-8410_trunk_3.patch, HBASE-8410_trunk_4.patch, 
 HBASE-8410_trunk_4.patch, HBASE-8410_trunk_5.patch, HBASE-8410_trunk_6.patch, 
 HBASE-8410_trunk_7.patch, HBASE-8410_trunk_8.patch, HBASE-8410_trunk_9.patch, 
 HBASE_8410.patch, HBASE_8410_1_trunk.patch


 This task involves creating an observer which provides basic quota support to 
 namespaces in terms of (1) number of tables and (2) number of regions. The 
 quota support can be enabled by setting:
 property
 namehbase.coprocessor.region.classes/name
 valueorg.apache.hadoop.hbase.namespace.NamespaceController/value
 /property
 property
 namehbase.coprocessor.master.classes/name
 valueorg.apache.hadoop.hbase.namespace.NamespaceController/value
 /property
 in the hbase-site.xml.
 To add quotas to namespace, while creating namespace properties need to be 
 added.
 Examples:
 1. namespace_create 'ns1', {'hbase.namespace.quota.maxregion'='10'}
 2. 1. namespace_create 'ns2', {'hbase.namespace.quota.maxtables'='2'}, 
 {'hbase.namespace.quota.maxregion'='5'}
 The quotas can be modified/added to namespace at any point of time. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-9360) Enable 0.94 - 0.96 replication to minimize upgrade down time

2014-01-17 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875043#comment-13875043
 ] 

Jeffrey Zhong commented on HBASE-9360:
--

Good suggestions. Let me add a paragraph on this in the ref guide in upgrade  
replication.

 Enable 0.94 - 0.96 replication to minimize upgrade down time
 -

 Key: HBASE-9360
 URL: https://issues.apache.org/jira/browse/HBASE-9360
 Project: HBase
  Issue Type: Brainstorming
  Components: migration
Affects Versions: 0.98.0, 0.96.0
Reporter: Jeffrey Zhong

 As we know 0.96 is a singularity release, as of today a 0.94 hbase user has 
 to do in-place upgrade: make corresponding client changes, recompile client 
 application code, fully shut down existing 0.94 hbase cluster, deploy 0.96 
 binary, run upgrade script and then start the upgraded cluster. You can image 
 the down time will be extended if something is wrong in between. 
 To minimize the down time, another possible way is to setup a secondary 0.96 
 cluster and then setup replication between the existing 0.94 cluster and the 
 new 0.96 slave cluster. Once the 0.96 cluster is synced, a user can switch 
 the traffic to the 0.96 cluster and decommission the old one.
 The ideal steps will be:
 1) Setup a 0.96 cluster
 2) Setup replication between a running 0.94 cluster to the newly created 0.96 
 cluster
 3) Wait till they're in sync in replication
 4) Starts duplicated writes to both 0.94 and 0.96 clusters(could stop 
 relocation now)
 5) Forward read traffic to the slave 0.96 cluster
 6) After a certain period, stop writes to the original 0.94 cluster if 
 everything is good and completes upgrade
 To get us there, there are two tasks:
 1) Enable replication from 0.94 - 0.96
 I've run the idea with [~jdcryans], [~devaraj] and [~ndimiduk]. Currently it 
 seems the best approach is to build a very similar service or on top of 
 https://github.com/NGDATA/hbase-indexer/tree/master/hbase-sep with support 
 three commands replicateLogEntries, multi and delete. Inside the three 
 commands, we just pass down the corresponding requests to the destination 
 0.96 cluster as a bridge. The reason to support the multi and delete is for 
 CopyTable to copy data from a 0.94 cluster to a 0.96 one.
 The other approach is to provide limited support of 0.94 RPC protocol in 
 0.96. While an issue on this is that a 0.94 client needs to talk to zookeeper 
 firstly before it can connect to a 0.96 region server. Therefore, we need a 
 faked Zookeeper setup in front of a 0.96 cluster for a 0.94 client to 
 connect. It may also pollute 0.96 code base with 0.94 RPC code.
 2) To support writes to a 0.96 cluster and a 0.94 at the same time, we need 
 to load both hbase clients into one single JVM using different class loader.
 Let me know if you think this is worth to do and any better approach we could 
 take.
 Thanks!



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10289) Avoid random port usage by default JMX Server. Create Custome JMX server

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875053#comment-13875053
 ] 

stack commented on HBASE-10289:
---

bq. The idea behind this patch is to give a option to user to control the ports 
used by JMX server.

Great.

These defines are only used in one place so I'd move them there rather than add 
them to HConstants:

+  /** Configuration key for rmi registry port */
+  public static final String RMI_REGISTRY_PORT = .rmi.registry.port;
+
+  /** Configuration key for rmi connector port */
+  public static final String RMI_CONNECTOR_PORT = .rmi.connector.port;
+
+  /** Configuration key for rmi registry ip */
+  public static final String RMI_REGISTRY_IP = .rmi.registry.ip;
+
+  /** Configuration key for rmi connector name */
+  public static final String RMI_REGISTRY_CONNECTORNAME = 
.rmi.registry.connectorname;
+
+  /** Default rmi registry ip value */
+  public static final String RMI_REGISTRY_IP_DEFAULT = 127.0.0.1;


JMXServer is missing an apache license (see adjacent .java files for how).  I 
can add the audience annotation on commit.

Class could do with a comment -- a synopsis of the comments you made above 
explaining this feature to me would do nicely.

How does your server get started and shutdown?  It should be embedded in Master 
and RegionServer?

 Avoid random port usage by default JMX Server. Create Custome JMX server
 

 Key: HBASE-10289
 URL: https://issues.apache.org/jira/browse/HBASE-10289
 Project: HBase
  Issue Type: Improvement
Reporter: nijel
Priority: Minor
 Attachments: HBASE-10289.patch


 If we enable JMX MBean server for HMaster or Region server  through VM 
 arguments, the process will use one random which we cannot configure.
 This can be a problem if that random port is configured for some other 
 service.
 This issue can be avoided by supporting  a custom JMX Server.
 The ports can be configured. If there is no ports configured, it will 
 continue the same way as now.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-10370:
---

Affects Version/s: 0.98.0

+1 for 0.98

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Attachments: HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10322) Strip tags from KV while sending back to client on reads

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875068#comment-13875068
 ] 

Andrew Purtell commented on HBASE-10322:


Agree.

The goal is to prevent sensitive tags from going to clients who are not 
supposed to see them. This is blocking the 0.98 RC.

Checking on a per cell basis is going to hurt performance. Massaging cell data 
on a per cell basis e.g. copying, will kill performance. We need some global 
decision per connection.

Earlier we explored per-connection negotiation ideas on another JIRA but didn't 
come to a satisfactory resolution. 

Now we want to do the simplest thing possible. There is no need to handle tags 
in cell serialization for RPC. (Except! Replication! - Thanks [~anoop.hbase].) 
Cell ACLs and visibility expressions are shipped server side in operation 
attributes. Tag persistence with HFile v3 is all set. Tag persistence in the 
WAL uses WAL codecs which are only applied server side. 

We need an answer for replication though. My thinking is since we set up RPC 
for replication specially in the sink and source code, and replication is a 
server to server thing - or at least we can say replication is privileged - 
it should be ok to add a tag capable codec for replication, but have it not be 
the default. We can tell users that replication will be compatible between 0.96 
and 0.98 as long as you don't use cell tags. If you do start using the 0.98 
features which require cell tags, then your replication endpoints must all be 
upgrade to 0.98 first, and you must change a configuration setting. 

 Strip tags from KV while sending back to client on reads
 

 Key: HBASE-10322
 URL: https://issues.apache.org/jira/browse/HBASE-10322
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
Priority: Blocker
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10322.patch, HBASE-10322_V2.patch


 Right now we have some inconsistency wrt sending back tags on read. We do 
 this in scan when using Java client(Codec based cell block encoding). But 
 during a Get operation or when a pure PB based Scan comes we are not sending 
 back the tags.  So any of the below fix we have to do
 1. Send back tags in missing cases also. But sending back visibility 
 expression/ cell ACL is not correct.
 2. Don't send back tags in any case. This will a problem when a tool like 
 ExportTool use the scan to export the table data. We will miss exporting the 
 cell visibility/ACL.
 3. Send back tags based on some condition. It has to be per scan basis. 
 Simplest way is pass some kind of attribute in Scan which says whether to 
 send back tags or not. But believing some thing what scan specifies might not 
 be correct IMO. Then comes the way of checking the user who is doing the 
 scan. When a HBase super user doing the scan then only send back tags. So 
 when a case comes like Export Tool's the execution should happen from a super 
 user.
 So IMO we should go with #3.
 Patch coming soon.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10322) Strip tags from KV while sending back to client on reads

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875069#comment-13875069
 ] 

Andrew Purtell commented on HBASE-10322:


Ping [~stack], [~jdcryans], [~ram_krish], see ^^^

 Strip tags from KV while sending back to client on reads
 

 Key: HBASE-10322
 URL: https://issues.apache.org/jira/browse/HBASE-10322
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
Priority: Blocker
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10322.patch, HBASE-10322_V2.patch


 Right now we have some inconsistency wrt sending back tags on read. We do 
 this in scan when using Java client(Codec based cell block encoding). But 
 during a Get operation or when a pure PB based Scan comes we are not sending 
 back the tags.  So any of the below fix we have to do
 1. Send back tags in missing cases also. But sending back visibility 
 expression/ cell ACL is not correct.
 2. Don't send back tags in any case. This will a problem when a tool like 
 ExportTool use the scan to export the table data. We will miss exporting the 
 cell visibility/ACL.
 3. Send back tags based on some condition. It has to be per scan basis. 
 Simplest way is pass some kind of attribute in Scan which says whether to 
 send back tags or not. But believing some thing what scan specifies might not 
 be correct IMO. Then comes the way of checking the user who is doing the 
 scan. When a HBase super user doing the scan then only send back tags. So 
 when a case comes like Export Tool's the execution should happen from a super 
 user.
 So IMO we should go with #3.
 Patch coming soon.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10370:
---

Attachment: 10370-v3.patch

Patch v3 closes the region and table in 
TestSplitTransactionOnCluster#testSplitFailedCompactionAndSplit

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Attachments: 10370-v3.patch, HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10370:
---

Fix Version/s: 0.99.0
   0.98.0
 Hadoop Flags: Reviewed

Integrated to 0.98 and trunk.

Thanks for the patch, Shaohui.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.99.0

 Attachments: 10370-v3.patch, HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875108#comment-13875108
 ] 

Hadoop QA commented on HBASE-10123:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623685/hbase-10123.v4.patch
  against trunk revision .
  ATTACHMENT ID: 12623685

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop1.1{color}.  The patch compiles against the hadoop 
1.1 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8460//console

This message is automatically generated.

 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.99.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875118#comment-13875118
 ] 

Hadoop QA commented on HBASE-10370:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623704/10370-v3.patch
  against trunk revision .
  ATTACHMENT ID: 12623704

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8462//console

This message is automatically generated.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-10370:
--

Attachment: 10370v2.096.txt

What I applied to 0.96 (no unit test -- it failed apply).  Thanks [~liushaohui]

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-10370:
--

Fix Version/s: 0.96.2

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10375) hbase-default.xml hbase.status.multicast.address.port does not match code

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875126#comment-13875126
 ] 

Andrew Purtell commented on HBASE-10375:


60100 or 16100 ? 

 hbase-default.xml hbase.status.multicast.address.port does not match code
 -

 Key: HBASE-10375
 URL: https://issues.apache.org/jira/browse/HBASE-10375
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.99.0, 0.96.1.1
Reporter: Jonathan Hsieh

 In hbase-default.xml
 {code}
 +  property
 +namehbase.status.multicast.address.port/name
 +value6100/value
 +description
 +  Multicast port to use for the status publication by multicast.
 +/description
 +  /property
 {code}
 In HConstants it was 60100.
 {code}
   public static final String STATUS_MULTICAST_PORT = 
 hbase.status.multicast.port;
   public static final int DEFAULT_STATUS_MULTICAST_PORT = 16100;
 {code}
 (it was 60100 in the code for 0.96 and 0.98.)
 I lean towards going with the code as opposed to the config file.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10156) Fix up the HBASE-8755 slowdown when low contention

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875129#comment-13875129
 ] 

stack commented on HBASE-10156:
---

Looking for a review please.  Does some cleanup of HLog, gets us back some of 
the perf we lost when we moved to new thread model (hbase-8755) when low 
contention.  As side benefit, it improves perf when high contention.  Downside 
is refactor of critical FSHLog class.  Am testing on cluster over here meantime.

 Fix up the HBASE-8755 slowdown when low contention
 --

 Key: HBASE-10156
 URL: https://issues.apache.org/jira/browse/HBASE-10156
 Project: HBase
  Issue Type: Sub-task
  Components: wal
Reporter: stack
Assignee: stack
 Attachments: 10156.txt, 10156v10.txt, 10156v11.txt, 10156v12.txt, 
 10156v12.txt, 10156v13.txt, 10156v16.txt, 10156v17.txt, 10156v18.txt, 
 10156v2.txt, 10156v3.txt, 10156v4.txt, 10156v5.txt, 10156v6.txt, 10156v7.txt, 
 10156v9.txt, Disrupting.java


 HBASE-8755 slows our writes when only a few clients.  Fix.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10375) hbase-default.xml hbase.status.multicast.address.port does not match code

2014-01-17 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875131#comment-13875131
 ] 

Jonathan Hsieh commented on HBASE-10375:


I committed HBASE-10123 to trunk only so i believe:

* all should say hbase.status.multicast.port instead of 
hbase.status.multicast.address.port
* 0.96/0.98 should be 60100
* trunk should be 16100

We should get [~nkeywal] to take a quick look to verify this.

 hbase-default.xml hbase.status.multicast.address.port does not match code
 -

 Key: HBASE-10375
 URL: https://issues.apache.org/jira/browse/HBASE-10375
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.99.0, 0.96.1.1
Reporter: Jonathan Hsieh

 In hbase-default.xml
 {code}
 +  property
 +namehbase.status.multicast.address.port/name
 +value6100/value
 +description
 +  Multicast port to use for the status publication by multicast.
 +/description
 +  /property
 {code}
 In HConstants it was 60100.
 {code}
   public static final String STATUS_MULTICAST_PORT = 
 hbase.status.multicast.port;
   public static final int DEFAULT_STATUS_MULTICAST_PORT = 16100;
 {code}
 (it was 60100 in the code for 0.96 and 0.98.)
 I lean towards going with the code as opposed to the config file.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875132#comment-13875132
 ] 

Andrew Purtell commented on HBASE-10030:


bq. Java version: 1.8.0-ea, vendor: Oracle Corporation

Ok, I am sorry, I have not adequately explained what we did here. We deprecated 
the old method but did not remove it. Therefore at some point we can kill the 
conflicting method, I guess we can do this in trunk now that 0.98 is branched, 
and *then* we can compile with JDK 8. 

 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10322) Strip tags from KV while sending back to client on reads

2014-01-17 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875139#comment-13875139
 ] 

stack commented on HBASE-10322:
---

Sounds reasonable Andrew, what you say about replication ([~jdcryans] is not 
around -- he is off in exotic locations again!).

Is there a patch to review yet?

 Strip tags from KV while sending back to client on reads
 

 Key: HBASE-10322
 URL: https://issues.apache.org/jira/browse/HBASE-10322
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
Priority: Blocker
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10322.patch, HBASE-10322_V2.patch


 Right now we have some inconsistency wrt sending back tags on read. We do 
 this in scan when using Java client(Codec based cell block encoding). But 
 during a Get operation or when a pure PB based Scan comes we are not sending 
 back the tags.  So any of the below fix we have to do
 1. Send back tags in missing cases also. But sending back visibility 
 expression/ cell ACL is not correct.
 2. Don't send back tags in any case. This will a problem when a tool like 
 ExportTool use the scan to export the table data. We will miss exporting the 
 cell visibility/ACL.
 3. Send back tags based on some condition. It has to be per scan basis. 
 Simplest way is pass some kind of attribute in Scan which says whether to 
 send back tags or not. But believing some thing what scan specifies might not 
 be correct IMO. Then comes the way of checking the user who is doing the 
 scan. When a HBase super user doing the scan then only send back tags. So 
 when a case comes like Export Tool's the execution should happen from a super 
 user.
 So IMO we should go with #3.
 Patch coming soon.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Comment Edited] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875132#comment-13875132
 ] 

Andrew Purtell edited comment on HBASE-10030 at 1/17/14 7:18 PM:
-

bq. Java version: 1.8.0-ea, vendor: Oracle Corporation

Ok, I am sorry, I have not adequately explained what we did here. We deprecated 
the old method but did not remove it. Therefore at some point we can kill the 
conflicting method, I guess we can do this in trunk now that 0.98 is branched, 
and *then* we can compile with JDK 8. 
Edit: Also, above, I forgot I have this removal applied privately to the branch 
I use for JDK 8 stuff. Sorry again.


was (Author: apurtell):
bq. Java version: 1.8.0-ea, vendor: Oracle Corporation

Ok, I am sorry, I have not adequately explained what we did here. We deprecated 
the old method but did not remove it. Therefore at some point we can kill the 
conflicting method, I guess we can do this in trunk now that 0.98 is branched, 
and *then* we can compile with JDK 8. 

 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875137#comment-13875137
 ] 

Andrew Purtell commented on HBASE-10030:


An alternative for trunk is we can just kill PoolMap as [~lhofhansl] says 
above. 

 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10322) Strip tags from KV while sending back to client on reads

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875141#comment-13875141
 ] 

Andrew Purtell commented on HBASE-10322:


bq. Is there a patch to review yet?

There will be something shortly after we agree this is the way to go. Sounds 
like it. Soon, then.

 Strip tags from KV while sending back to client on reads
 

 Key: HBASE-10322
 URL: https://issues.apache.org/jira/browse/HBASE-10322
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
Priority: Blocker
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10322.patch, HBASE-10322_V2.patch


 Right now we have some inconsistency wrt sending back tags on read. We do 
 this in scan when using Java client(Codec based cell block encoding). But 
 during a Get operation or when a pure PB based Scan comes we are not sending 
 back the tags.  So any of the below fix we have to do
 1. Send back tags in missing cases also. But sending back visibility 
 expression/ cell ACL is not correct.
 2. Don't send back tags in any case. This will a problem when a tool like 
 ExportTool use the scan to export the table data. We will miss exporting the 
 cell visibility/ACL.
 3. Send back tags based on some condition. It has to be per scan basis. 
 Simplest way is pass some kind of attribute in Scan which says whether to 
 send back tags or not. But believing some thing what scan specifies might not 
 be correct IMO. Then comes the way of checking the user who is doing the 
 scan. When a HBase super user doing the scan then only send back tags. So 
 when a case comes like Export Tool's the execution should happen from a super 
 user.
 So IMO we should go with #3.
 Patch coming soon.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875142#comment-13875142
 ] 

Hadoop QA commented on HBASE-10370:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623710/10370v2.096.txt
  against trunk revision .
  ATTACHMENT ID: 12623710

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8463//console

This message is automatically generated.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Eric Charles (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875151#comment-13875151
 ] 

Eric Charles commented on HBASE-10030:
--

HBASE-10327.patch in HBASE-10327 is the removal of that deprecated method which 
is not used in trunk.
I think you could apply HBASE-10327.patch and it would be ok
(Would be good to compile with jdk8 because hbase runs fine with jdk8)


 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10327) Remove remove(K, V) from type PoolMapK,V

2014-01-17 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-10327:
---

Summary: Remove remove(K, V) from type PoolMapK,V  (was: remove(K, V) of 
type PoolMapK,V has the same erasure)

 Remove remove(K, V) from type PoolMapK,V
 --

 Key: HBASE-10327
 URL: https://issues.apache.org/jira/browse/HBASE-10327
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.99.0
Reporter: Eric Charles
Assignee: Eric Charles
 Attachments: HBASE-10327.patch


 I keep getting red cross in my eclipse, whatever the jdk (jdk6, jdk7, jdk8)
 Name clash: The method remove(K, V) of type PoolMapK,V has the same erasure 
 as remove(Object, Object) of type MapK,V but does not override it
 maybe related to HBASE-10030
 The solution I have is simply removing the deprecated method, and everything 
 is fine. I am not sure of the backwards compatibility here.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10030) [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875153#comment-13875153
 ] 

Andrew Purtell commented on HBASE-10030:


bq. I think you could apply HBASE-10327.patch and it would be ok

Thanks for the reminder. Let's repurpose that one.

 [JDK8] Erasure of PoolMap#remove(K,V) conflicts with superclass method
 --

 Key: HBASE-10030
 URL: https://issues.apache.org/jira/browse/HBASE-10030
 Project: HBase
  Issue Type: Bug
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 10030.patch


 On JDK 8, the erasure of PoolMap#remove(K,V) conflicts with superclass method 
 remove(Object,Object).



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Reopened] (HBASE-10327) remove(K, V) of type PoolMapK,V has the same erasure

2014-01-17 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell reopened HBASE-10327:


  Assignee: Eric Charles

 remove(K, V) of type PoolMapK,V has the same erasure
 --

 Key: HBASE-10327
 URL: https://issues.apache.org/jira/browse/HBASE-10327
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.99.0
Reporter: Eric Charles
Assignee: Eric Charles
 Attachments: HBASE-10327.patch


 I keep getting red cross in my eclipse, whatever the jdk (jdk6, jdk7, jdk8)
 Name clash: The method remove(K, V) of type PoolMapK,V has the same erasure 
 as remove(Object, Object) of type MapK,V but does not override it
 maybe related to HBASE-10030
 The solution I have is simply removing the deprecated method, and everything 
 is fine. I am not sure of the backwards compatibility here.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10327) Remove remove(K, V) from type PoolMapK,V

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875159#comment-13875159
 ] 

Andrew Purtell commented on HBASE-10327:


We deprecated the offending method with HBASE-10030, which will be in 0.98, and 
now need to remove it on trunk. Will do so by committing this patch to trunk 
shortly unless objection.

 Remove remove(K, V) from type PoolMapK,V
 --

 Key: HBASE-10327
 URL: https://issues.apache.org/jira/browse/HBASE-10327
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.99.0
Reporter: Eric Charles
Assignee: Eric Charles
 Attachments: HBASE-10327.patch


 I keep getting red cross in my eclipse, whatever the jdk (jdk6, jdk7, jdk8)
 Name clash: The method remove(K, V) of type PoolMapK,V has the same erasure 
 as remove(Object, Object) of type MapK,V but does not override it
 maybe related to HBASE-10030
 The solution I have is simply removing the deprecated method, and everything 
 is fine. I am not sure of the backwards compatibility here.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Vandana Ayyalasomayajula (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vandana Ayyalasomayajula updated HBASE-10346:
-

Attachment: HBASE-10346.1.patch

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10372) TestHBaseFsck#testSplitDaughtersNotInMeta often fails

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875163#comment-13875163
 ] 

Andrew Purtell commented on HBASE-10372:


This is only happening on builds.apache.org ? Not saying it's not a problem, 
just that I haven't seen this locally on three boxes running the unit tests 
continuously. 

 TestHBaseFsck#testSplitDaughtersNotInMeta often fails
 -

 Key: HBASE-10372
 URL: https://issues.apache.org/jira/browse/HBASE-10372
 Project: HBase
  Issue Type: Test
Reporter: Ted Yu
 Attachments: 10372-testSplitDaughtersNotInMeta-output.html


 This test fails quite often.
 Here are some examples:
 https://builds.apache.org/job/HBase-0.98/88/
 https://builds.apache.org/job/HBase-0.98/89/
 This assertion fails:
 {code}
   assertNotNull(HRegionInfo.getHRegionInfo(result));
 {code}
 the split hbase:meta entry didn't have corresponding HRegionInfo.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10372) TestHBaseFsck#testSplitDaughtersNotInMeta often fails

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875164#comment-13875164
 ] 

Andrew Purtell commented on HBASE-10372:


These days I'm inclined to close any issue opened from a builds.apache.org 
result that cannot be reproduced locally as Cannot Reproduce. 

 TestHBaseFsck#testSplitDaughtersNotInMeta often fails
 -

 Key: HBASE-10372
 URL: https://issues.apache.org/jira/browse/HBASE-10372
 Project: HBase
  Issue Type: Test
Reporter: Ted Yu
 Attachments: 10372-testSplitDaughtersNotInMeta-output.html


 This test fails quite often.
 Here are some examples:
 https://builds.apache.org/job/HBase-0.98/88/
 https://builds.apache.org/job/HBase-0.98/89/
 This assertion fails:
 {code}
   assertNotNull(HRegionInfo.getHRegionInfo(result));
 {code}
 the split hbase:meta entry didn't have corresponding HRegionInfo.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-9845) Add MR support for cell ACLs

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875168#comment-13875168
 ] 

Andrew Purtell commented on HBASE-9845:
---

Looks like this got missed, working on it.

 Add MR support for cell ACLs
 

 Key: HBASE-9845
 URL: https://issues.apache.org/jira/browse/HBASE-9845
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.98.0
Reporter: Andrew Purtell
Assignee: Andrew Purtell
 Fix For: 0.98.0


 We should support adding cell ACLs through TableOutputFormat and 
 HFileOutputFormat.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875169#comment-13875169
 ] 

Hadoop QA commented on HBASE-10349:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623695/hbase-10349_v2.patch
  against trunk revision .
  ATTACHMENT ID: 12623695

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 6 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop1.1{color}.  The patch compiles against the hadoop 
1.1 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8461//console

This message is automatically generated.

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-10349_v2.patch, 
 hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 

[jira] [Updated] (HBASE-10372) TestHBaseFsck#testSplitDaughtersNotInMeta often fails

2014-01-17 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-10372:
---

Fix Version/s: 0.99.0
   0.98.0

Ok, I will mark this with a fix version for 0.98.0 but if at RC time there's no 
patch on the issue or this cannot be reproduced locally, I will do the above.

 TestHBaseFsck#testSplitDaughtersNotInMeta often fails
 -

 Key: HBASE-10372
 URL: https://issues.apache.org/jira/browse/HBASE-10372
 Project: HBase
  Issue Type: Test
Reporter: Ted Yu
 Fix For: 0.98.0, 0.99.0

 Attachments: 10372-testSplitDaughtersNotInMeta-output.html


 This test fails quite often.
 Here are some examples:
 https://builds.apache.org/job/HBase-0.98/88/
 https://builds.apache.org/job/HBase-0.98/89/
 This assertion fails:
 {code}
   assertNotNull(HRegionInfo.getHRegionInfo(result));
 {code}
 the split hbase:meta entry didn't have corresponding HRegionInfo.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10372) TestHBaseFsck#testSplitDaughtersNotInMeta often fails

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875173#comment-13875173
 ] 

Ted Yu commented on HBASE-10372:


The above did happen on Apache Jenkins.

I am running the test locally.

Will resolve if cannot reproduce.

 TestHBaseFsck#testSplitDaughtersNotInMeta often fails
 -

 Key: HBASE-10372
 URL: https://issues.apache.org/jira/browse/HBASE-10372
 Project: HBase
  Issue Type: Test
Reporter: Ted Yu
 Fix For: 0.98.0, 0.99.0

 Attachments: 10372-testSplitDaughtersNotInMeta-output.html


 This test fails quite often.
 Here are some examples:
 https://builds.apache.org/job/HBase-0.98/88/
 https://builds.apache.org/job/HBase-0.98/89/
 This assertion fails:
 {code}
   assertNotNull(HRegionInfo.getHRegionInfo(result));
 {code}
 the split hbase:meta entry didn't have corresponding HRegionInfo.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875179#comment-13875179
 ] 

Andrew Purtell commented on HBASE-10374:


I found it here: http://netty.io/4.0/api/io/netty/buffer/ByteBufAllocator.html 

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875181#comment-13875181
 ] 

Andrew Purtell commented on HBASE-10374:


Someone should try pulling in netty4 while excluding earlier versions to see 
what happens. I bet it will be a mess :-) given our dependencies on Jetty 6 and 
ZooKeeper.

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Matteo Bertozzi (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875188#comment-13875188
 ] 

Matteo Bertozzi commented on HBASE-10349:
-

+1 on jimmy v2 for me.
(it will be nice also having the MetaEditor.overwriteRegions test and the add 
regions debug committed, but I guess i can create another jira for them)

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-10349_v2.patch, 
 hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10371) Compact create empty hfile, then select this file for compaction and create empty hfile and over again.

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875201#comment-13875201
 ] 

Andrew Purtell commented on HBASE-10371:


+1 for 0.98

 Compact create empty hfile, then select this file for compaction and create 
 empty hfile and over again.
 ---

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: HBASE-10371-94.patch, HBASE-10371-trunk-2.patch, 
 HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10371) Compact create empty hfile, then select this file for compaction and create empty hfile and over again.

2014-01-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875210#comment-13875210
 ] 

Lars Hofhansl commented on HBASE-10371:
---

lgtm

 Compact create empty hfile, then select this file for compaction and create 
 empty hfile and over again.
 ---

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: HBASE-10371-94.patch, HBASE-10371-trunk-2.patch, 
 HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10249) Intermittent TestReplicationSyncUpTool failure

2014-01-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875217#comment-13875217
 ] 

Lars Hofhansl commented on HBASE-10249:
---

you gonna commit [~jdcryans]?

 Intermittent TestReplicationSyncUpTool failure
 --

 Key: HBASE-10249
 URL: https://issues.apache.org/jira/browse/HBASE-10249
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Jean-Daniel Cryans
 Fix For: 0.98.0, 0.96.2, 0.99.0, 0.94.17

 Attachments: HBASE-10249-0.94-v0.patch, HBASE-10249-0.94-v1.patch, 
 HBASE-10249-trunk-v0.patch, HBASE-10249-trunk-v1.patch


 New issue to keep track of this.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10123) Change default ports; move them out of linux ephemeral port range

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875229#comment-13875229
 ] 

Hudson commented on HBASE-10123:


SUCCESS: Integrated in HBase-TRUNK #4830 (See 
[https://builds.apache.org/job/HBase-TRUNK/4830/])
HBASE-10123 Change default ports; move them out of linux ephemeral port range 
(jmhsieh: rev 1559200)
* /hbase/trunk/bin/local-master-backup.sh
* 
/hbase/trunk/hbase-client/src/test/java/org/apache/hadoop/hbase/client/TestClientNoCluster.java
* 
/hbase/trunk/hbase-common/src/main/java/org/apache/hadoop/hbase/HBaseConfiguration.java
* 
/hbase/trunk/hbase-common/src/main/java/org/apache/hadoop/hbase/HConstants.java
* /hbase/trunk/hbase-common/src/main/resources/hbase-default.xml
* 
/hbase/trunk/hbase-server/src/main/jamon/org/apache/hadoop/hbase/tmpl/master/BackupMasterStatusTmpl.jamon
* 
/hbase/trunk/hbase-server/src/main/jamon/org/apache/hadoop/hbase/tmpl/regionserver/RSStatusTmpl.jamon
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/LocalHBaseCluster.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java
* /hbase/trunk/src/main/docbkx/configuration.xml
* /hbase/trunk/src/main/docbkx/troubleshooting.xml


 Change default ports; move them out of linux ephemeral port range
 -

 Key: HBASE-10123
 URL: https://issues.apache.org/jira/browse/HBASE-10123
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.1.1
Reporter: stack
Assignee: Jonathan Hsieh
Priority: Critical
 Fix For: 0.99.0

 Attachments: hbase-10123.patch, hbase-10123.v2.patch, 
 hbase-10123.v3.patch, hbase-10123.v4.patch


 Our defaults clash w/ the range linux assigns itself for creating come-and-go 
 ephemeral ports; likely in our history we've clashed w/ a random, short-lived 
 process.  While easy to change the defaults, we should just ship w/ defaults 
 that make sense.  We could host ourselves up into the 7 or 8k range.
 See http://www.ncftp.com/ncftpd/doc/misc/ephemeral_ports.html



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10371) Compact create empty hfile, then select this file for compaction and create empty hfile and over again.

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10371:
---

Status: Patch Available  (was: Open)

 Compact create empty hfile, then select this file for compaction and create 
 empty hfile and over again.
 ---

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: HBASE-10371-94.patch, HBASE-10371-trunk-2.patch, 
 HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10371) Compact create empty hfile, then select this file for compaction and create empty hfile and over again.

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10371:
---

Attachment: 10371-trunk-3.patch

Patch v3 corrects spelling mistake: entrys

 Compact create empty hfile, then select this file for compaction and create 
 empty hfile and over again.
 ---

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: 10371-trunk-3.patch, HBASE-10371-94.patch, 
 HBASE-10371-trunk-2.patch, HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10349) Table became unusable when master balanced its region after table was dropped

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875247#comment-13875247
 ] 

Ted Yu commented on HBASE-10349:


+1 on patch v2.

New tests are good.

 Table became unusable when master balanced its region after table was dropped
 -

 Key: HBASE-10349
 URL: https://issues.apache.org/jira/browse/HBASE-10349
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0
Reporter: Ted Yu
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.99.0

 Attachments: 
 10349-hadoop-hdfs-namenode-hor11n14.gq1.ygridcore.net.zip, 10349-output.log, 
 10349-v1.txt, 10349-v2.txt, HBASE-10349-meta-test-and-debug.patch, 
 hbase-10349.patch, hbase-10349_v2.patch, 
 hbase-hbase-master-hor15n05.gq1.ygridcore.net.log.tar.gz


 0.98 was used.
 This was sequence of events:
 create 'tablethree_mod'
 snapshot 'tablethree_mod', 'snapshot_tablethree_mod'
 disable 'tablethree_mod'
 2014-01-15 09:34:51,749   restore_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:35:07,210   enable 'tablethree_mod'
 2014-01-15 09:35:46,134   delete_snapshot 'snapshot_tablethree_mod'
 2014-01-15 09:41:42,210   disable 'tablethree_mod'
 2014-01-15 09:41:43,610   drop 'tablethree_mod'
 create 'tablethree_mod'
 For the last table creation request:
 {code} 
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'create 'tablethree_mod',
 {NAME = 'f1', VERSIONS = 3}
 ,
 {NAME = 'f2', VERSIONS = 3}
 ,
 {NAME = 'f3', VERSIONS = 3}
 '
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'exists 'tablethree_mod''
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '0', 'f1:q1', 'value-0', 10'
 2014-01-15 10:03:52,999|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '1', 'f1:q1', 'value-1', 20'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '2', 'f2:q2', 'value-2', 30'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '3', 'f3:q3', 'value-3', 40'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO| 'put 'tablethree_mod', 
 '4', 'f3:q3', 'value-4', 50'
 2014-01-15 10:03:53,000|beaver.component.hbase|INFO|Done writing commands to 
 file. Will execute them now.
 2014-01-15 10:03:53,000|beaver.machine|INFO|RUNNING: /usr/lib/hbase/bin/hbase 
 shell /grid/0/tmp/hwqe/artifacts/tmp-471142
 2014-01-15 10:03:55,878|beaver.machine|INFO|2014-01-15 10:03:55,878 INFO 
 [main] Configuration.deprecation: hadoop.native.lib is deprecated. Instead, 
 use io.native.lib.available
 2014-01-15 10:03:57,283|beaver.machine|INFO|2014-01-15 10:03:57,283 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,669|beaver.machine|INFO|2014-01-15 10:03:57,669 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,720|beaver.machine|INFO|2014-01-15 10:03:57,720 WARN 
 [main] conf.Configuration: hbase-site.xml:an attempt to override final 
 parameter: dfs.support.append; Ignoring.
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 2014-01-15 10:03:57,997|beaver.machine|INFO|ERROR: Table already exists: 
 tablethree_mod!
 2014-01-15 10:03:57,997|beaver.machine|INFO|
 {code}
 This was an intermittent issue after using Snapshots, a table is not properly 
 dropped / and not able to properly re-create with the same name. And a 
 HRegion is empty or null Error occurs. (When you try to drop the table it 
 says it does not exist, and when you try to create the table it says that it 
 does already exist).
 {code}
 2014-01-15 10:04:02,462|beaver.machine|INFO|ERROR: HRegionInfo was null or 
 empty in hbase:meta, row=keyvalues=
 {tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:seqnumDuringOpen/1389778905355/Put/vlen=8/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:server/1389778905355/Put/vlen=32/mvcc=0,
  
 tablethree_mod,,1389778226606.afc82d1ceabbaca36a504b83b65fc0c9./info:serverstartcode/1389778905355/Put/vlen=8/mvcc=0}
  
 {code}
 Thanks to Huned who discovered this issue.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10371) Compaction creates empty hfile, then selects this file for compaction and creates empty hfile and over again

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10371:
---

Summary: Compaction creates empty hfile, then selects this file for 
compaction and creates empty hfile and over again  (was: Compact create empty 
hfile, then select this file for compaction and create empty hfile and over 
again.)

 Compaction creates empty hfile, then selects this file for compaction and 
 creates empty hfile and over again
 

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: 10371-trunk-3.patch, HBASE-10371-94.patch, 
 HBASE-10371-trunk-2.patch, HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10371) Compaction creates empty hfile, then selects this file for compaction and creates empty hfile and over again

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875252#comment-13875252
 ] 

Ted Yu commented on HBASE-10371:


Ran all compaction related tests which passed.

Integrated to 0.98 and trunk.

Thanks for the patch, Lijin.

 Compaction creates empty hfile, then selects this file for compaction and 
 creates empty hfile and over again
 

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
 Attachments: 10371-trunk-3.patch, HBASE-10371-94.patch, 
 HBASE-10371-trunk-2.patch, HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10371) Compaction creates empty hfile, then selects this file for compaction and creates empty hfile and over again

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10371:
---

Assignee: binlijin
Hadoop Flags: Reviewed

 Compaction creates empty hfile, then selects this file for compaction and 
 creates empty hfile and over again
 

 Key: HBASE-10371
 URL: https://issues.apache.org/jira/browse/HBASE-10371
 Project: HBase
  Issue Type: Bug
Reporter: binlijin
Assignee: binlijin
 Attachments: 10371-trunk-3.patch, HBASE-10371-94.patch, 
 HBASE-10371-trunk-2.patch, HBASE-10371-trunk.patch


 (1) Select HFile for compaction
 {code}
 2014-01-16 01:01:25,111 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b
  whose maxTimeStamp is -1 while the max expired timestamp is 1389632485111
 {code}
 (2) Compact
 {code}
 2014-01-16 01:01:26,042 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/f3e38d10d579420494079e17a2557f0b,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 01:01:26,045 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  with permission=rwxrwxrwx
 2014-01-16 01:01:26,076 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Renaming compacted file at 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/.tmp/40de5d79f80e4fb197e409fb99ab0fd8
  to 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
 2014-01-16 01:01:26,142 INFO org.apache.hadoop.hbase.regionserver.Store: 
 Completed compaction of 1 file(s) in a of 
 storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767. into 
 40de5d79f80e4fb197e409fb99ab0fd8, size=534; total size for store is 399.0 M
 2014-01-16 01:01:26,142 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactionRequest: completed 
 compaction: 
 regionName=storagetable,01:,1369377609136.7d8941661904fb99a41f79a1fce47767., 
 storeName=a, fileCount=1, fileSize=534, priority=16, time=18280340606333745; 
 duration=0sec
 {code}
 (3) Select HFile for compaction
 {code}
 2014-01-16 03:48:05,120 INFO 
 org.apache.hadoop.hbase.regionserver.compactions.CompactSelection: Deleting 
 the expired store file by compaction: 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8
  whose maxTimeStamp is -1 while the max expired timestamp is 1389642485120
 {code}
 (4) Compact
 {code}
 2014-01-16 03:50:17,731 DEBUG org.apache.hadoop.hbase.regionserver.Compactor: 
 Compacting 
 hdfs://dump002002.cm6:9000/hbase-0.90/storagetable/7d8941661904fb99a41f79a1fce47767/a/40de5d79f80e4fb197e409fb99ab0fd8,
  keycount=0, bloomtype=NONE, size=534, encoding=NONE
 2014-01-16 03:50:17,732 DEBUG org.apache.hadoop.hbase.util.FSUtils: Creating 
 file=hdfs://dump002002.cm6:9000/hbase-0.90
 {code}
 ... 
 this loop for ever.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875262#comment-13875262
 ] 

Nick Dimiduk commented on HBASE-10374:
--

Yeah, I tried this once and it didn't go very well. I am keen to pursue Stack's 
suggestion though. I haven't come to a conclusive decision for why we'd pursue 
Netty's ByteBuf vs nio

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875265#comment-13875265
 ] 

Nick Dimiduk commented on HBASE-10374:
--

Premature submit. As I was a saying.

I don't have conclusive reasons to pursue Netty ByteBufs vs nio ByteBuffers. 
The compositional structure Netty provides is really nice, if we can use it, 
but I think it could be implemented on top of stock ByteBuffers as well.

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875270#comment-13875270
 ] 

Hadoop QA commented on HBASE-10346:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12623717/HBASE-10346.1.patch
  against trunk revision .
  ATTACHMENT ID: 12623717

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop1.1{color}.  The patch compiles against the hadoop 
1.1 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+Row key=dGVzdHJvdzE=gt;lt;Cell column=YTox 
timestamp=1389900769772gt;dGVzdHZhbHVlLWExlt;br
+dGVzdHZhbHVlLWEylt;/Cellgt;lt;Cell 
column=Yjoxtimestamp=1389900818233gt;dGVzdHZhbHVlLWIylt;br
+/Cellgt;lt;/Rowgt;lt;Row key=dGVzdHJvdzM=gt;lt;Cell column=YTox 
timestamp=1389900847336br
+Row key=dGVzdHJvdzE=gt;lt;Cell column=YTox 
timestamp=1389900769772gt;dGVzdHZhbHVlLWExlt;br
+curl -H Content-Type: text/xml 
https://localhost:8080/ExampleScanner/*?startrow=testrow1amp;limit=2
+Row key=dGVzdHJvdzE=gt;lt;Cell column=YTox 
timestamp=1389900769772gt;dGVzdHZhbHVlLWExlt;br
+curl -H Content-Type: text/xml 
https://localhost:8080/ExampleScanner/*?starttime=1389900769772amp;endtime=138990080
+Row key=dGVzdHJvdzE=gt;lt;Cell column=YTox 
timestamp=1389900769772gt;dGVzdHZhbHVlLWExlt;br
+Row key=dGVzdHJvdzE=gt;lt;Cell column=YTox 
timestamp=1389900769772gt;dGVzdHZhbHVlLWExlt;br

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8464//console

This message is automatically generated.

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875280#comment-13875280
 ] 

Nick Dimiduk commented on HBASE-10346:
--

The additional docs look great! Mind cleaning up the line length issues for me, 
I'll get it committed today.

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875281#comment-13875281
 ] 

Nick Dimiduk commented on HBASE-10346:
--

Oh, it's all in the preformatted output. Nevermind. +1

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10366) 0.94 filterRow() may be skipped in 0.96(or onwards) code

2014-01-17 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-10366:
--

Attachment: hbase-10366-v1.patch

Thanks All for the reviews! v1 patch adds a unit test.

[~te...@apache.org] The Javadoc for current hasFilterRow() is all right because 
new filter implementation should follow that way in the future. I modified some 
comments in my previous patch to specifically call out that 0.94 hasFilterRow() 
has different semantics.

 0.94 filterRow() may be skipped in 0.96(or onwards) code
 

 Key: HBASE-10366
 URL: https://issues.apache.org/jira/browse/HBASE-10366
 Project: HBase
  Issue Type: Bug
  Components: Filters
Reporter: Jeffrey Zhong
Assignee: Jeffrey Zhong
Priority: Critical
 Fix For: 0.98.0, 0.96.2

 Attachments: hbase-10366-v1.patch, hbase-10366.patch


 HBASE-6429 combines both filterRow  filterRow(ListKeyValue kvs) functions 
 in Filter. 
 While 0.94 code or older, it may not implement hasFilterRow as HBase-6429 
 expected because 0.94(old) hasFilterRow only returns true when 
 filterRow(ListKeyValue kvs) is overridden not the filterRow(). Therefore, 
 the filterRow() will be skipped.
 Since we don't ask 0.94 users to update their existing filter code, the issue 
 will cause scan returns unexpected keyvalues and break the backward 
 compatibility.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Nick Dimiduk (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-10346:
-

Attachment: HBASE-10346.addendum.patch

I jumped the gun, committed patch 0, not patch 1, to both 0.98 and trunk. This 
is the interdiff between the two. I'm verifying it applies cleanly and I'll 
commit it as an addendum.

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch, 
 HBASE-10346.addendum.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875312#comment-13875312
 ] 

Hudson commented on HBASE-10370:


FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #82 (See 
[https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/82/])
HBASE-10370 Compaction in out-of-date Store causes region split failed (Tedyu: 
rev 1559215)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
* 
/hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestSplitTransactionOnCluster.java


 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10374) add jemalloc into the choice of memstore allocation

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875319#comment-13875319
 ] 

Andrew Purtell commented on HBASE-10374:


This is also directly related to HBASE-10191

 add jemalloc into the choice of memstore allocation 
 

 Key: HBASE-10374
 URL: https://issues.apache.org/jira/browse/HBASE-10374
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.96.1.1
Reporter: Bing Jiang
Priority: Minor

 https://blog.twitter.com/2013/netty-4-at-twitter-reduced-gc-overhead 
 introduced that Netty used jemalloc to gain benefits from GC.
 It can be a good choice for memstore block allocation.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875318#comment-13875318
 ] 

Hudson commented on HBASE-10370:


FAILURE: Integrated in hbase-0.96 #261 (See 
[https://builds.apache.org/job/hbase-0.96/261/])
HBASE-10370 Compaction in out-of-date Store causes region split failed (stack: 
rev 1559226)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java


 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875327#comment-13875327
 ] 

Hadoop QA commented on HBASE-10346:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12623735/HBASE-10346.addendum.patch
  against trunk revision .
  ATTACHMENT ID: 12623735

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/8466//console

This message is automatically generated.

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch, 
 HBASE-10346.addendum.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875335#comment-13875335
 ] 

Hudson commented on HBASE-10370:


FAILURE: Integrated in HBase-0.98 #91 (See 
[https://builds.apache.org/job/HBase-0.98/91/])
HBASE-10370 Compaction in out-of-date Store causes region split failed (Tedyu: 
rev 1559215)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
* 
/hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestSplitTransactionOnCluster.java


 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10346) Add Documentation for stateless scanner

2014-01-17 Thread Nick Dimiduk (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-10346:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

Committed to 0.98 and trunk.

 Add Documentation for stateless scanner
 ---

 Key: HBASE-10346
 URL: https://issues.apache.org/jira/browse/HBASE-10346
 Project: HBase
  Issue Type: Improvement
  Components: REST
Reporter: Vandana Ayyalasomayajula
Assignee: Vandana Ayyalasomayajula
Priority: Minor
 Fix For: 0.98.0, 0.99.0

 Attachments: HBASE-10346.0.patch, HBASE-10346.1.patch, 
 HBASE-10346.addendum.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875351#comment-13875351
 ] 

Ted Yu commented on HBASE-10370:


From 
https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/82/testReport/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitFailedCompactionAndSplit/
 :
{code}
java.lang.AssertionError
at org.junit.Assert.fail(Assert.java:86)
at org.junit.Assert.assertTrue(Assert.java:41)
at org.junit.Assert.assertNotNull(Assert.java:621)
at org.junit.Assert.assertNotNull(Assert.java:631)
at 
org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster.testSplitFailedCompactionAndSplit(TestSplitTransactionOnCluster.java:335)
{code}
The following assertion failed:
{code}
CompactionContext cc = store.requestCompaction();
assertNotNull(cc);
{code}
Since the above is in the new test added, I will revert from 0.98 and trunk.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875348#comment-13875348
 ] 

Hudson commented on HBASE-10370:


SUCCESS: Integrated in HBase-TRUNK #4831 (See 
[https://builds.apache.org/job/HBase-TRUNK/4831/])
HBASE-10370 Compaction in out-of-date Store causes region split failed (Tedyu: 
rev 1559216)
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestSplitTransactionOnCluster.java


 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875353#comment-13875353
 ] 

Andrew Purtell commented on HBASE-10370:


bq. I will revert from 0.98 and trunk

Thanks Ted. 

+1 to recommit without the new test, or with a fixed test.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875355#comment-13875355
 ] 

Andrew Purtell commented on HBASE-10370:


This could also be handled with an addendum that drops the new test, since this 
was committed to 0.96 without the new test.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875356#comment-13875356
 ] 

Ted Yu commented on HBASE-10370:


bq. or with a fixed test
I prefer the above.

Thanks Andy.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875366#comment-13875366
 ] 

Andrew Purtell commented on HBASE-10370:


Well this was committed elsewhere without the new test, so I'd prefer that if 
you don't mind.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875368#comment-13875368
 ] 

Ted Yu commented on HBASE-10370:


Okay.
I will integrate one more time before 0.98.0 RC0 comes out.

Let's give Shaohui some time to make the test robust.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875370#comment-13875370
 ] 

Andrew Purtell commented on HBASE-10370:


Can we commit this to 0.98 as was committed to 0.96 and open a followup JIRA 
for a test for both 0.98 and 0.96 (at Stack's option)? 

bq. I will integrate one more time before 0.98.0 RC0 comes out.

Ok, the only blocker remaining is HBASE-10322. I'm going to tag the RC as soon 
as that goes in.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370v2.096.txt, HBASE-10370-v1.diff, 
 HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Created] (HBASE-10376) Backport JvmPauseMonitor to 0.94 branch

2014-01-17 Thread Ted Yu (JIRA)
Ted Yu created HBASE-10376:
--

 Summary: Backport JvmPauseMonitor to 0.94 branch
 Key: HBASE-10376
 URL: https://issues.apache.org/jira/browse/HBASE-10376
 Project: HBase
  Issue Type: Task
Reporter: Ted Yu


In the following thread:
http://search-hadoop.com/m/NniICiuKee1

Nick mentioned using JvmPauseMonitor to help track down an anomaly.

This JIRA is to backport JvmPauseMonitor (introduced in HBASE-9630) to 0.94 
branch



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10370:
---

Attachment: 10370-v4.patch

Sounds good.

Patch v4 is the fix without test.

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370-v4.patch, 10370v2.096.txt, 
 HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failed

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10370:
---

Status: Open  (was: Patch Available)

 Compaction in out-of-date Store causes region split failed
 --

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370-v4.patch, 10370v2.096.txt, 
 HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HBASE-10370) Compaction in out-of-date Store causes region split failure

2014-01-17 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10370:
---

Summary: Compaction in out-of-date Store causes region split failure  (was: 
Compaction in out-of-date Store causes region split failed)

 Compaction in out-of-date Store causes region split failure
 ---

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370-v4.patch, 10370v2.096.txt, 
 HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Created] (HBASE-10377) Add test for HBASE-10370 Compaction in out-of-date Store causes region split failure

2014-01-17 Thread Ted Yu (JIRA)
Ted Yu created HBASE-10377:
--

 Summary: Add test for HBASE-10370 Compaction in out-of-date Store 
causes region split failure
 Key: HBASE-10377
 URL: https://issues.apache.org/jira/browse/HBASE-10377
 Project: HBase
  Issue Type: Test
Reporter: Ted Yu


HBASE-10370 fixes the issue where region split fails following compacting 
out-of-date Store

The new test failed in this build:

https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/82/testReport/org.apache.hadoop.hbase.regionserver/TestSplitTransactionOnCluster/testSplitFailedCompactionAndSplit/

This issue is to make the new test, testSplitFailedCompactionAndSplit, robust.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HBASE-10370) Compaction in out-of-date Store causes region split failure

2014-01-17 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875385#comment-13875385
 ] 

Ted Yu commented on HBASE-10370:


Re-integrated the fix into 0.98 and trunk.

I created HBASE-10377 for restoring the test.

Once Jenkins builds come back green, will resolve this issue.

 Compaction in out-of-date Store causes region split failure
 ---

 Key: HBASE-10370
 URL: https://issues.apache.org/jira/browse/HBASE-10370
 Project: HBase
  Issue Type: Bug
  Components: Compaction
Affects Versions: 0.94.3, 0.98.0, 0.99.0
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 0.98.0, 0.96.2, 0.99.0

 Attachments: 10370-v3.patch, 10370-v4.patch, 10370v2.096.txt, 
 HBASE-10370-v1.diff, HBASE-10370-v2.diff


 In out product cluster, we encounter a problem that two daughter regions can 
 not been opened for FileNotFoundException.
 {quote}
 2014-01-14,20:12:46,927 INFO 
 org.apache.hadoop.hbase.regionserver.SplitRequest: Running rollback/cleanup 
 of failed split of 
 user_profile,x,1389671863815.99e016485b0bc142d67ae07a884f6966.; 
 Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 java.io.IOException: Failed 
 lg-hadoop-st34.bj,21600,1389060755669-daughterOpener=ec8bbda0f132c481b451fa40e7152b98
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.openDaughters(SplitTransaction.java:375)
 at 
 org.apache.hadoop.hbase.regionserver.SplitTransaction.execute(SplitTransaction.java:467)
 at 
 org.apache.hadoop.hbase.regionserver.SplitRequest.run(SplitRequest.java:69)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
 at java.lang.Thread.run(Thread.java:662)
 Caused by: java.io.IOException: java.io.IOException: 
 java.io.FileNotFoundException: File does not exist: 
 /hbase/lgprc-xiaomi/user_profile/99e016485b0bc142d67ae07a884f6966/A/5e05d706e4a84f34acc2cf00f089a4cf
 
 {quote}
 The reason is that a compaction in an out-of-date Store deletes the hfiles, 
 which are referenced by  the daughter regions after split. This will cause 
 the daughter regions can not be opened forever. 
 The timeline is that 
 Assumption: there are two hfiles: a, b in Store A in Region R
 t0: A compaction request of Store A(a+b) in Region R is sent.
 t1: First Split for Region R. But this split is timeout and rollbacked. In 
 the rollback, region reinitializes all store objects , see SplitTransaction 
 #824. Now the store is Region R is A'(a+b).
 t2: Run the compaction sent in t0 . (hfile: a + b - c): A(a+b) - A(c). 
 Hfile a and b are archived.
 t3: Another Split for Region R. R splits into two region R.0, R.1, which 
 create hfile references for hfile a, b from Store A'(a + b)
 t4: For hfile a, b have been deleted, the opening for region R.0 and R.1 will 
 failed for FileNotFoundException.
 I have add a test to identity this problem.
 After search the jira, maybe HBASE-8502 is the same problem. [~goldin]



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


  1   2   >