[jira] [Updated] (HBASE-9766) HFileV3 - Optional tags write and read is not working as expected

2013-10-15 Thread Anoop Sam John (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anoop Sam John updated HBASE-9766:
--

Attachment: HBASE-9766_V3.patch

Corrected the test failure. Let QA run on this. Once all tests pass I will 
commit.

> HFileV3 - Optional tags write and read is not working as expected
> -
>
> Key: HBASE-9766
> URL: https://issues.apache.org/jira/browse/HBASE-9766
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.98.0
>Reporter: Anoop Sam John
>Assignee: Anoop Sam John
> Fix For: 0.98.0
>
> Attachments: HBASE-9766.patch, HBASE-9766_V2.patch, 
> HBASE-9766_V3.patch
>
>
> In the writer V3 includesTags always comes as true only and so writing tags 
> length always even when compaction selection says not to do so.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796461#comment-13796461
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

Some additional performance numbers on a patch:

Table: 1 CF + 5 CQ, value ~ 10-15 bytes. Rows = 50. All data in a block 
cache. Tested on *StoreScanner * directly.

Default:
Raw = 1.28M rows per sec
1 CQ in Scan   = 0.7M
2 CQ in Scan   = 0.5M
3 CQ in Scan   = 0.4M
4 CQ in Scan  = 0.32M
5 CQ in Scan  = 0.33M

Patch:

Raw = 1.28M rows per sec
1 CQ in Scan   = 1.27M
2 CQ in Scan   = 1.2M
3 CQ in Scan   = 1.1M
4 CQ in Scan  = 1.05M
5 CQ in Scan  = 1M



> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796460#comment-13796460
 ] 

Hadoop QA commented on HBASE-9778:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608659/9778-trunk.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   
org.apache.hadoop.hbase.regionserver.TestExplicitColumnTracker
  org.apache.hadoop.hbase.regionserver.TestQueryMatcher

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7560//console

This message is automatically generated.

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many 
> versions to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#

[jira] [Commented] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796457#comment-13796457
 ] 

stack commented on HBASE-9778:
--

Nice numbers for such a little code change.

I don't know this code well.

Looking at it, doesn't SKIP mean 'do not include this KV in the result'?  So if 
maxVersion == 1, it means we already have a KV and we are saying 'do not 
include this result' for all subsequent seen columns rather than call a seek?  
What if many versions of this row.  Now we are not seeking, we could spend a 
bunch of time having to skip all values?

Am I groking this right?

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many 
> versions to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Devaraj Das (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796454#comment-13796454
 ] 

Devaraj Das commented on HBASE-9773:


Cool. Now it seems this will also fix HBASE-9777. Would it?

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796455#comment-13796455
 ] 

Hudson commented on HBASE-9773:
---

SUCCESS: Integrated in hbase-0.96 #143 (See 
[https://builds.apache.org/job/hbase-0.96/143/])
HBASE-9773 Master aborted when hbck asked the master to assign a region that 
was already online (jxiang: rev 1532634)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* 
/hbase/branches/0.96/hbase-server/src/test/java/org/apache/hadoop/hbase/master/TestAssignmentManagerOnCluster.java


> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796448#comment-13796448
 ] 

Hudson commented on HBASE-9773:
---

SUCCESS: Integrated in HBase-TRUNK #4622 (See 
[https://builds.apache.org/job/HBase-TRUNK/4622/])
HBASE-9773 Master aborted when hbck asked the master to assign a region that 
was already online (jxiang: rev 1532633)
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/master/TestAssignmentManagerOnCluster.java


> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796447#comment-13796447
 ] 

Hadoop QA commented on HBASE-9754:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608649/9754-rp-0.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 27 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7559//console

This message is automatically generated.

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9779) IntegrationTestLoadAndVerify fails deleting IntegrationTestLoadAndVerify table

2013-10-15 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9779:
-

Attachment: 9779part.txt

Fix so we don't retry if table doesn't exist.

If Elliott is reading, this failed the #36, #40 and half if not all of build 
#38. Let me make it critical.

> IntegrationTestLoadAndVerify fails deleting IntegrationTestLoadAndVerify 
> table 
> ---
>
> Key: HBASE-9779
> URL: https://issues.apache.org/jira/browse/HBASE-9779
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: stack
>Assignee: stack
> Attachments: 9779part.txt
>
>
> As part of the test, we want to delete the created table to restore cluster 
> state.  Interestingly we can disable the table successfully but then 
> immediately after we fail the delete because we cannot get the table 
> descriptor -- getting the file descriptor is used to test if table is present.
> The test for getDescriptor is kinda broke because it throws base IOE which 
> causes clients to retry over and over again as though the descriptor was 
> going to come back.
> This bug is kinda ugly because in at least one case it caused our 
> long-running hbase-it suite run to fail so would be good to fix.
> Here is sample from a test run:
> {code}
> Disabling table IntegrationTestLoadAndVerify 2013-10-11 18:27:53,485 INFO  
> [main] client.HBaseAdmin: Started disable of IntegrationTestLoadAndVerify
> 2013-10-11 18:27:53,526 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
> watcher=catalogtracker-on-hconnection-0x5a7e666f
> 2013-10-11 18:27:53,527 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
> identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
> ensemble=a1805.halxg.cloudera.com:2181
> 2013-10-11 18:27:53,527 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Opening socket connection to server 
> a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
> using SASL (unknown error)
> 2013-10-11 18:27:53,527 DEBUG [main] catalog.CatalogTracker: Starting catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
> 2013-10-11 18:27:53,529 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: Socket 
> connection established to a1805.halxg.cloudera.com/10.20.200.105:2181, 
> initiating session
> 2013-10-11 18:27:53,539 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Session establishment complete on server 
> a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c70, 
> negotiated timeout = 4
> 2013-10-11 18:27:53,602 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
> 2013-10-11 18:27:53,662 INFO  [main] zookeeper.ZooKeeper: Session: 
> 0x1412d47f53a5c70 closed
> 2013-10-11 18:27:53,662 INFO  [main-EventThread] zookeeper.ClientCnxn: 
> EventThread shut down
> .2013-10-11 18:27:54,666 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
> watcher=catalogtracker-on-hconnection-0x5a7e666f
> 2013-10-11 18:27:54,667 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
> identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
> ensemble=a1805.halxg.cloudera.com:2181
> 2013-10-11 18:27:54,667 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Opening socket connection to server 
> a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
> using SASL (unknown error)
> 2013-10-11 18:27:54,667 DEBUG [main] catalog.CatalogTracker: Starting catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
> 2013-10-11 18:27:54,667 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: Socket 
> connection established to a1805.halxg.cloudera.com/10.20.200.105:2181, 
> initiating session
> 2013-10-11 18:27:54,696 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Session establishment complete on server 
> a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c71, 
> negotiated timeout = 4
> 2013-10-11 18:27:54,821 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
> 2013-10-11 18:27:54,871 INFO  [main] zookeeper.ZooKeeper: Session: 
> 0x1412d47f53a5c71 closed
> 2013-10-11 18:27:54,871 INFO  [main-EventThread] zookeeper.ClientCnxn: 
> EventThread shut down
> .2013-10-11 18:27:55,890 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> con

[jira] [Updated] (HBASE-9779) IntegrationTestLoadAndVerify fails deleting IntegrationTestLoadAndVerify table

2013-10-15 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9779:
-

Priority: Critical  (was: Major)

> IntegrationTestLoadAndVerify fails deleting IntegrationTestLoadAndVerify 
> table 
> ---
>
> Key: HBASE-9779
> URL: https://issues.apache.org/jira/browse/HBASE-9779
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: stack
>Assignee: stack
>Priority: Critical
> Attachments: 9779part.txt
>
>
> As part of the test, we want to delete the created table to restore cluster 
> state.  Interestingly we can disable the table successfully but then 
> immediately after we fail the delete because we cannot get the table 
> descriptor -- getting the file descriptor is used to test if table is present.
> The test for getDescriptor is kinda broke because it throws base IOE which 
> causes clients to retry over and over again as though the descriptor was 
> going to come back.
> This bug is kinda ugly because in at least one case it caused our 
> long-running hbase-it suite run to fail so would be good to fix.
> Here is sample from a test run:
> {code}
> Disabling table IntegrationTestLoadAndVerify 2013-10-11 18:27:53,485 INFO  
> [main] client.HBaseAdmin: Started disable of IntegrationTestLoadAndVerify
> 2013-10-11 18:27:53,526 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
> watcher=catalogtracker-on-hconnection-0x5a7e666f
> 2013-10-11 18:27:53,527 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
> identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
> ensemble=a1805.halxg.cloudera.com:2181
> 2013-10-11 18:27:53,527 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Opening socket connection to server 
> a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
> using SASL (unknown error)
> 2013-10-11 18:27:53,527 DEBUG [main] catalog.CatalogTracker: Starting catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
> 2013-10-11 18:27:53,529 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: Socket 
> connection established to a1805.halxg.cloudera.com/10.20.200.105:2181, 
> initiating session
> 2013-10-11 18:27:53,539 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Session establishment complete on server 
> a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c70, 
> negotiated timeout = 4
> 2013-10-11 18:27:53,602 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
> 2013-10-11 18:27:53,662 INFO  [main] zookeeper.ZooKeeper: Session: 
> 0x1412d47f53a5c70 closed
> 2013-10-11 18:27:53,662 INFO  [main-EventThread] zookeeper.ClientCnxn: 
> EventThread shut down
> .2013-10-11 18:27:54,666 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
> watcher=catalogtracker-on-hconnection-0x5a7e666f
> 2013-10-11 18:27:54,667 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
> identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
> ensemble=a1805.halxg.cloudera.com:2181
> 2013-10-11 18:27:54,667 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Opening socket connection to server 
> a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
> using SASL (unknown error)
> 2013-10-11 18:27:54,667 DEBUG [main] catalog.CatalogTracker: Starting catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
> 2013-10-11 18:27:54,667 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: Socket 
> connection established to a1805.halxg.cloudera.com/10.20.200.105:2181, 
> initiating session
> 2013-10-11 18:27:54,696 INFO  
> [main-SendThread(a1805.halxg.cloudera.com:2181)] zookeeper.ClientCnxn: 
> Session establishment complete on server 
> a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c71, 
> negotiated timeout = 4
> 2013-10-11 18:27:54,821 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
> tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
> 2013-10-11 18:27:54,871 INFO  [main] zookeeper.ZooKeeper: Session: 
> 0x1412d47f53a5c71 closed
> 2013-10-11 18:27:54,871 INFO  [main-EventThread] zookeeper.ClientCnxn: 
> EventThread shut down
> .2013-10-11 18:27:55,890 INFO  [main] zookeeper.ZooKeeper: Initiating client 
> connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
> watcher=catalogtracker-on-hconnection-0

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796445#comment-13796445
 ] 

Lars Hofhansl commented on HBASE-9754:
--

I think in the *PolicyObserver.preStoreScannerOpen, we want the current MVCC 
readpoint, rather than the store smallest readpoint.
I am also a bit concerned about passing 0 as readPt in some places, as that 
makes everything visible. And for example it avoids setting the memstoreTs of 
KV to 0 even when there are no old scanners open that could see that KV.
(In fact these were the consideration that made me think a correct patch for 
this is hard)

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9779) IntegrationTestLoadAndVerify fails deleting IntegrationTestLoadAndVerify table

2013-10-15 Thread stack (JIRA)
stack created HBASE-9779:


 Summary: IntegrationTestLoadAndVerify fails deleting 
IntegrationTestLoadAndVerify table 
 Key: HBASE-9779
 URL: https://issues.apache.org/jira/browse/HBASE-9779
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 0.96.0
Reporter: stack
Assignee: stack


As part of the test, we want to delete the created table to restore cluster 
state.  Interestingly we can disable the table successfully but then 
immediately after we fail the delete because we cannot get the table descriptor 
-- getting the file descriptor is used to test if table is present.

The test for getDescriptor is kinda broke because it throws base IOE which 
causes clients to retry over and over again as though the descriptor was going 
to come back.

This bug is kinda ugly because in at least one case it caused our long-running 
hbase-it suite run to fail so would be good to fix.

Here is sample from a test run:

{code}
Disabling table IntegrationTestLoadAndVerify 2013-10-11 18:27:53,485 INFO  
[main] client.HBaseAdmin: Started disable of IntegrationTestLoadAndVerify
2013-10-11 18:27:53,526 INFO  [main] zookeeper.ZooKeeper: Initiating client 
connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
watcher=catalogtracker-on-hconnection-0x5a7e666f
2013-10-11 18:27:53,527 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
ensemble=a1805.halxg.cloudera.com:2181
2013-10-11 18:27:53,527 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
using SASL (unknown error)
2013-10-11 18:27:53,527 DEBUG [main] catalog.CatalogTracker: Starting catalog 
tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
2013-10-11 18:27:53,529 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Socket connection established to 
a1805.halxg.cloudera.com/10.20.200.105:2181, initiating session
2013-10-11 18:27:53,539 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Session establishment complete on server 
a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c70, 
negotiated timeout = 4
2013-10-11 18:27:53,602 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
tracker org.apache.hadoop.hbase.catalog.CatalogTracker@4ace08a5
2013-10-11 18:27:53,662 INFO  [main] zookeeper.ZooKeeper: Session: 
0x1412d47f53a5c70 closed
2013-10-11 18:27:53,662 INFO  [main-EventThread] zookeeper.ClientCnxn: 
EventThread shut down
.2013-10-11 18:27:54,666 INFO  [main] zookeeper.ZooKeeper: Initiating client 
connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
watcher=catalogtracker-on-hconnection-0x5a7e666f
2013-10-11 18:27:54,667 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
ensemble=a1805.halxg.cloudera.com:2181
2013-10-11 18:27:54,667 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
using SASL (unknown error)
2013-10-11 18:27:54,667 DEBUG [main] catalog.CatalogTracker: Starting catalog 
tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
2013-10-11 18:27:54,667 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Socket connection established to 
a1805.halxg.cloudera.com/10.20.200.105:2181, initiating session
2013-10-11 18:27:54,696 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Session establishment complete on server 
a1805.halxg.cloudera.com/10.20.200.105:2181, sessionid = 0x1412d47f53a5c71, 
negotiated timeout = 4
2013-10-11 18:27:54,821 DEBUG [main] catalog.CatalogTracker: Stopping catalog 
tracker org.apache.hadoop.hbase.catalog.CatalogTracker@692c0c5d
2013-10-11 18:27:54,871 INFO  [main] zookeeper.ZooKeeper: Session: 
0x1412d47f53a5c71 closed
2013-10-11 18:27:54,871 INFO  [main-EventThread] zookeeper.ClientCnxn: 
EventThread shut down
.2013-10-11 18:27:55,890 INFO  [main] zookeeper.ZooKeeper: Initiating client 
connection, connectString=a1805.halxg.cloudera.com:2181 sessionTimeout=9 
watcher=catalogtracker-on-hconnection-0x5a7e666f
2013-10-11 18:27:55,891 INFO  [main] zookeeper.RecoverableZooKeeper: Process 
identifier=catalogtracker-on-hconnection-0x5a7e666f connecting to ZooKeeper 
ensemble=a1805.halxg.cloudera.com:2181
2013-10-11 18:27:55,891 INFO  [main-SendThread(a1805.halxg.cloudera.com:2181)] 
zookeeper.ClientCnxn: Opening socket connection to server 
a1805.halxg.cloudera.com/10.20.200.105:2181. Will not attempt to authenticate 
using SASL (unknown error)
2013

[jira] [Updated] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9778:
-

Description: 
The issue of slow seeking in ExplicitColumnTracker was brought up by 
[~vrodionov] on the dev list.

My idea here is to avoid the seeking if we know that there aren't many versions 
to skip.
How do we know? We'll use the column family's VERSIONS setting as a hint. If 
VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
SKIP repeatedly.

HBASE-9769 has some initial number for this approach:
Interestingly it depends on which column(s) is (are) selected.

Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
everything filtered at the server with a ValueFilter. Everything measured in 
seconds.

Without patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.5|14.3|14.6|11.1|20.3|

With patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.4|8.9|9.9|6.4|10.0|

Variation here was +- 0.2s.

So with this patch scanning is 2x faster than without in some cases, and never 
slower. No special hint needed, beyond declaring VERSIONS correctly.


  was:
The issue of slow seeking in ExplicitColumnTracker was brought up by 
[~vrodionov] on the dev list.
My idea here is to avoid the seeking if we know that there aren't many rows to 
skip.
How do we know? We'll use the column family's VERSIONS setting as a hint. If 
VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
SKIP repeatedly.

HBASE-9769 has some initial number for this approach:
Interestingly it depends on which column(s) is (are) selected.

Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
everything filtered at the server with a ValueFilter. Everything measured in 
seconds.

Without patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.5|14.3|14.6|11.1|20.3|

With patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.4|8.9|9.9|6.4|10.0|

Variation here was +- 0.2s.

So with this patch scanning is 2x faster than without in some cases, and never 
slower. No special hint needed, beyond declaring VERSIONS correctly.



> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many 
> versions to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796436#comment-13796436
 ] 

Lars Hofhansl commented on HBASE-9769:
--

Nit: Use the Eclipse formatter from HBASE-5961. We use 2 spaces instead of a 
tab for indentation.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796435#comment-13796435
 ] 

Lars Hofhansl commented on HBASE-9769:
--

I created HBASE-9778 for my patch idea.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796434#comment-13796434
 ] 

Lars Hofhansl commented on HBASE-9769:
--

MAX_VERSIONS=1 (or a low number) can only be used to eliminate the NEXT_COL 
seek (as that is use to seek past versions of the same column). It does not 
indicate anything about the number of columns in a row, and hence we know 
nothing about whether SEEK_NEXT_ROW or a series of SKIPs is better.

We need both, I think.

(MAX_VERSIONS is a hint in the sense that there temporarily can be more 
versions in the memstore and/or distributed over various HFiles, only after a 
major compaction will the number of versions actually be <= MAX_VERSIONS.)


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796433#comment-13796433
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

Lars, our patches are independent. I think they need to be merged into one, or 
you better create new JIRA for *do-not-seek-next-col* thing.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796431#comment-13796431
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

To activate this feature (hint):
{code}
Scan scan = ...
scan.setAttribute(Scan.SCAN_NARROW_ROWS, "true".getBytes());
{code}

OK. I think I will replace SCAN_NARROW_ROWS with HINT_NARROW_ROWS.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796430#comment-13796430
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

It contains check MAX_VERSIONS = 1 suggested by Lars (not sure if it is really 
a hint?). Lars version gives improvements as well, but it relies on default 
hint of MAX_VERSIONS and is slower I think. I completely eliminated 
ExplicitColumnTracker from the code path. The more columns in a scan the more 
is going to be performance difference, I think (again). 

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796426#comment-13796426
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

It contains check MAX_VERSIONS = 1 suggested by Lars (not sure if it is really 
a hint?).

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9778:
-

Attachment: 9778-trunk.txt

And a trunk version for HadoopQA

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many rows 
> to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9778:
-

Attachment: 9778-0.94.txt

0.94 patch.

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many rows 
> to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9778:
-

Status: Patch Available  (was: Open)

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9778-0.94.txt, 9778-trunk.txt
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many rows 
> to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)



[jira] [Commented] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796423#comment-13796423
 ] 

Lars Hofhansl commented on HBASE-9778:
--

Note that this avoid the NEXT_COL but not the NEXT_ROW seeks. Here we have 
nothing to go by to know whether a row is going to be large (many columns) or 
not, and hence we'd need to hint the scan explicitly somehow and/or use a 
filter as Vladimir suggests in HBASE-9769.

> Avoid seeking to next column in ExplicitColumnTracker when possible
> ---
>
> Key: HBASE-9778
> URL: https://issues.apache.org/jira/browse/HBASE-9778
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
>
> The issue of slow seeking in ExplicitColumnTracker was brought up by 
> [~vrodionov] on the dev list.
> My idea here is to avoid the seeking if we know that there aren't many rows 
> to skip.
> How do we know? We'll use the column family's VERSIONS setting as a hint. If 
> VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
> SKIP repeatedly.
> HBASE-9769 has some initial number for this approach:
> Interestingly it depends on which column(s) is (are) selected.
> Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
> everything filtered at the server with a ValueFilter. Everything measured in 
> seconds.
> Without patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.5|14.3|14.6|11.1|20.3|
> With patch:
> ||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
> |6.4|8.4|8.9|9.9|6.4|10.0|
> Variation here was +- 0.2s.
> So with this patch scanning is 2x faster than without in some cases, and 
> never slower. No special hint needed, beyond declaring VERSIONS correctly.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-9769:
-

Attachment: 9769-94.txt

Initial version of a patch

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt, 9769-94.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9778) Avoid seeking to next column in ExplicitColumnTracker when possible

2013-10-15 Thread Lars Hofhansl (JIRA)
Lars Hofhansl created HBASE-9778:


 Summary: Avoid seeking to next column in ExplicitColumnTracker 
when possible
 Key: HBASE-9778
 URL: https://issues.apache.org/jira/browse/HBASE-9778
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.98.0, 0.94.13, 0.96.1


The issue of slow seeking in ExplicitColumnTracker was brought up by 
[~vrodionov] on the dev list.
My idea here is to avoid the seeking if we know that there aren't many rows to 
skip.
How do we know? We'll use the column family's VERSIONS setting as a hint. If 
VERSIONS is set to 1 (or maybe some value < 10) we'll avoid the seek and call 
SKIP repeatedly.

HBASE-9769 has some initial number for this approach:
Interestingly it depends on which column(s) is (are) selected.

Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1, 
everything filtered at the server with a ValueFilter. Everything measured in 
seconds.

Without patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.5|14.3|14.6|11.1|20.3|

With patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.4|8.9|9.9|6.4|10.0|

Variation here was +- 0.2s.

So with this patch scanning is 2x faster than without in some cases, and never 
slower. No special hint needed, beyond declaring VERSIONS correctly.




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796417#comment-13796417
 ] 

Lars Hofhansl commented on HBASE-9769:
--

bq. Lars, HTable can have small number of versions and large number of column 
qualifiers or large values (say 100K). 

That is true. Seeking to the next column is not a good idea, though, if we know 
there are not going to be many versions to skip. So the suggested patch here 
will not be slower than before, and it will improve performance in many cases.

As the size of a KV approaches the HFile blocksize (64k by default), SKIP and 
SEEK_NEXT_COL should become equivalent in performance (in both cases we'll need 
to find the KV in the next block).

As I said, this does not eliminate the NEXT_ROW seeking.

I fear the filter approach will lead to issues when there are already filters 
configured on the scan. You'd have to convert this to a FilterList while 
keeping all the semantics and performance characteristics.
I think it might be best to ship your Filter and document its use.

I'll file a separate issue for my patch.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796414#comment-13796414
 ] 

stack commented on HBASE-9776:
--

Thanks [~jeffreyz]

Looking at the delete table code, it is doing a getDescription to test 
existence.  Rather than return a TableNotFoundException which is a DoNotRetry 
exception, the code is throwing an IOE which gets retried 35 times in a matter 
of seconds.  Could do with some improvement.  Let me look at it.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796411#comment-13796411
 ] 

Lars Hofhansl commented on HBASE-9769:
--

Note that Vladimir's small row hint still can be used to eliminate the NEXT_ROW 
seek. Maybe, again, it is prudent to split this in two issues.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9769:
-

Attachment: 9769-0.94-sample2.txt

Even simpler. Same perf numbers.
(The previous patch was actually not correct when there are actually multiple 
versions).


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample2.txt, 
> 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796408#comment-13796408
 ] 

Lars Hofhansl commented on HBASE-9769:
--

Interestingly it depends on which column(s) is (are) selected.

Some numbers: 4m rows, 5 cols each, 1 cf, 10 bytes values, VERSIONS=1. 
Everything measured in seconds.

Without patch:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.5|14.3|14.6|11.1|20.3|

With patch sample1:
||Wildcard||Col 1||Col 2||Col 4||Col 5||Col 2+4||
|6.4|8.4|8.9|9.9|6.4|10.0|

Variation here was +- 0.2s.

So with this patch scanning is 2x faster than without in some cases, and never 
slower. No special hint needed, beyond declaring VERSIONS correctly.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9689) Support using OperationAttributes through shell script

2013-10-15 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796400#comment-13796400
 ] 

ramkrishna.s.vasudevan commented on HBASE-9689:
---

[~apurtell]
bq.There is no way to have a backwards compatible put command that does not 
take an attributes hash, just a timestamp
You can do that.  
{code}
put 't1', 'r1', 'c1', 'value', {TIMESTAMP=>100}
{code}
What i meant was Timstamp cannot be specified as per the older way
{code}
put 't1','r1','value',ts
{code}
The problem is put accepts a set of parameters.  So if we avoid the ts 
parameter and specify the ATTRIBUTE instead the timestamp is taken as 
'mykeymyvalue'.

> Support using OperationAttributes through shell script
> --
>
> Key: HBASE-9689
> URL: https://issues.apache.org/jira/browse/HBASE-9689
> Project: HBase
>  Issue Type: Improvement
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 0.98.0
>
> Attachments: HBASE-9689_1.patch, HBASE-9689_2.patch, HBASE-9689.patch
>
>
> Currently the ruby scripts for Put does not support setting of Operation 
> Attributes through shell.  
> It may be useful in some cases and also for testing.  And that would give a 
> full  fledged support using shell.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9754:
--

Attachment: 9754-rp-0.txt

Rebased patch.

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9754:
--

Attachment: (was: 9754-rp-0.txt)

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796384#comment-13796384
 ] 

stack commented on HBASE-5487:
--

@enis List makes for pretty good set of requirements.  We used to talk 100k 
regions but folks are long past that now so we are behind the curve (Flurry are 
>250k IIRC) and we may want to tend away from a few large regions and more 
toward many small regions if we can get AM to perform (advantages: smaller 
compression runs, easier to free up WALs, etc)

> Generic framework for Master-coordinated tasks
> --
>
> Key: HBASE-5487
> URL: https://issues.apache.org/jira/browse/HBASE-5487
> Project: HBase
>  Issue Type: New Feature
>  Components: master, regionserver, Zookeeper
>Affects Versions: 0.94.0
>Reporter: Mubarak Seyed
>Priority: Critical
> Attachments: Region management in Master5.docx, Region management in 
> Master.pdf
>
>
> Need a framework to execute master-coordinated tasks in a fault-tolerant 
> manner. 
> Master-coordinated tasks such as online-scheme change and delete-range 
> (deleting region(s) based on start/end key) can make use of this framework.
> The advantages of framework are
> 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
> master-coordinated tasks
> 2. Ability to abstract the common functions across Master -> ZK and RS -> ZK
> 3. Easy to plugin new master-coordinated tasks without adding code to core 
> components



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9771) [WebUI] Humanize store and blockcache statistics on RS

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796380#comment-13796380
 ] 

Hudson commented on HBASE-9771:
---

SUCCESS: Integrated in hbase-0.96 #142 (See 
[https://builds.apache.org/job/hbase-0.96/142/])
HBASE-9771 [WebUI] Humanize store and blockcache statistics on RS (ndimiduk: 
rev 1532599)
* 
/hbase/branches/0.96/hbase-hadoop-compat/src/main/java/org/apache/hadoop/hbase/regionserver/MetricsRegionServerWrapper.java
* 
/hbase/branches/0.96/hbase-server/src/main/jamon/org/apache/hadoop/hbase/tmpl/regionserver/ServerMetricsTmpl.jamon


> [WebUI] Humanize store and blockcache statistics on RS
> --
>
> Key: HBASE-9771
> URL: https://issues.apache.org/jira/browse/HBASE-9771
> Project: HBase
>  Issue Type: Improvement
>  Components: UI
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Trivial
> Fix For: 0.98.0, 0.96.1
>
> Attachments: HBASE-9771.00.patch, HBASE-9771.01.patch, 
> HBASE-9771.02.patch
>
>
> Some statistics reported on webui don't include hints about what the unit is, 
> leaving people guessing about what they mean. Clean them up.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-15 Thread Sergey Shelukhin (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Shelukhin updated HBASE-5487:


Attachment: Region management in Master5.docx

attaching preliminary version... still need to iron out all operation details, 
where step-by-step operation starts many things may be wrong, but it should be 
good until then.

Sorry Mac craps out when printing this as PDF due to mix of page orientations, 
let me try to find some better format for final version...

> Generic framework for Master-coordinated tasks
> --
>
> Key: HBASE-5487
> URL: https://issues.apache.org/jira/browse/HBASE-5487
> Project: HBase
>  Issue Type: New Feature
>  Components: master, regionserver, Zookeeper
>Affects Versions: 0.94.0
>Reporter: Mubarak Seyed
>Priority: Critical
> Attachments: Region management in Master5.docx, Region management in 
> Master.pdf
>
>
> Need a framework to execute master-coordinated tasks in a fault-tolerant 
> manner. 
> Master-coordinated tasks such as online-scheme change and delete-range 
> (deleting region(s) based on start/end key) can make use of this framework.
> The advantages of framework are
> 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
> master-coordinated tasks
> 2. Ability to abstract the common functions across Master -> ZK and RS -> ZK
> 3. Easy to plugin new master-coordinated tasks without adding code to core 
> components



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796365#comment-13796365
 ] 

stack commented on HBASE-9773:
--

[~jxiang] makes sense.  I like the test.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-15 Thread Enis Soztutar (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796362#comment-13796362
 ] 

Enis Soztutar commented on HBASE-5487:
--

I also started a document some time ago, but never got to finish it to the 
level of details I would like. However, I think we can agree on the design 
goals section which I augmented from the discussion so far:

- Robust implementation
- Compressive test coverage by mocking server and region assignment states 
(unit testable without MiniCluster and CM stuff)
- Bulk region operations
- Region operations should be isolated from server operations (AM vs SSH, log 
splitting), and table operations (disabling / disabled table, schema changes, 
etc) and cluster shutdown. AM and SSH should NEVER know about table state 
(disable/disabling). Server liveness checks can only be done as an optimization 
(servers can fail after the check is done)
- There should be one source of truth
- Should be compatible with master failover, and concurrent region 
operations(split, RS failover, balancer, etc)
- AM should guarantee that a region can be hosted by a single region server at 
any given time
- AM should be understandable by simple human beings like myself
- Actions for AM should be logged (possibly separately). We would like to be 
able to construct the history for the regions from logs or some persisted 
state. 
- Assignment should be performant and parallelizable. We should target handling 
millions of regions and thousands of servers. A single region assignment should 
complete under 1 sec. (1PB data with 1 GB regions  = 1M regions)
- No master abort when a region’s state cannot be determined. This results in 
support cases where master cannot start, and without master things become even 
worse. We should “quarantine” the regions if needed absolutely.  


> Generic framework for Master-coordinated tasks
> --
>
> Key: HBASE-5487
> URL: https://issues.apache.org/jira/browse/HBASE-5487
> Project: HBase
>  Issue Type: New Feature
>  Components: master, regionserver, Zookeeper
>Affects Versions: 0.94.0
>Reporter: Mubarak Seyed
>Priority: Critical
> Attachments: Region management in Master.pdf
>
>
> Need a framework to execute master-coordinated tasks in a fault-tolerant 
> manner. 
> Master-coordinated tasks such as online-scheme change and delete-range 
> (deleting region(s) based on start/end key) can make use of this framework.
> The advantages of framework are
> 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
> master-coordinated tasks
> 2. Ability to abstract the common functions across Master -> ZK and RS -> ZK
> 3. Easy to plugin new master-coordinated tasks without adding code to core 
> components



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread chunhui shen (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796357#comment-13796357
 ] 

chunhui shen commented on HBASE-9754:
-

Patch looks good.
Agree with eliminating the threadlocals

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9773:
---

   Resolution: Fixed
Fix Version/s: 0.96.1
   0.98.0
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

Integrated into trunk and 0.96.  Thanks Devaraj for finding the issue. Thanks 
Stack, Ted for reviewing the patch.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9769:
-

Attachment: 9769-0.94-sample1.txt

With VERSIONS=1 only. I think I like this better. User declares to only be 
interested in 1 version, in that case we use SKIP and INCLUDE and not column 
seeking.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample1.txt, 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796341#comment-13796341
 ] 

Jimmy Xiang edited comment on HBASE-9773 at 10/16/13 2:43 AM:
--

[~saint@gmail.com], if it is null, the state is managed by the caller so we 
leave it to the caller. If the transitionInZK is not set, the caller is just to 
do the best to close the region just in case to avoid double assignment. So we 
don't check the actual state here.


was (Author: jxiang):
[~saint@gmail.com], if it is null, the state is managed by the caller so we 
leave it to the caller.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796348#comment-13796348
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

Lars, HTable can have small number of versions and large number of  column 
qualifiers or large values (say 100K). 

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796346#comment-13796346
 ] 

Lars Hofhansl edited comment on HBASE-9769 at 10/16/13 2:38 AM:


Here's a *sample* of what I mean.
10 is hardcoded. Alternatively we could only do this when VERSIONS=1, so this 
way the user would naturally provide a hint about how many version (s)he 
expects to keep and we use that hint.



was (Author: lhofhansl):
Here's a *sample* of what I mean.
10 is hardcoded. Alternatively we could only do this when VERSIONS=1, so way 
the use would naturally provide a hint about how many version (s)he expects to 
keep and we use that hint.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9771) [WebUI] Humanize store and blockcache statistics on RS

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796347#comment-13796347
 ] 

Hudson commented on HBASE-9771:
---

SUCCESS: Integrated in HBase-TRUNK #4621 (See 
[https://builds.apache.org/job/HBase-TRUNK/4621/])
HBASE-9771 [WebUI] Humanize store and blockcache statistics on RS (ndimiduk: 
rev 1532598)
* 
/hbase/trunk/hbase-hadoop-compat/src/main/java/org/apache/hadoop/hbase/regionserver/MetricsRegionServerWrapper.java
* 
/hbase/trunk/hbase-server/src/main/jamon/org/apache/hadoop/hbase/tmpl/regionserver/ServerMetricsTmpl.jamon


> [WebUI] Humanize store and blockcache statistics on RS
> --
>
> Key: HBASE-9771
> URL: https://issues.apache.org/jira/browse/HBASE-9771
> Project: HBase
>  Issue Type: Improvement
>  Components: UI
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Trivial
> Fix For: 0.98.0, 0.96.1
>
> Attachments: HBASE-9771.00.patch, HBASE-9771.01.patch, 
> HBASE-9771.02.patch
>
>
> Some statistics reported on webui don't include hints about what the unit is, 
> leaving people guessing about what they mean. Clean them up.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9769:
-

Attachment: 9769-0.94-sample.txt

Here's a *sample* of what I mean.
10 is hardcoded. Alternatively we could only do this when VERSIONS=1, so way 
the use would naturally provide a hint about how many version (s)he expects to 
keep and we use that hint.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Attachments: 9769-0.94-sample.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796345#comment-13796345
 ] 

Lars Hofhansl commented on HBASE-9769:
--

Another idea I had was to make use of the column family's VERSIONS setting. If 
it is "small" use INCLUDE and SKIP in the ExplicitColumnTracker, otherwise use 
INCLUDE_AND_SEEK_NEXT_COL and SEEK_NEXT_COL. In my tests this yields a nice 
improvement bringing ExplicitColumnTracker on par with 
ScanWildcardColumnTracker. For now I defined "small" as 10, but that needs to 
be tested more.


> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796341#comment-13796341
 ] 

Jimmy Xiang commented on HBASE-9773:


[~saint@gmail.com], if it is null, the state is managed by the caller so we 
leave it to the caller.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796337#comment-13796337
 ] 

Hadoop QA commented on HBASE-9776:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608617/hbase-9776.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 6 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7558//console

This message is automatically generated.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796331#comment-13796331
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

Its a server-side filter and is not meant to be exposed to HBase client. The 
reason: it has only list of qualifiers - no columns. It is instantiated in 
StoreScanner. If Scan has already filter, the new FilterList is created with 
MUST_PASS_ALL operator. First goes  ExplicitColumnsFilter then existing filter.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread chunhui shen (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796311#comment-13796311
 ] 

chunhui shen edited comment on HBASE-9769 at 10/16/13 1:45 AM:
---

Move the logic of above patch to Scan class, is it also OK?

It means adding the ExplicitColumnsFilter in Scan.java when setting the 
attribute "SCAN-SMALL-ROWS"

In addition, I worry about the data correctness if Scan already has complex 
filters.


was (Author: zjushch):
Move the logic of above patch to Scan class, is it also OK?

It means adding the ExplicitColumnsFilter in Scan.java when setting the 
attribute "SCAN-SMALL-ROWS"

In addition, I'm worry the data correctness if Scan already has complex filters.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread chunhui shen (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796311#comment-13796311
 ] 

chunhui shen commented on HBASE-9769:
-

Move the logic of above patch to Scan class, is it also OK?

It means adding the ExplicitColumnsFilter in Scan.java when setting the 
attribute "SCAN-SMALL-ROWS"

In addition, I'm worry the data correctness if Scan already has complex filters.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796305#comment-13796305
 ] 

Jeffrey Zhong commented on HBASE-9776:
--

[~saint@gmail.com] I'm afraid you hit a different issue. The above stack 
trace you posted seems we have a half done deletion before and subsequent 
retries all failed because of that.  Since delete/disable/create table 
operations aren't idempotent, executeCallable on these table operations is 
problematic. I guess we need a FATE like model for  table operations.


> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9775) Client write path perf issues

2013-10-15 Thread Elliott Clark (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796294#comment-13796294
 ] 

Elliott Clark commented on HBASE-9775:
--

YCSB with 33 nodes fails completely. with:

{code}
com.yahoo.ycsb.DBException: 
org.apache.hadoop.hbase.client.RetriesExhaustedWithDetailsException: Failed 
8608 actions: IOException: 8608 times, 
at com.yahoo.ycsb.db.HBaseClient.cleanup(HBaseClient.java:113)
at com.yahoo.ycsb.DBWrapper.cleanup(DBWrapper.java:73)
at com.yahoo.ycsb.ClientThread.run(Client.java:307)
Caused by: org.apache.hadoop.hbase.client.RetriesExhaustedWithDetailsException: 
Failed 8608 actions: IOException: 8608 times, 
at 
org.apache.hadoop.hbase.client.AsyncProcess$BatchErrors.makeException(AsyncProcess.java:185)
at 
org.apache.hadoop.hbase.client.AsyncProcess$BatchErrors.access$500(AsyncProcess.java:169)
at 
org.apache.hadoop.hbase.client.AsyncProcess.getErrors(AsyncProcess.java:782)
at 
org.apache.hadoop.hbase.client.HTable.backgroundFlushCommits(HTable.java:934)
at org.apache.hadoop.hbase.client.HTable.flushCommits(HTable.java:1193)
at com.yahoo.ycsb.db.HBaseClient.cleanup(HBaseClient.java:108)
... 2 more
{code}

running:

{code}
#!/bin/bash
OPERTATIONS=2000
RECORDS=$(($OPERTATIONS*10))

echo "disable 'usertable'; drop 'usertable'" | hbase shell
echo "create 'usertable',{NAME => 'd', VERSIONS => 1,COMPRESSION => 'lz4', 
BLOCKCACHE => true}" | hbase shell
echo "balancer" | hbase shell
sleep 10
cd /home/eclark/RC5/ycsb/
bin/ycsb load hbase -P workloads/workloada -p columnfamily=d -s -threads 32 -p 
recordcount=$RECORDS 2> /tmp/ycsb-workload-a-load-output.log
{code}


I've also seen the same stack and exception when trying to run ITBLL on this 
cluster.

> Client write path perf issues
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
>Priority: Critical
> Attachments: Charts Search   Cloudera Manager.png, short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9775) Client write path perf issues

2013-10-15 Thread Elliott Clark (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elliott Clark updated HBASE-9775:
-

Summary: Client write path perf issues  (was: Client write path scales very 
badly with more servers)

> Client write path perf issues
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
>Priority: Critical
> Attachments: Charts Search   Cloudera Manager.png, short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9753) Excessive readpoint checks in MemstoreScanner

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796284#comment-13796284
 ] 

Hudson commented on HBASE-9753:
---

FAILURE: Integrated in hbase-0.96-hadoop2 #89 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/89/])
HBASE-9753 Excessive readpoint checks in MemstoreScanner (tedyu: rev 1532431)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/MemStore.java


> Excessive readpoint checks in MemstoreScanner
> -
>
> Key: HBASE-9753
> URL: https://issues.apache.org/jira/browse/HBASE-9753
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9753-0.94.txt, 9753-v1.txt
>
>
> Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796285#comment-13796285
 ] 

Hudson commented on HBASE-9570:
---

FAILURE: Integrated in hbase-0.96-hadoop2 #89 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/89/])
HBASE-9570 With AccessDeniedException, HBase shell would be better to just 
display the error message to be user friendly (Yang Wang) (ndimiduk: rev 
1532398)
* /hbase/branches/0.96/hbase-shell/src/main/ruby/shell/commands.rb


> With AccessDeniedException, HBase shell would be better to just display the 
> error message to be user friendly
> -
>
> Key: HBASE-9570
> URL: https://issues.apache.org/jira/browse/HBASE-9570
> Project: HBase
>  Issue Type: Improvement
>  Components: shell
>Reporter: Yang Wang
>Assignee: Yang Wang
>Priority: Minor
> Fix For: 0.98.0, 0.96.1
>
> Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch, 
> HBASE-9570-0.96.01.patch
>
>
> When access unauthorized resource like table, AccessDeniedException will be 
> thrown. In HBase shell, the error message with stack trace will be displayed 
> as follows. It would be better to just display the error message avoiding the 
> stack trace to be user friendly. 
> {noformat}
> ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
> permissions for user 'u1' for scanner open on table t1
> at 
> org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
> at 
> org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
> at 
> org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
> at 
> org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
> at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
> at 
> org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
> at 
> org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
> at 
> org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
> at 
> org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
> at java.lang.Thread.run(Thread.java:662)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9753) Excessive readpoint checks in MemstoreScanner

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796282#comment-13796282
 ] 

Hudson commented on HBASE-9753:
---

FAILURE: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #795 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/795/])
HBASE-9753 Excessive readpoint checks in MemstoreScanner (tedyu: rev 1532432)
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/MemStore.java


> Excessive readpoint checks in MemstoreScanner
> -
>
> Key: HBASE-9753
> URL: https://issues.apache.org/jira/browse/HBASE-9753
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9753-0.94.txt, 9753-v1.txt
>
>
> Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9766) HFileV3 - Optional tags write and read is not working as expected

2013-10-15 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796280#comment-13796280
 ] 

Andrew Purtell commented on HBASE-9766:
---

+1 on patch v2

> HFileV3 - Optional tags write and read is not working as expected
> -
>
> Key: HBASE-9766
> URL: https://issues.apache.org/jira/browse/HBASE-9766
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.98.0
>Reporter: Anoop Sam John
>Assignee: Anoop Sam John
> Fix For: 0.98.0
>
> Attachments: HBASE-9766.patch, HBASE-9766_V2.patch
>
>
> In the writer V3 includesTags always comes as true only and so writing tags 
> length always even when compaction selection says not to do so.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9777) Two consecutive RS crashes could lead to their SSH stepping on each other's toes and cause master abort

2013-10-15 Thread Devaraj Das (JIRA)
Devaraj Das created HBASE-9777:
--

 Summary: Two consecutive RS crashes could lead to their SSH 
stepping on each other's toes and cause master abort
 Key: HBASE-9777
 URL: https://issues.apache.org/jira/browse/HBASE-9777
 Project: HBase
  Issue Type: Bug
Reporter: Devaraj Das


Here is the sequence of events:
1. Master assigns regions to some server RS1. One particular region is 
300d71b112325d43b99b6148ec7bc5b3
2. RS1 crashes
3. Master tries to bulk-reassign (this has retries as well) the regions to 
other RSs. Let's say one of them is RS2.
{noformat}
2013-10-14 21:16:22,218 INFO  
[hor13n02.gq1.ygridcore.net,6,1381784464025-GeneralBulkAssigner-0] 
master.RegionStates: Transitioned {300d71b112325d43b99b6148ec7bc5b3 
state=OFFLINE, ts=1381785382125, server=null} to 
{300d71b112325d43b99b6148ec7bc5b3 state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417}
{noformat}
4. RS2 crashes
5. The ServerShutdownHandler for RS2 is executed, and it tries to reassign the 
regions.
{noformat}
2013-10-14 21:16:32,185 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-3] 
master.RegionStates: Found opening region {300d71b112325d43b99b6148ec7bc5b3 
state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417} to be reassigned by SSH 
for hor13n04.gq1.ygridcore.net,60020,1381784772417
{noformat}
6. (5) succeeds. The region states are made OPEN.
7. The retry from (3) kicks in 
{noformat}
2013-10-14 21:16:22,222 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-1] 
master.GeneralBulkAssigner: Failed assigning 52 regions to server 
hor13n04.gq1.ygridcore.net,60020,1381784772417, reassigning them
{noformat}
8. The retry finds some region state as OPEN, and the master aborts with the 
stack trace:
{noformat}
2013-10-14 21:16:34,342 FATAL AM.-pool1-t46 master.HMaster: Unexpected state :
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596} .. Cannot transit it to 
OFFLINE.
java.lang.IllegalStateException: Unexpected state : 
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596}
.. Cannot transit it to OFFLINE.
at 
org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:2074)
at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1855)
at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1449)
at org.apache.hadoop.hbase.master.AssignCallable.call(AssignCallable.java:45)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
{noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9777) Two consecutive RS crashes could lead to their SSH stepping on each other's toes and cause master abort

2013-10-15 Thread Devaraj Das (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HBASE-9777:
---

Description: 
Here is the sequence of events (with a version of 0.96 very close to RC5 
version created on 10/11):
1. Master assigns regions to some server RS1. One particular region is 
300d71b112325d43b99b6148ec7bc5b3
2. RS1 crashes
3. Master tries to bulk-reassign (this has retries as well) the regions to 
other RSs. Let's say one of them is RS2.
{noformat}
2013-10-14 21:16:22,218 INFO  
[hor13n02.gq1.ygridcore.net,6,1381784464025-GeneralBulkAssigner-0] 
master.RegionStates: Transitioned {300d71b112325d43b99b6148ec7bc5b3 
state=OFFLINE, ts=1381785382125, server=null} to 
{300d71b112325d43b99b6148ec7bc5b3 state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417}
{noformat}
4. RS2 crashes
5. The ServerShutdownHandler for RS2 is executed, and it tries to reassign the 
regions.
{noformat}
2013-10-14 21:16:32,185 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-3] 
master.RegionStates: Found opening region {300d71b112325d43b99b6148ec7bc5b3 
state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417} to be reassigned by SSH 
for hor13n04.gq1.ygridcore.net,60020,1381784772417
{noformat}
6. (5) succeeds. The region states are made OPEN.
7. The retry from (3) kicks in 
{noformat}
2013-10-14 21:16:22,222 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-1] 
master.GeneralBulkAssigner: Failed assigning 52 regions to server 
hor13n04.gq1.ygridcore.net,60020,1381784772417, reassigning them
{noformat}
8. The retry finds some region state as OPEN, and the master aborts with the 
stack trace:
{noformat}
2013-10-14 21:16:34,342 FATAL AM.-pool1-t46 master.HMaster: Unexpected state :
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596} .. Cannot transit it to 
OFFLINE.
java.lang.IllegalStateException: Unexpected state : 
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596}
.. Cannot transit it to OFFLINE.
at 
org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:2074)
at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1855)
at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1449)
at org.apache.hadoop.hbase.master.AssignCallable.call(AssignCallable.java:45)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
{noformat}

  was:
Here is the sequence of events:
1. Master assigns regions to some server RS1. One particular region is 
300d71b112325d43b99b6148ec7bc5b3
2. RS1 crashes
3. Master tries to bulk-reassign (this has retries as well) the regions to 
other RSs. Let's say one of them is RS2.
{noformat}
2013-10-14 21:16:22,218 INFO  
[hor13n02.gq1.ygridcore.net,6,1381784464025-GeneralBulkAssigner-0] 
master.RegionStates: Transitioned {300d71b112325d43b99b6148ec7bc5b3 
state=OFFLINE, ts=1381785382125, server=null} to 
{300d71b112325d43b99b6148ec7bc5b3 state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417}
{noformat}
4. RS2 crashes
5. The ServerShutdownHandler for RS2 is executed, and it tries to reassign the 
regions.
{noformat}
2013-10-14 21:16:32,185 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-3] 
master.RegionStates: Found opening region {300d71b112325d43b99b6148ec7bc5b3 
state=PENDING_OPEN, ts=1381785382218, 
server=hor13n04.gq1.ygridcore.net,60020,1381784772417} to be reassigned by SSH 
for hor13n04.gq1.ygridcore.net,60020,1381784772417
{noformat}
6. (5) succeeds. The region states are made OPEN.
7. The retry from (3) kicks in 
{noformat}
2013-10-14 21:16:22,222 INFO  [MASTER_SERVER_OPERATIONS-hor13n02:6-1] 
master.GeneralBulkAssigner: Failed assigning 52 regions to server 
hor13n04.gq1.ygridcore.net,60020,1381784772417, reassigning them
{noformat}
8. The retry finds some region state as OPEN, and the master aborts with the 
stack trace:
{noformat}
2013-10-14 21:16:34,342 FATAL AM.-pool1-t46 master.HMaster: Unexpected state :
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596} .. Cannot transit it to 
OFFLINE.
java.lang.IllegalStateException: Unexpected state : 
{300d71b112325d43b99b6148ec7bc5b3 state=OPEN, ts=1381785392864, 
server=hor13n08.gq1.ygridcore.net,60020,1381785385596}
.. Cannot transit it to OFFLINE.
at 
org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:2074)
at 
org.apa

[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796269#comment-13796269
 ] 

stack commented on HBASE-9776:
--

+1 on trying the patch.  We'll soon see if it works or not.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796267#comment-13796267
 ] 

stack commented on HBASE-9776:
--

You mean this?
{code}
Deleting table IntegrationTestLoadAndVerify 
2013-10-13 07:42:32,410 INFO  [main] hbase.HBaseCluster: Restoring cluster - 
started
2013-10-13 07:42:32,414 INFO  [main] hbase.HBaseCluster: Added new HBaseAdmin
2013-10-13 07:42:32,414 INFO  [main] hbase.HBaseCluster: Restoring cluster - 
done
2013-10-13 07:42:32,414 ERROR [main] util.AbstractHBaseTool: Error running 
command-line tool
org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after 
attempts=35, exceptions:
Sun Oct 13 07:37:26 PDT 2013, 
org.apache.hadoop.hbase.client.RpcRetryingCaller@511be529, 
org.apache.hadoop.hbase.ipc.RemoteWithExtrasException(java.io.IOException): 
java.io.IOException: HTableDescriptor missing for IntegrationTestLoadAndVerify
at 
org.apache.hadoop.hbase.master.handler.TableEventHandler.getTableDescriptor(TableEventHandler.java:231)
at 
org.apache.hadoop.hbase.master.handler.DeleteTableHandler.prepareWithTableLock(DeleteTableHandler.java:58)
at 
org.apache.hadoop.hbase.master.handler.TableEventHandler.prepare(TableEventHandler.java:93)
at org.apache.hadoop.hbase.master.HMaster.deleteTable(HMaster.java:1816)
at org.apache.hadoop.hbase.master.HMaster.deleteTable(HMaster.java:1826)
at 
org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:38213)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2146)
at 
org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1851)

Sun Oct 13 07:37:26 PDT 2013, 
org.apache.hadoop.hbase.client.RpcRetryingCaller@511be529, 
org.apache.hadoop.hbase.ipc.RemoteWithExtrasException(java.io.IOException): 
java.io.IOException: HTableDescriptor missing for IntegrationTestLoadAndVerify
at 
org.apache.hadoop.hbase.master.handler.TableEventHandler.getTableDescriptor(TableEventHandler.java:231)
at 
org.apache.hadoop.hbase.master.handler.DeleteTableHandler.prepareWithTableLock(DeleteTableHandler.java:58)
at 
org.apache.hadoop.hbase.master.handler.TableEventHandler.prepare(TableEventHandler.java:93)
at org.apache.hadoop.hbase.master.HMaster.deleteTable(HMaster.java:1816)
at org.apache.hadoop.hbase.master.HMaster.deleteTable(HMaster.java:1826)
at 
org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:38213)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2146)
at 
org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1851)


{code}

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-9776:
-

Attachment: hbase-9776.patch

The fix is simple and use HBaseTestingUtility#deleteTable to delete a table in 
clean up phase. The utility deleteTable ignores the TableNotEnabledException 
and proceed with deletion.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread Elliott Clark (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elliott Clark updated HBASE-9775:
-

Priority: Critical  (was: Major)

> Client write path scales very badly with more servers
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
>Priority: Critical
> Attachments: Charts Search   Cloudera Manager.png, short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread Elliott Clark (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elliott Clark updated HBASE-9775:
-

Attachment: Charts Search   Cloudera Manager.png

Here's the network.  The client barely reaches 1.2 gbit of a possible 10 gbit.

We're also seeing a pretty huge slow down vs 0.94.  Something in the order of 
15% slower on the write path (the write path is better or the same).

So I'm thinking that it's got to be something in the AsyncProcess code.  
Batching seems suspect but I'm running more tests to check.  For me that huge a 
pref loss and a scaling loss are pretty huge issues.

> Client write path scales very badly with more servers
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
> Attachments: Charts Search   Cloudera Manager.png, short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-9776:
-

Status: Patch Available  (was: Open)

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
> Attachments: hbase-9776.patch
>
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796254#comment-13796254
 ] 

Hadoop QA commented on HBASE-9773:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608601/trunk-9773_v2.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7557//console

This message is automatically generated.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbas

[jira] [Updated] (HBASE-9606) Apply small scan to meta scan where rowLimit is low

2013-10-15 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9606:
--

Resolution: Fixed
Status: Resolved  (was: Patch Available)

> Apply small scan to meta scan where rowLimit is low
> ---
>
> Key: HBASE-9606
> URL: https://issues.apache.org/jira/browse/HBASE-9606
> Project: HBase
>  Issue Type: Improvement
>Reporter: Ted Yu
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9606-v2.txt, small-v3.txt
>
>
> HBASE-9488 added the feature for small scan where RPC calls are reduced.
> We can apply small scan to MetaScanner#metaScan() where rowLimit is low.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796252#comment-13796252
 ] 

stack commented on HBASE-9773:
--

Patch looks good [~jxiang]  Should the state check be tighter than just 
checking for null?  What could it be?  What would be legit and what not?

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9544) Remove TestAdmin#testIsEnabledOnNonexistentTable()

2013-10-15 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9544:
--

  Resolution: Fixed
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

> Remove TestAdmin#testIsEnabledOnNonexistentTable()
> --
>
> Key: HBASE-9544
> URL: https://issues.apache.org/jira/browse/HBASE-9544
> Project: HBase
>  Issue Type: Test
>Reporter: Ted Yu
>Assignee: Ted Yu
>Priority: Minor
> Fix For: 0.98.0
>
> Attachments: 9544.txt
>
>
> TestAdmin#testIsEnabledOnNonexistentTable() is no longer needed
> TestAdmin#testIsEnabledOrDisabledOnUnknownTable() covers the above test



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-15 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796249#comment-13796249
 ] 

Ted Yu commented on HBASE-9754:
---

In StoreScanner.java, I replaced the initial value with the following and reran 
test suite:
{code}
+  private long readPt = Long.MAX_VALUE;
{code}
The suite passed - readPt would be assigned correctly in ctor's so its initial 
value doesn't matter.

> Consider eliminating threadlocals from MVCC code
> 
>
> Key: HBASE-9754
> URL: https://issues.apache.org/jira/browse/HBASE-9754
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0
>
> Attachments: 9754-rp-0.txt
>
>
> Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
> Currently we use ThreadLocals to communicate the current readpoint between a 
> RegionScanner and the Store\{File}Scanner's down the stack.
> Since ThreadLocals are not cheap we should consider whether it is possible to 
> pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796250#comment-13796250
 ] 

stack commented on HBASE-9775:
--

What you suspect the issue Mr [~eclark]?  Client.  The way we are batching up 
edits in client and sending them across?

> Client write path scales very badly with more servers
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
> Attachments: short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796251#comment-13796251
 ] 

Ted Yu commented on HBASE-9773:
---

+1

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796245#comment-13796245
 ] 

Hadoop QA commented on HBASE-9773:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608601/trunk-9773_v2.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7556//console

This message is automatically generated.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbas

[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Vladimir Rodionov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796242#comment-13796242
 ] 

Vladimir Rodionov commented on HBASE-9769:
--

In 0.94.12 the difference is not so dramatic as in 0.94.6 but still exists:

default: 500K rows per sec
filter-based: 1.2M rows per sec

It seems that there is performance regression in scan filters in 0.94.12. The 
code which gives me almost 1.5M in 0.94.6 runs only 1.2M rows per sec in 
0.94.12.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9771) [WebUI] Humanize store and blockcache statistics on RS

2013-10-15 Thread Nick Dimiduk (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9771:


   Resolution: Fixed
Fix Version/s: 0.96.1
   0.98.0
 Release Note: Committed to 0.96 and trunk. Thanks for the review Elliott.
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

> [WebUI] Humanize store and blockcache statistics on RS
> --
>
> Key: HBASE-9771
> URL: https://issues.apache.org/jira/browse/HBASE-9771
> Project: HBase
>  Issue Type: Improvement
>  Components: UI
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Trivial
> Fix For: 0.98.0, 0.96.1
>
> Attachments: HBASE-9771.00.patch, HBASE-9771.01.patch, 
> HBASE-9771.02.patch
>
>
> Some statistics reported on webui don't include hints about what the unit is, 
> leaving people guessing about what they mean. Clean them up.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796208#comment-13796208
 ] 

Jeffrey Zhong commented on HBASE-9776:
--

Yes, the disableTable is not idempotent operations because the subsequent retry 
fails while we can't remove the enable state check because we use it to sync 
operations like: one is trying to do schema changes and the other is trying to 
delete the same table. 

My plan is to use HBaseTestingUtility#deleteTable instead to let the 
application client to eat the exception if happens and proceed with delete 
because we're in clean up phase.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-9776:
-

Description: 
Occasionally IntegrationTestLoadAndVerify failed with the following error. This 
is caused by RPC retry and the first attempt actually went through successfully 
and the second retry attempt fails because the table is disabled by the first 
attempt.

{code}
2013-10-10 
19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
 org.apache.hadoop.hbase.TableNotEnabledException: IntegrationTestLoadAndVerify
2013-10-10 19:55:54,340|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
2013-10-10 19:55:54,341|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
2013-10-10 19:55:54,342|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
{code}

  was:
Occasionally IntegrationTestLoadAndVerify failed with the following error. This 
is caused by RPC retry and the first attempt actually went through 
successfully.  

{code}
2013-10-10 
19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
 org.apache.hadoop.hbase.TableNotEnabledException: IntegrationTestLoadAndVerify
2013-10-10 19:55:54,340|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
2013-10-10 19:55:54,341|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
2013-10-10 19:55:54,342|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
{code}


> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully and the second retry attempt fails because the table is disabled 
> by the first attempt.
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Elliott Clark (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796148#comment-13796148
 ] 

Elliott Clark commented on HBASE-9776:
--

Yep we're seeing this too on our clusters.  You can work around this by turning 
off deleting the table on the test and using the shell.  But the async delete 
just seems broken right now.

> Test Load And Verify Fails with TableNotEnabledException
> 
>
> Key: HBASE-9776
> URL: https://issues.apache.org/jira/browse/HBASE-9776
> Project: HBase
>  Issue Type: Bug
>  Components: test
>Affects Versions: 0.96.0
>Reporter: Jeffrey Zhong
>Assignee: Jeffrey Zhong
>Priority: Minor
>
> Occasionally IntegrationTestLoadAndVerify failed with the following error. 
> This is caused by RPC retry and the first attempt actually went through 
> successfully.  
> {code}
> 2013-10-10 
> 19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
>  org.apache.hadoop.hbase.TableNotEnabledException: 
> IntegrationTestLoadAndVerify
> 2013-10-10 19:55:54,340|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
> 2013-10-10 19:55:54,341|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
> 2013-10-10 19:55:54,342|beaver.machine|INFO|at 
> org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
> {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Comment Edited] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread Elliott Clark (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795787#comment-13795787
 ] 

Elliott Clark edited comment on HBASE-9775 at 10/15/13 11:20 PM:
-

Here's what I'm seeing on a largish test cluster.  77 Nodes  (the pinkish red) 
has about the same throughput as the 31 node cluster (blue).  They are the 
exact same machines, I just took away two racks.

This is YCSB loads.


was (Author: eclark):
Here's what I'm seeing on a largish test cluster.  77 Nodes  (the pinkish red) 
has about the same throughput as the 31 node cluster (blue).  They are the 
exact same machines, I just took away two racks.

> Client write path scales very badly with more servers
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
> Attachments: short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread Elliott Clark (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elliott Clark updated HBASE-9775:
-

Attachment: short_ycsb.png

Here's what I'm seeing on a largish test cluster.  77 Nodes  (the pinkish red) 
has about the same throughput as the 31 node cluster (blue).  They are the 
exact same machines, I just took away two racks.

> Client write path scales very badly with more servers
> -
>
> Key: HBASE-9775
> URL: https://issues.apache.org/jira/browse/HBASE-9775
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.96.0
>Reporter: Elliott Clark
> Attachments: short_ycsb.png
>
>
> Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9775) Client write path scales very badly with more servers

2013-10-15 Thread Elliott Clark (JIRA)
Elliott Clark created HBASE-9775:


 Summary: Client write path scales very badly with more servers
 Key: HBASE-9775
 URL: https://issues.apache.org/jira/browse/HBASE-9775
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.96.0
Reporter: Elliott Clark


Testing on larger clusters has not had the desired throughput increases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9776) Test Load And Verify Fails with TableNotEnabledException

2013-10-15 Thread Jeffrey Zhong (JIRA)
Jeffrey Zhong created HBASE-9776:


 Summary: Test Load And Verify Fails with TableNotEnabledException
 Key: HBASE-9776
 URL: https://issues.apache.org/jira/browse/HBASE-9776
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 0.96.0
Reporter: Jeffrey Zhong
Assignee: Jeffrey Zhong
Priority: Minor


Occasionally IntegrationTestLoadAndVerify failed with the following error. This 
is caused by RPC retry and the first attempt actually went through 
successfully.  

{code}
2013-10-10 
19:55:54,339|beaver.machine|INFO|org.apache.hadoop.hbase.TableNotEnabledException:
 org.apache.hadoop.hbase.TableNotEnabledException: IntegrationTestLoadAndVerify
2013-10-10 19:55:54,340|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.handler.DisableTableHandler.prepare(DisableTableHandler.java:100)
2013-10-10 19:55:54,341|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1979)
2013-10-10 19:55:54,342|beaver.machine|INFO|at 
org.apache.hadoop.hbase.master.HMaster.disableTable(HMaster.java:1990)
{code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Created] (HBASE-9774) Provide a way for coprocessors to register and report custom metrics

2013-10-15 Thread Gary Helmling (JIRA)
Gary Helmling created HBASE-9774:


 Summary: Provide a way for coprocessors to register and report 
custom metrics
 Key: HBASE-9774
 URL: https://issues.apache.org/jira/browse/HBASE-9774
 Project: HBase
  Issue Type: New Feature
  Components: Coprocessors, metrics
Reporter: Gary Helmling


It would help provide better visibility into what coprocessors are doing if we 
provided a way for coprocessors to export their own metrics.  The general idea 
is to:

* extend access to the HBase "metrics bus" down into the coprocessor 
environments
* coprocessors can then register and increment custom metrics
* coprocessor metrics are then reported along with all others through normal 
mechanisms



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9771) [WebUI] Humanize store and blockcache statistics on RS

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795776#comment-13795776
 ] 

Hadoop QA commented on HBASE-9771:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608575/HBASE-9771.02.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7554//console

This message is automatically generated.

> [WebUI] Humanize store and blockcache statistics on RS
> --
>
> Key: HBASE-9771
> URL: https://issues.apache.org/jira/browse/HBASE-9771
> Project: HBase
>  Issue Type: Improvement
>  Components: UI
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Trivial
> Attachments: HBASE-9771.00.patch, HBASE-9771.01.patch, 
> HBASE-9771.02.patch
>
>
> Some statistics reported on webui don't include hints about what the unit is, 
> leaving people guessing about what they mean. Clean them up.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9773:
---

Attachment: trunk-9773_v2.patch

Attached a wrong patch file.  Here is the right one.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch, trunk-9773_v2.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795774#comment-13795774
 ] 

Hadoop QA commented on HBASE-9773:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608596/trunk-9773.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7555//console

This message is automatically generated.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795754#comment-13795754
 ] 

Jimmy Xiang commented on HBASE-9773:


It's introduced by HBASE-9514 I think.  I attached a patch to fix it, added a 
test too.

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
> Attachments: trunk-9773.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Assigned] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang reassigned HBASE-9773:
--

Assignee: Jimmy Xiang

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
>Assignee: Jimmy Xiang
> Attachments: trunk-9773.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9773:
---

Status: Patch Available  (was: Open)

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
> Attachments: trunk-9773.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Updated] (HBASE-9773) Master aborted when hbck asked the master to assign a region that was already online

2013-10-15 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9773:
---

Attachment: trunk-9773.patch

> Master aborted when hbck asked the master to assign a region that was already 
> online
> 
>
> Key: HBASE-9773
> URL: https://issues.apache.org/jira/browse/HBASE-9773
> Project: HBase
>  Issue Type: Bug
>Reporter: Devaraj Das
> Attachments: trunk-9773.patch
>
>
> Came across this situation (with a version of 0.96 very close to RC5 version 
> created on 10/11):
> The sequence of events that happened:
> 1. The hbck tool couldn't communicate with the RegionServer hosting namespace 
> region due to some security exceptions. hbck INCORRECTLY assumed the region 
> was not deployed.
> In output.log (client side):
> {noformat}
> 2013-10-12 10:42:57,067|beaver.machine|INFO|ERROR: Region { meta => 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., hdfs => 
> hdfs://gs-hdp2-secure-1381559462-hbase-12.cs1cloud.internal:8020/apps/hbase/data/data/hbase/namespace/a0ac0825ba2d0830614e7f808f31787a,
>  deployed =>  } not deployed on any region server.
> 2013-10-12 10:42:57,067|beaver.machine|INFO|Trying to fix unassigned region...
> {noformat}
> 2. This led to the hbck tool trying to tell the master to "assign" the region.
> In master log (hbase-hbase-master-gs-hdp2-secure-1381559462-hbase-12.log):
> {noformat}
> 2013-10-12 10:52:35,960 INFO  [RpcServer.handler=4,port=6] 
> master.HMaster: Client=hbase//172.18.145.105 assign 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 3. The master went through the steps - sent a CLOSE to the RegionServer 
> hosting namespace region.
> From master log:
> {noformat}
> 2013-10-12 10:52:35,981 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: Sent CLOSE to 
> gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794 for 
> region hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.
> {noformat}
> 4. The master then tried to assign the namespace region to a region server, 
> and in the process ABORTED:
> From master log:
> {noformat}
> 2013-10-12 10:52:36,025 DEBUG [RpcServer.handler=4,port=6] 
> master.AssignmentManager: No previous transition plan found (or ignoring an 
> existing plan) for 
> hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a.; generated 
> random 
> plan=hri=hbase:namespace,,1381564449706.a0ac0825ba2d0830614e7f808f31787a., 
> src=, 
> dest=gs-hdp2-secure-1381559462-hbase-9.cs1cloud.internal,60020,1381564439807; 
> 4 (online=4, available=4) available servers, forceNewPlan=true
> 2013-10-12 10:52:36,026 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Master server abort: loaded coprocessors are: 
> [org.apache.hadoop.hbase.security.access.AccessController]
> 2013-10-12 10:52:36,027 FATAL [RpcServer.handler=4,port=6] 
> master.HMaster: Unexpected state : {a0ac0825ba2d0830614e7f808f31787a 
> state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> java.lang.IllegalStateException: Unexpected state : 
> {a0ac0825ba2d0830614e7f808f31787a state=OPEN, ts=1381564451344, 
> server=gs-hdp2-secure-1381559462-hbase-1.cs1cloud.internal,60020,1381564439794}
>  .. Cannot transit it to OFFLINE.
> {noformat}
> {code}AssignmentManager.assign(HRegionInfo region, boolean setOfflineInZK, 
> boolean forceNewPlan){code} is the method that does all the above. This was 
> called from the HMaster with true for both the boolean arguments.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-15 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795737#comment-13795737
 ] 

Jonathan Hsieh commented on HBASE-5487:
---

[~sershe] Looking forward to it!  

> Generic framework for Master-coordinated tasks
> --
>
> Key: HBASE-5487
> URL: https://issues.apache.org/jira/browse/HBASE-5487
> Project: HBase
>  Issue Type: New Feature
>  Components: master, regionserver, Zookeeper
>Affects Versions: 0.94.0
>Reporter: Mubarak Seyed
>Priority: Critical
> Attachments: Region management in Master.pdf
>
>
> Need a framework to execute master-coordinated tasks in a fault-tolerant 
> manner. 
> Master-coordinated tasks such as online-scheme change and delete-range 
> (deleting region(s) based on start/end key) can make use of this framework.
> The advantages of framework are
> 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
> master-coordinated tasks
> 2. Ability to abstract the common functions across Master -> ZK and RS -> ZK
> 3. Easy to plugin new master-coordinated tasks without adding code to core 
> components



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9753) Excessive readpoint checks in MemstoreScanner

2013-10-15 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795732#comment-13795732
 ] 

Hudson commented on HBASE-9753:
---

SUCCESS: Integrated in hbase-0.96 #141 (See 
[https://builds.apache.org/job/hbase-0.96/141/])
HBASE-9753 Excessive readpoint checks in MemstoreScanner (tedyu: rev 1532431)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/MemStore.java


> Excessive readpoint checks in MemstoreScanner
> -
>
> Key: HBASE-9753
> URL: https://issues.apache.org/jira/browse/HBASE-9753
> Project: HBase
>  Issue Type: Bug
>Reporter: Lars Hofhansl
>Assignee: Ted Yu
> Fix For: 0.98.0, 0.94.13, 0.96.1
>
> Attachments: 9753-0.94.txt, 9753-v1.txt
>
>
> Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9748) Address outstanding comments raised for HBASE-9696

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795707#comment-13795707
 ] 

Hadoop QA commented on HBASE-9748:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608354/trunk-9748.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 15 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:red}-1 javadoc{color}.  The javadoc tool appears to have generated 1 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7553//console

This message is automatically generated.

> Address outstanding comments raised for HBASE-9696
> --
>
> Key: HBASE-9748
> URL: https://issues.apache.org/jira/browse/HBASE-9748
> Project: HBase
>  Issue Type: Bug
>Reporter: Jimmy Xiang
>Assignee: Jimmy Xiang
>Priority: Minor
> Fix For: 0.98.0, 0.96.1
>
> Attachments: trunk-9748.patch
>
>
> This is a follow-up issue of HBASE-9696.
> There are some outstanding review comments in 
> https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
> address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9769) Improve performance of a Scanner with explicit column list when rows are small/medium size

2013-10-15 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795694#comment-13795694
 ] 

Lars Hofhansl commented on HBASE-9769:
--

Also try with 0.94.12. The specific issue you're seeing might be fixed there.

> Improve performance of a Scanner with explicit column list when rows are 
> small/medium size
> --
>
> Key: HBASE-9769
> URL: https://issues.apache.org/jira/browse/HBASE-9769
> Project: HBase
>  Issue Type: Improvement
>  Components: Scanners
>Affects Versions: 0.98.0, 0.94.12, 0.96.0
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9771) [WebUI] Humanize store and blockcache statistics on RS

2013-10-15 Thread Elliott Clark (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795683#comment-13795683
 ] 

Elliott Clark commented on HBASE-9771:
--

+1 lgtm

> [WebUI] Humanize store and blockcache statistics on RS
> --
>
> Key: HBASE-9771
> URL: https://issues.apache.org/jira/browse/HBASE-9771
> Project: HBase
>  Issue Type: Improvement
>  Components: UI
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Trivial
> Attachments: HBASE-9771.00.patch, HBASE-9771.01.patch, 
> HBASE-9771.02.patch
>
>
> Some statistics reported on webui don't include hints about what the unit is, 
> leaving people guessing about what they mean. Clean them up.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-15 Thread Sergey Shelukhin (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795680#comment-13795680
 ] 

Sergey Shelukhin commented on HBASE-5487:
-

[~jmhsieh] I am writing out very detailed operation and failover descriptions 
right now :)

> Generic framework for Master-coordinated tasks
> --
>
> Key: HBASE-5487
> URL: https://issues.apache.org/jira/browse/HBASE-5487
> Project: HBase
>  Issue Type: New Feature
>  Components: master, regionserver, Zookeeper
>Affects Versions: 0.94.0
>Reporter: Mubarak Seyed
>Priority: Critical
> Attachments: Region management in Master.pdf
>
>
> Need a framework to execute master-coordinated tasks in a fault-tolerant 
> manner. 
> Master-coordinated tasks such as online-scheme change and delete-range 
> (deleting region(s) based on start/end key) can make use of this framework.
> The advantages of framework are
> 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
> master-coordinated tasks
> 2. Ability to abstract the common functions across Master -> ZK and RS -> ZK
> 3. Easy to plugin new master-coordinated tasks without adding code to core 
> components



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (HBASE-9772) Normalize new client default values

2013-10-15 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795679#comment-13795679
 ] 

Hadoop QA commented on HBASE-9772:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608553/HBASE-9772.00.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7552//console

This message is automatically generated.

> Normalize new client default values
> ---
>
> Key: HBASE-9772
> URL: https://issues.apache.org/jira/browse/HBASE-9772
> Project: HBase
>  Issue Type: Improvement
>  Components: Client
>Affects Versions: 0.98.0, 0.96.0
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
>Priority: Minor
> Attachments: HBASE-9772.00.patch
>
>
> The new AsyncProcess exposes a handful of configuration settings for 
> controlling the number and distribution of concurrent connections across 
> regionservers. Use default values consistently and bubble up their usage to 
> users.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


  1   2   3   >