date:20141230


 [ 
https://issues.apache.org/jira/browse/HBASE-12762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell reopened HBASE-12762:


No fix version for 0.98 was set on this issue but it was committed there. 

TestShell is failing on 0.98 since this change, please see 
https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/728 and 
https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/729 

 Region with no hfiles will have the highest locality cost in 
 LocalityCostFunction
 -

 Key: HBASE-12762
 URL: https://issues.apache.org/jira/browse/HBASE-12762
 Project: HBase
  Issue Type: Improvement
  Components: Balancer
Affects Versions: 0.99.2
Reporter: cuijianwei
Assignee: cuijianwei
Priority: Minor
 Fix For: 1.0.0, 2.0.0, 0.98.10, 1.1.0

 Attachments: HBASE-12762-trunk.patch


 The locality cost of region will be computed in LocalityCostFunction.cost as:
 {code}
 double cost() {
 ...
 int index = -1;
 for (int j = 0; j  regionLocations.length; j++) {
   if (regionLocations[j] = 0  regionLocations[j] == serverIndex) {
 index = j;
 break;
   }
 }
 if (index  0) {
   cost += 1;  // == region with no hfiles will have the highest cost
 } else {
   cost += (double) index / (double) regionLocations.length;
 }
 ...
 }
 {code}
 The region with no hfiles(such as empty region) will have the highest cost 
 which represents the worst case that region located in the server with no 
 locality for hfiles. However, this might be the best case because there are 
 no hlogs for the region. Although the absolute cost value won't affect the 
 balance process, will it be more reasonable to have zero cost for such 
 regions? such as:
 {code}
...
 if (index  0) {
   if (regionLocation.length  0) { //  == only consider regions with 
 hfiles
   cost += 1;
   }
 } else {
   cost += (double) index / (double) regionLocations.length;
 }
...
 {code} 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12762) Region with no hfiles will have the highest locality cost in LocalityCostFunction


[ 
https://issues.apache.org/jira/browse/HBASE-12762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260902#comment-14260902
 ] 

Andrew Purtell commented on HBASE-12762:


My mistake, please pardon, fix version is fine, it's (phone) operator error on 
my part. Test issue is real though. :-)

 Region with no hfiles will have the highest locality cost in 
 LocalityCostFunction
 -

 Key: HBASE-12762
 URL: https://issues.apache.org/jira/browse/HBASE-12762
 Project: HBase
  Issue Type: Improvement
  Components: Balancer
Affects Versions: 0.99.2
Reporter: cuijianwei
Assignee: cuijianwei
Priority: Minor
 Fix For: 1.0.0, 2.0.0, 0.98.10, 1.1.0

 Attachments: HBASE-12762-trunk.patch


 The locality cost of region will be computed in LocalityCostFunction.cost as:
 {code}
 double cost() {
 ...
 int index = -1;
 for (int j = 0; j  regionLocations.length; j++) {
   if (regionLocations[j] = 0  regionLocations[j] == serverIndex) {
 index = j;
 break;
   }
 }
 if (index  0) {
   cost += 1;  // == region with no hfiles will have the highest cost
 } else {
   cost += (double) index / (double) regionLocations.length;
 }
 ...
 }
 {code}
 The region with no hfiles(such as empty region) will have the highest cost 
 which represents the worst case that region located in the server with no 
 locality for hfiles. However, this might be the best case because there are 
 no hlogs for the region. Although the absolute cost value won't affect the 
 balance process, will it be more reasonable to have zero cost for such 
 regions? such as:
 {code}
...
 if (index  0) {
   if (regionLocation.length  0) { //  == only consider regions with 
 hfiles
   cost += 1;
   }
 } else {
   cost += (double) index / (double) regionLocations.length;
 }
...
 {code} 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12772) TestPerColumnFamilyFlush failing

2014-12-30 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-12772:
--
Attachment: 12772.addendum.txt

Flush the namespace table before we start adding edits so its hanging edit 
won't get in way of test.

Rather than wait 4 seconds, wait on the WAL log count to change (should change 
after successful flush).

Add timeouts on all tests.

Unable to test locally. Trying against hadoopqa.

 TestPerColumnFamilyFlush failing
 

 Key: HBASE-12772
 URL: https://issues.apache.org/jira/browse/HBASE-12772
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 1.0.0
Reporter: stack
 Attachments: 0001-HBASE-12772-TestPerColumnFamilyFlush-failing.patch, 
 12772.addendum.txt


 On internal rig see this failing in two places:
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling
 Failing for the past 1 build (Since Failed#653 )
 Took 9 sec.
 Error Message
 expected:424 but was:205744
 Stacktrace
 java.lang.AssertionError: expected:424 but was:205744
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling(TestPerColumnFamilyFlush.java:483)
 {code}
 and 
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplayWithDistributedReplay
 Failing for the past 1 build (Since Failed#653 )
 Took 25 ms.
 Error Message
 A mini-cluster is already running
 Stacktrace
 java.lang.IllegalStateException: A mini-cluster is already running
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:921)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:812)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:794)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:781)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplay(TestPerColumnFamilyFlush.java:337)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplayWithDistributedReplay(TestPerColumnFamilyFlush.java:418)
 {code}
 Opening issue to keep an eye on these tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12772) TestPerColumnFamilyFlush failing

2014-12-30 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-12772:
--
Assignee: stack
  Status: Patch Available  (was: Open)

 TestPerColumnFamilyFlush failing
 

 Key: HBASE-12772
 URL: https://issues.apache.org/jira/browse/HBASE-12772
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 1.0.0
Reporter: stack
Assignee: stack
 Attachments: 0001-HBASE-12772-TestPerColumnFamilyFlush-failing.patch, 
 12772.addendum.txt


 On internal rig see this failing in two places:
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling
 Failing for the past 1 build (Since Failed#653 )
 Took 9 sec.
 Error Message
 expected:424 but was:205744
 Stacktrace
 java.lang.AssertionError: expected:424 but was:205744
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling(TestPerColumnFamilyFlush.java:483)
 {code}
 and 
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplayWithDistributedReplay
 Failing for the past 1 build (Since Failed#653 )
 Took 25 ms.
 Error Message
 A mini-cluster is already running
 Stacktrace
 java.lang.IllegalStateException: A mini-cluster is already running
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:921)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:812)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:794)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniCluster(HBaseTestingUtility.java:781)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplay(TestPerColumnFamilyFlush.java:337)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplayWithDistributedReplay(TestPerColumnFamilyFlush.java:418)
 {code}
 Opening issue to keep an eye on these tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12728) buffered writes substantially less useful after removal of HTablePool


[ 
https://issues.apache.org/jira/browse/HBASE-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260911#comment-14260911
 ] 

Andrew Purtell commented on HBASE-12728:


My 0.02, it shouldn't be too late for 1.0. Table is a fundamental API and we 
should get it right for a 1.0 release. Let things slip if need be IMHO

 buffered writes substantially less useful after removal of HTablePool
 -

 Key: HBASE-12728
 URL: https://issues.apache.org/jira/browse/HBASE-12728
 Project: HBase
  Issue Type: Bug
  Components: hbase
Affects Versions: 0.98.0
Reporter: Aaron Beppu

 In previous versions of HBase, when use of HTablePool was encouraged, HTable 
 instances were long-lived in that pool, and for that reason, if autoFlush was 
 set to false, the table instance could accumulate a full buffer of writes 
 before a flush was triggered. Writes from the client to the cluster could 
 then be substantially larger and less frequent than without buffering.
 However, when HTablePool was deprecated, the primary justification seems to 
 have been that creating HTable instances is cheap, so long as the connection 
 and executor service being passed to it are pre-provided. A use pattern was 
 encouraged where users should create a new HTable instance for every 
 operation, using an existing connection and executor service, and then close 
 the table. In this pattern, buffered writes are substantially less useful; 
 writes are as small and as frequent as they would have been with 
 autoflush=true, except the synchronous write is moved from the operation 
 itself to the table close call which immediately follows.
 More concretely :
 ```
 // Given these two helpers ...
 private HTableInterface getAutoFlushTable(String tableName) throws 
 IOException {
   // (autoflush is true by default)
   return storedConnection.getTable(tableName, executorService);
 }
 private HTableInterface getBufferedTable(String tableName) throws IOException 
 {
   HTableInterface table = getAutoFlushTable(tableName);
   table.setAutoFlush(false);
   return table;
 }
 // it's my contention that these two methods would behave almost identically,
 // except the first will hit a synchronous flush during the put call,
 and the second will
 // flush during the (hidden) close call on table.
 private void writeAutoFlushed(Put somePut) throws IOException {
   try (HTableInterface table = getAutoFlushTable(tableName)) {
 table.put(somePut); // will do synchronous flush
   }
 }
 private void writeBuffered(Put somePut) throws IOException {
   try (HTableInterface table = getBufferedTable(tableName)) {
 table.put(somePut);
   } // auto-close will trigger synchronous flush
 }
 ```
 For buffered writes to actually provide a performance benefit to users, one 
 of two things must happen:
 - The writeBuffer itself shouldn't live, flush and die with the lifecycle of 
 it's HTableInstance. If the writeBuffer were managed elsewhere and had a long 
 lifespan, this could cease to be an issue. However, if the same writeBuffer 
 is appended to by multiple tables, then some additional concurrency control 
 will be needed around it.
 - Alternatively, there should be some pattern for having long-lived HTable 
 instances. However, since HTable is not thread-safe, we'd need multiple 
 instances, and a mechanism for leasing them out safely -- which sure sounds a 
 lot like the old HTablePool to me.
 See discussion on mailing list here : 
 http://mail-archives.apache.org/mod_mbox/hbase-user/201412.mbox/%3CCAPdJLkEzmUQZ_kvD%3D8mrxi4V%3DhCmUp3g9MUZsddD%2Bmon%2BAvNtg%40mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12728) buffered writes substantially less useful after removal of HTablePool


[ 
https://issues.apache.org/jira/browse/HBASE-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260920#comment-14260920
 ] 

Andrew Purtell commented on HBASE-12728:


FWIW, I recently rewrote the YCSB client for HBase because it was setting auto 
flush to off and using large write buffers to collect puts. Unfortunately 
although producing 'excellent' write latency measurements this lead to multiple 
threads flushing deep buffers more or less at the same time, resulting in long 
periods of write unavailability. I'm sure that was an unintended consequence. I 
believe that YCSB client code was written by HBase devs. An earlier era in any 
case, but my point is devs very familiar with the API can get into trouble 
never mind newcomers. Removing buffering from Table and moving it into 
BufferedTable with suitable advice in javadoc there sounds like a good idea to 
me. 

 buffered writes substantially less useful after removal of HTablePool
 -

 Key: HBASE-12728
 URL: https://issues.apache.org/jira/browse/HBASE-12728
 Project: HBase
  Issue Type: Bug
  Components: hbase
Affects Versions: 0.98.0
Reporter: Aaron Beppu

 In previous versions of HBase, when use of HTablePool was encouraged, HTable 
 instances were long-lived in that pool, and for that reason, if autoFlush was 
 set to false, the table instance could accumulate a full buffer of writes 
 before a flush was triggered. Writes from the client to the cluster could 
 then be substantially larger and less frequent than without buffering.
 However, when HTablePool was deprecated, the primary justification seems to 
 have been that creating HTable instances is cheap, so long as the connection 
 and executor service being passed to it are pre-provided. A use pattern was 
 encouraged where users should create a new HTable instance for every 
 operation, using an existing connection and executor service, and then close 
 the table. In this pattern, buffered writes are substantially less useful; 
 writes are as small and as frequent as they would have been with 
 autoflush=true, except the synchronous write is moved from the operation 
 itself to the table close call which immediately follows.
 More concretely :
 ```
 // Given these two helpers ...
 private HTableInterface getAutoFlushTable(String tableName) throws 
 IOException {
   // (autoflush is true by default)
   return storedConnection.getTable(tableName, executorService);
 }
 private HTableInterface getBufferedTable(String tableName) throws IOException 
 {
   HTableInterface table = getAutoFlushTable(tableName);
   table.setAutoFlush(false);
   return table;
 }
 // it's my contention that these two methods would behave almost identically,
 // except the first will hit a synchronous flush during the put call,
 and the second will
 // flush during the (hidden) close call on table.
 private void writeAutoFlushed(Put somePut) throws IOException {
   try (HTableInterface table = getAutoFlushTable(tableName)) {
 table.put(somePut); // will do synchronous flush
   }
 }
 private void writeBuffered(Put somePut) throws IOException {
   try (HTableInterface table = getBufferedTable(tableName)) {
 table.put(somePut);
   } // auto-close will trigger synchronous flush
 }
 ```
 For buffered writes to actually provide a performance benefit to users, one 
 of two things must happen:
 - The writeBuffer itself shouldn't live, flush and die with the lifecycle of 
 it's HTableInstance. If the writeBuffer were managed elsewhere and had a long 
 lifespan, this could cease to be an issue. However, if the same writeBuffer 
 is appended to by multiple tables, then some additional concurrency control 
 will be needed around it.
 - Alternatively, there should be some pattern for having long-lived HTable 
 instances. However, since HTable is not thread-safe, we'd need multiple 
 instances, and a mechanism for leasing them out safely -- which sure sounds a 
 lot like the old HTablePool to me.
 See discussion on mailing list here : 
 http://mail-archives.apache.org/mod_mbox/hbase-user/201412.mbox/%3CCAPdJLkEzmUQZ_kvD%3D8mrxi4V%3DhCmUp3g9MUZsddD%2Bmon%2BAvNtg%40mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12768) Support enable cache_data_on_write in Shell while creating table


[ 
https://issues.apache.org/jira/browse/HBASE-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260923#comment-14260923
 ] 

Andrew Purtell commented on HBASE-12768:


+1 for 0.98

 Support enable cache_data_on_write in Shell while creating table
 

 Key: HBASE-12768
 URL: https://issues.apache.org/jira/browse/HBASE-12768
 Project: HBase
  Issue Type: Improvement
  Components: shell
Affects Versions: 1.0.0, 2.0.0, 0.94.27
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Fix For: 1.0.0, 2.0.0, 0.98.10

 Attachments: HBASE-12768.patch


 A simple approach to support cache_data_on_write while creating table in 
 shell.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12768) Support enable cache_data_on_write in Shell while creating table

2014-12-30 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260926#comment-14260926
 ] 

ramkrishna.s.vasudevan commented on HBASE-12768:


Pushed to 0.98. Thanks @apurtell.

 Support enable cache_data_on_write in Shell while creating table
 

 Key: HBASE-12768
 URL: https://issues.apache.org/jira/browse/HBASE-12768
 Project: HBase
  Issue Type: Improvement
  Components: shell
Affects Versions: 1.0.0, 2.0.0, 0.94.27
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Fix For: 1.0.0, 2.0.0, 0.98.10

 Attachments: HBASE-12768.patch


 A simple approach to support cache_data_on_write while creating table in 
 shell.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12772) TestPerColumnFamilyFlush failing


[ 
https://issues.apache.org/jira/browse/HBASE-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260959#comment-14260959
 ] 

Hadoop QA commented on HBASE-12772:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12689451/12772.addendum.txt
  against master branch at commit 0513a21dc8b86f57b5a6c1b742904821632f77f7.
  ATTACHMENT ID: 12689451

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 checkstyle{color}.  The applied patch does not increase the 
total number of checkstyle errors

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   
org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Checkstyle Errors: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//artifact/patchprocess/checkstyle-aggregate.html

  Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12254//console

This message is automatically generated.

 TestPerColumnFamilyFlush failing
 

 Key: HBASE-12772
 URL: https://issues.apache.org/jira/browse/HBASE-12772
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 1.0.0
Reporter: stack
Assignee: stack
 Attachments: 0001-HBASE-12772-TestPerColumnFamilyFlush-failing.patch, 
 12772.addendum.txt


 On internal rig see this failing in two places:
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling
 Failing for the past 1 build (Since Failed#653 )
 Took 9 sec.
 Error Message
 expected:424 but was:205744
 Stacktrace
 java.lang.AssertionError: expected:424 but was:205744
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testFlushingWhenLogRolling(TestPerColumnFamilyFlush.java:483)
 {code}
 and 
 {code}
 org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testLogReplayWithDistributedReplay
 Failing for the past 1 build (Since

[jira] [Commented] (HBASE-12775) CompressionTest ate my HFile (sigh!)

2014-12-30 Thread Aditya Kishore (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260963#comment-14260963
 ] 

Aditya Kishore commented on HBASE-12775:


I agree but was trying to be consistent with what is already there. The program 
exits with a SUCCESS on System.out if successful (though it does LOG in other 
places).

May be replace all sysout and syserr with LOG.error() and LOG.info()?

 CompressionTest ate my HFile (sigh!)
 

 Key: HBASE-12775
 URL: https://issues.apache.org/jira/browse/HBASE-12775
 Project: HBase
  Issue Type: Bug
  Components: test
Affects Versions: 0.98.9, 0.99.2
Reporter: Aditya Kishore
Assignee: Aditya Kishore
 Fix For: 2.0.0

 Attachments: HBASE-12775-CompressionTest-ate-my-HFile.patch


 {{org.apache.hadoop.hbase.util.CompressionTest}} should abort execution if 
 the file specified on the command line exists. This will help careless (me) 
 or unsuspecting user to not loose data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12684) Add new AsyncRpcClient

[
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jurriaan Mous updated HBASE-12684:
--
Attachment: HBASE-12684-DEBUG1.patch

[~stack] Thanks! That file was exactly what I needed. :) Didn't know about that
page with the build artifacts.

It seems the fail is caused is a NullPointer in an area I thought could never
be null. Created a new debug patch which probably fixes it and added a Netty
ResourceLeakDetector since the log was complaining of ByteBuf leaks which I
didn't have locally.

Add new AsyncRpcClient
--

Key: HBASE-12684
URL: https://issues.apache.org/jira/browse/HBASE-12684
Project: HBase
Issue Type: Improvement
Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
Attachments: HBASE-12684-DEBUG1.patch, HBASE-12684-v1.patch,
HBASE-12684-v10.patch, HBASE-12684-v11.patch, HBASE-12684-v12.patch,
HBASE-12684-v13.patch, HBASE-12684-v14.patch, HBASE-12684-v15.patch,
HBASE-12684-v16.patch, HBASE-12684-v17.patch, HBASE-12684-v2.patch,
HBASE-12684-v3.patch, HBASE-12684-v4.patch, HBASE-12684-v5.patch,
HBASE-12684-v6.patch, HBASE-12684-v7.patch, HBASE-12684-v8.patch,
HBASE-12684-v9.patch, HBASE-12684.patch

With the changes in HBASE-12597 it is possible to add new RpcClients. This
issue is about adding a new Async RpcClient which would enable HBase to do
non blocking protobuf service communication.
Besides delivering a new AsyncRpcClient I would also like to ask the question
what it would take to replace the current RpcClient? This would enable to
simplify async code in some next issues.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12684) Add new AsyncRpcClient


[ 
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14260991#comment-14260991
 ] 

Hadoop QA commented on HBASE-12684:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12689481/HBASE-12684-DEBUG1.patch
  against master branch at commit 0513a21dc8b86f57b5a6c1b742904821632f77f7.
  ATTACHMENT ID: 12689481

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 8 new 
or modified tests.

{color:red}-1 javac{color}.  The patch appears to cause mvn compile goal to 
fail.

Compilation errors resume:
[ERROR] COMPILATION ERROR : 
[ERROR] 
/home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/hbase-server/src/test/java/org/apache/hadoop/hbase/client/TestHCM.java:[403,35]
 cannot find symbol
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.2:testCompile 
(default-testCompile) on project hbase-server: Compilation failure
[ERROR] 
/home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/hbase-server/src/test/java/org/apache/hadoop/hbase/client/TestHCM.java:[403,35]
 cannot find symbol
[ERROR] symbol:   variable done
[ERROR] location: class org.apache.hadoop.hbase.client.TestHCM
[ERROR] - [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
[ERROR] 
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn goals -rf :hbase-server


Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12255//console

This message is automatically generated.

 Add new AsyncRpcClient
 --

 Key: HBASE-12684
 URL: https://issues.apache.org/jira/browse/HBASE-12684
 Project: HBase
  Issue Type: Improvement
  Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12684-DEBUG1.patch, HBASE-12684-v1.patch, 
 HBASE-12684-v10.patch, HBASE-12684-v11.patch, HBASE-12684-v12.patch, 
 HBASE-12684-v13.patch, HBASE-12684-v14.patch, HBASE-12684-v15.patch, 
 HBASE-12684-v16.patch, HBASE-12684-v17.patch, HBASE-12684-v2.patch, 
 HBASE-12684-v3.patch, HBASE-12684-v4.patch, HBASE-12684-v5.patch, 
 HBASE-12684-v6.patch, HBASE-12684-v7.patch, HBASE-12684-v8.patch, 
 HBASE-12684-v9.patch, HBASE-12684.patch


 With the changes in HBASE-12597 it is possible to add new RpcClients. This 
 issue is about adding a new Async RpcClient which would enable HBase to do 
 non blocking protobuf service communication.
 Besides delivering a new AsyncRpcClient I would also like to ask the question 
 what it would take to replace the current RpcClient? This would enable to 
 simplify async code in some next issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12684) Add new AsyncRpcClient


 [ 
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jurriaan Mous updated HBASE-12684:
--
Attachment: (was: HBASE-12684-DEBUG1.patch)

 Add new AsyncRpcClient
 --

 Key: HBASE-12684
 URL: https://issues.apache.org/jira/browse/HBASE-12684
 Project: HBase
  Issue Type: Improvement
  Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12684-v1.patch, HBASE-12684-v10.patch, 
 HBASE-12684-v11.patch, HBASE-12684-v12.patch, HBASE-12684-v13.patch, 
 HBASE-12684-v14.patch, HBASE-12684-v15.patch, HBASE-12684-v16.patch, 
 HBASE-12684-v17.patch, HBASE-12684-v2.patch, HBASE-12684-v3.patch, 
 HBASE-12684-v4.patch, HBASE-12684-v5.patch, HBASE-12684-v6.patch, 
 HBASE-12684-v7.patch, HBASE-12684-v8.patch, HBASE-12684-v9.patch, 
 HBASE-12684.patch


 With the changes in HBASE-12597 it is possible to add new RpcClients. This 
 issue is about adding a new Async RpcClient which would enable HBase to do 
 non blocking protobuf service communication.
 Besides delivering a new AsyncRpcClient I would also like to ask the question 
 what it would take to replace the current RpcClient? This would enable to 
 simplify async code in some next issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12684) Add new AsyncRpcClient


 [ 
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jurriaan Mous updated HBASE-12684:
--
Attachment: HBASE-12684-DEBUG2.patch

 Add new AsyncRpcClient
 --

 Key: HBASE-12684
 URL: https://issues.apache.org/jira/browse/HBASE-12684
 Project: HBase
  Issue Type: Improvement
  Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12684-DEBUG2.patch, HBASE-12684-v1.patch, 
 HBASE-12684-v10.patch, HBASE-12684-v11.patch, HBASE-12684-v12.patch, 
 HBASE-12684-v13.patch, HBASE-12684-v14.patch, HBASE-12684-v15.patch, 
 HBASE-12684-v16.patch, HBASE-12684-v17.patch, HBASE-12684-v2.patch, 
 HBASE-12684-v3.patch, HBASE-12684-v4.patch, HBASE-12684-v5.patch, 
 HBASE-12684-v6.patch, HBASE-12684-v7.patch, HBASE-12684-v8.patch, 
 HBASE-12684-v9.patch, HBASE-12684.patch


 With the changes in HBASE-12597 it is possible to add new RpcClients. This 
 issue is about adding a new Async RpcClient which would enable HBase to do 
 non blocking protobuf service communication.
 Besides delivering a new AsyncRpcClient I would also like to ask the question 
 what it would take to replace the current RpcClient? This would enable to 
 simplify async code in some next issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12768) Support enable cache_data_on_write in Shell while creating table

2014-12-30 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261015#comment-14261015
 ] 

Hudson commented on HBASE-12768:


SUCCESS: Integrated in HBase-0.98 #764 (See 
[https://builds.apache.org/job/HBase-0.98/764/])
HBASE-12768 - Support enable cache_data_on_write in Shell while creating 
(ramkrishna: rev 54b6830fb4f9c33f3cd35694598694ffcd39ade4)
* hbase-shell/src/main/ruby/hbase/admin.rb


 Support enable cache_data_on_write in Shell while creating table
 

 Key: HBASE-12768
 URL: https://issues.apache.org/jira/browse/HBASE-12768
 Project: HBase
  Issue Type: Improvement
  Components: shell
Affects Versions: 1.0.0, 2.0.0, 0.94.27
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Fix For: 1.0.0, 2.0.0, 0.98.10

 Attachments: HBASE-12768.patch


 A simple approach to support cache_data_on_write while creating table in 
 shell.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12761) On region jump ClientScanners should get next row start key instead of a skip.


 [ 
https://issues.apache.org/jira/browse/HBASE-12761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jurriaan Mous updated HBASE-12761:
--
Attachment: (was: HBASE-12761-v2.patch)

 On region jump ClientScanners should get next row start key instead of a skip.
 --

 Key: HBASE-12761
 URL: https://issues.apache.org/jira/browse/HBASE-12761
 Project: HBase
  Issue Type: Improvement
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12761-v1.patch, HBASE-12761-v2.patch, 
 HBASE-12761.patch


 While working on async scanner I had some trouble with the extra RPC calls 
 that happen to let the Scanner advance 1 row so it skips the last already 
 known row. 
 This RPC call can be avoided by letting the start key be the last row with an 
 appended 0. This saves quite some logic from the scanners and improves 
 performance by saving extra RPC calls.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12740) Improve performance of TestHBaseFsck


 [ 
https://issues.apache.org/jira/browse/HBASE-12740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jurriaan Mous updated HBASE-12740:
--
Attachment: PROFILE_before_patch_test_fails.png
PROFILE_after_patch.png

[~dimaspivak]

The main reason why I opened this issue was that this test crashes for me 
because it reaches my machines Java default max thread count of 2000. (I tried 
various stack overflow suggested options to raise it with no success)

It caused many of these exceptions:
java.lang.OutOfMemoryError: unable to create new native thread

So I tried to lower this thread count to make it runnable so I could fix issues 
with the async client that this test was somehow pointing out.

I see my very long runs were those which would succeed in the tests because 
HBase recovered of the exceptions by retrying and gave the server some time to 
timeout some connections. Those runs took minutes longer and when I fixed what 
causes it the test would run in around the same numbers you post.

I think that the few milliseconds regression in your comparison is because the 
test now properly closes connections and regions and that takes up a bit more 
time.

I have included 2 screenshots of profiling of the failing run with too much 
thread creation and the succeeding run with much less thread creation.

Is it ok to still commit this patch so the test is runnable for those with 
lower thread limits?

 Improve performance of TestHBaseFsck
 

 Key: HBASE-12740
 URL: https://issues.apache.org/jira/browse/HBASE-12740
 Project: HBase
  Issue Type: Bug
  Components: util
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12740-v1.patch, HBASE-12740-v2.patch, 
 HBASE-12740-v3.patch, HBASE-12740.patch, PROFILE_after_patch.png, 
 PROFILE_before_patch_test_fails.png


 TestHBaseFsck performs poor on my machine. It crashes because the threads 
 reach the 2000 thread limit on my machine. Looking at the code a lot of 
 optimization is possible and some API calls are used wrong. A lot of Admin 
 instances are created and never closed, lots of Tables are not closed, 
 ThreadPoolExecutors are not shut down and an unlimited thread pool which does 
 not recycle threads.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12768) Support enable cache_data_on_write in Shell while creating table

2014-12-30 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261041#comment-14261041
 ] 

Hudson commented on HBASE-12768:


SUCCESS: Integrated in HBase-0.98-on-Hadoop-1.1 #730 (See 
[https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/730/])
HBASE-12768 - Support enable cache_data_on_write in Shell while creating 
(ramkrishna: rev 54b6830fb4f9c33f3cd35694598694ffcd39ade4)
* hbase-shell/src/main/ruby/hbase/admin.rb


 Support enable cache_data_on_write in Shell while creating table
 

 Key: HBASE-12768
 URL: https://issues.apache.org/jira/browse/HBASE-12768
 Project: HBase
  Issue Type: Improvement
  Components: shell
Affects Versions: 1.0.0, 2.0.0, 0.94.27
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Fix For: 1.0.0, 2.0.0, 0.98.10

 Attachments: HBASE-12768.patch


 A simple approach to support cache_data_on_write while creating table in 
 shell.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12684) Add new AsyncRpcClient

[
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261067#comment-14261067
]

Hadoop QA commented on HBASE-12684:
---

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment

http://issues.apache.org/jira/secure/attachment/12689483/HBASE-12684-DEBUG2.patch
against master branch at commit 0513a21dc8b86f57b5a6c1b742904821632f77f7.
ATTACHMENT ID: 12689483

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 8 new
or modified tests.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.

{color:green}+1 checkstyle{color}. The applied patch does not increase the
total number of checkstyle errors

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100

{color:green}+1 site{color}. The mvn site goal succeeds with this patch.

{color:red}-1 core tests{color}. The patch failed these unit tests:

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Checkstyle Errors:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//artifact/patchprocess/checkstyle-aggregate.html

Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/12256//console

This message is automatically generated.

Add new AsyncRpcClient
--

Key: HBASE-12684
URL: https://issues.apache.org/jira/browse/HBASE-12684
Project: HBase
Issue Type: Improvement
Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
Attachments: HBASE-12684-DEBUG2.patch, HBASE-12684-v1.patch,
HBASE-12684-v10.patch, HBASE-12684-v11.patch, HBASE-12684-v12.patch,
HBASE-12684-v13.patch, HBASE-12684-v14.patch, HBASE-12684-v15.patch,
HBASE-12684-v16.patch, HBASE-12684-v17.patch, HBASE-12684-v2.patch,
HBASE-12684-v3.patch, HBASE-12684-v4.patch, HBASE-12684-v5.patch,
HBASE-12684-v6.patch, HBASE-12684-v7.patch, HBASE-12684-v8.patch,
HBASE-12684-v9.patch, HBASE-12684.patch

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12781) Listen port will bind always to the passed command line address

2014-12-30 Thread Pankaj Kumar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-12781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pankaj Kumar updated HBASE-12781:
-
Attachment: 12781-V1.patch

Patch for review.

 Listen port will bind always to the passed command line address
 ---

 Key: HBASE-12781
 URL: https://issues.apache.org/jira/browse/HBASE-12781
 Project: HBase
  Issue Type: Bug
  Components: Thrift
Affects Versions: 0.98.3
Reporter: Pankaj Kumar
Assignee: Pankaj Kumar
 Attachments: 12781-V1.patch


 In Thrift server,  listen port will bind always to the address which  is 
 passed through command line argument. 
 --
 InetSocketAddress inetSocketAddress = bindToPort(cmd.getOptionValue(bind), 
 listenPort);
 -
 private static InetSocketAddress bindToPort(String bindValue, int listenPort)
   throws UnknownHostException {
 try {
   if (bindValue == null) {
 return new InetSocketAddress(listenPort);
   } else {
 return new InetSocketAddress(InetAddress.getByName(bindValue), 
 listenPort);
   }
 } catch (UnknownHostException e) {
   throw new RuntimeException(Could not bind to provided ip address, e);
 }
   }
 In case when bind address is not passed through argument then it is binding 
 with any local  address. It should read hbase.thrift.info.bindAddress  value 
 from configuration first.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12781) Listen port will bind always to the passed command line address

2014-12-30 Thread Pankaj Kumar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-12781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pankaj Kumar updated HBASE-12781:
-
Status: Patch Available  (was: Open)

 Listen port will bind always to the passed command line address
 ---

 Key: HBASE-12781
 URL: https://issues.apache.org/jira/browse/HBASE-12781
 Project: HBase
  Issue Type: Bug
  Components: Thrift
Affects Versions: 0.98.3
Reporter: Pankaj Kumar
Assignee: Pankaj Kumar
 Attachments: 12781-V1.patch


 In Thrift server,  listen port will bind always to the address which  is 
 passed through command line argument. 
 --
 InetSocketAddress inetSocketAddress = bindToPort(cmd.getOptionValue(bind), 
 listenPort);
 -
 private static InetSocketAddress bindToPort(String bindValue, int listenPort)
   throws UnknownHostException {
 try {
   if (bindValue == null) {
 return new InetSocketAddress(listenPort);
   } else {
 return new InetSocketAddress(InetAddress.getByName(bindValue), 
 listenPort);
   }
 } catch (UnknownHostException e) {
   throw new RuntimeException(Could not bind to provided ip address, e);
 }
   }
 In case when bind address is not passed through argument then it is binding 
 with any local  address. It should read hbase.thrift.info.bindAddress  value 
 from configuration first.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12684) Add new AsyncRpcClient


 [ 
https://issues.apache.org/jira/browse/HBASE-12684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jurriaan Mous updated HBASE-12684:
--
Attachment: HBASE-12684-DEBUG3.patch

Debug patch 3 
-Some logs for debugging (Although no test fails are reported it seems to have 
failed in the logs)
-Probable fix for the ByteBuf leak. Maybe this was the cause?

 Add new AsyncRpcClient
 --

 Key: HBASE-12684
 URL: https://issues.apache.org/jira/browse/HBASE-12684
 Project: HBase
  Issue Type: Improvement
  Components: Client
Reporter: Jurriaan Mous
Assignee: Jurriaan Mous
 Attachments: HBASE-12684-DEBUG2.patch, HBASE-12684-DEBUG3.patch, 
 HBASE-12684-v1.patch, HBASE-12684-v10.patch, HBASE-12684-v11.patch, 
 HBASE-12684-v12.patch, HBASE-12684-v13.patch, HBASE-12684-v14.patch, 
 HBASE-12684-v15.patch, HBASE-12684-v16.patch, HBASE-12684-v17.patch, 
 HBASE-12684-v2.patch, HBASE-12684-v3.patch, HBASE-12684-v4.patch, 
 HBASE-12684-v5.patch, HBASE-12684-v6.patch, HBASE-12684-v7.patch, 
 HBASE-12684-v8.patch, HBASE-12684-v9.patch, HBASE-12684.patch


 With the changes in HBASE-12597 it is possible to add new RpcClients. This 
 issue is about adding a new Async RpcClient which would enable HBase to do 
 non blocking protobuf service communication.
 Besides delivering a new AsyncRpcClient I would also like to ask the question 
 what it would take to replace the current RpcClient? This would enable to 
 simplify async code in some next issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12270) A bug in the bucket cache, with cache blocks on write enabled

2014-12-30 Thread Liu Shaohui (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-12270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu Shaohui updated HBASE-12270:

Attachment: HBASE-12270-v2.diff

Add tests for caching block on write with different block caches.

[~stack]
Add more tests if you wish. Thanks

 A bug in the bucket cache, with cache blocks on write enabled
 -

 Key: HBASE-12270
 URL: https://issues.apache.org/jira/browse/HBASE-12270
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.11, 0.98.6.1
 Environment: I can reproduce it on a simple 2 node cluster, one 
 running the master and another running a RS. I was testing on ec2.
 I used the following configurations for the cluster. 
 hbase-env:HBASE_REGIONSERVER_OPTS=-Xmx2G -XX:MaxDirectMemorySize=5G 
 -XX:CMSInitiatingOccupancyFraction=88 -XX:+AggressiveOpts -verbose:gc 
 -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -Xlog 
 gc:/tmp/hbase-regionserver-gc.log
 hbase-site:
 hbase.bucketcache.ioengine=offheap
 hbase.bucketcache.size=4196
 hbase.rs.cacheblocksonwrite=true
 hfile.block.index.cacheonwrite=true
 hfile.block.bloom.cacheonwrite=true
Reporter: Khaled Elmeleegy
Assignee: Liu Shaohui
Priority: Critical
 Fix For: 2.0.0, 0.98.10, 1.1.0

 Attachments: HBASE-12270-v1.diff, HBASE-12270-v2.diff, 
 TestHBase.java, TestKey.java


 In my experiments, I have writers streaming their output to HBase. The reader 
 powers a web page and does this scatter/gather, where it reads 1000 keys 
 written last and passes them the the front end. With this workload, I get the 
 exception below at the region server. Again, I am using HBAse (0.98.6.1). Any 
 help is appreciated.
 2014-10-10 15:06:44,173 ERROR 
 [B.DefaultRpcServer.handler=62,queue=2,port=60020] ipc.RpcServer: Unexpected 
 throwable object 
 java.lang.IllegalArgumentException
   at java.nio.Buffer.position(Buffer.java:236)
  at 
 org.apache.hadoop.hbase.util.ByteBufferUtils.skip(ByteBufferUtils.java:434)
   at 
 org.apache.hadoop.hbase.io.hfile.HFileReaderV2$ScannerV2.readKeyValueLen(HFileReaderV2.java:849)
   at 
 org.apache.hadoop.hbase.io.hfile.HFileReaderV2$ScannerV2.next(HFileReaderV2.java:760)
  at 
 org.apache.hadoop.hbase.regionserver.StoreFileScanner.seekAtOrAfter(StoreFileScanner.java:248)
at 
 org.apache.hadoop.hbase.regionserver.StoreFileScanner.seek(StoreFileScanner.java:152)
   at 
 org.apache.hadoop.hbase.regionserver.StoreScanner.seekScanners(StoreScanner.java:317)
  at 
 org.apache.hadoop.hbase.regionserver.StoreScanner.init(StoreScanner.java:176)
   at org.apache.hadoop.hbase.regionserver.HStore.getScanner(HStore.java:1780)
   at 
 org.apache.hadoop.hbase.regionserver.HRegion$RegionScannerImpl.init(HRegion.java:3758)
   at 
 org.apache.hadoop.hbase.regionserver.HRegion.instantiateRegionScanner(HRegion.java:1950)
   at 
 org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1936)
 at 
 org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1913)
   at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3157)
   at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:29587)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2027)
 at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:108)
  at 
 org.apache.hadoop.hbase.ipc.RpcExecutor.consumerLoop(RpcExecutor.java:114)
at org.apache.hadoop.hbase.ipc.RpcExecutor$1.run(RpcExecutor.java:94)
  at java.lang.Thread.run(Thread.java:744)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-12783) Create efficient RegionLocator implementation

Solomon Duskis created HBASE-12783:
--

 Summary: Create efficient RegionLocator implementation
 Key: HBASE-12783
 URL: https://issues.apache.org/jira/browse/HBASE-12783
 Project: HBase
  Issue Type: Bug
Affects Versions: 1.0.0, 2.0.0
Reporter: Solomon Duskis
Assignee: Solomon Duskis


A new HRegionLocator that only implements RegionLocator functionality will be 
more efficient to instantiate than a full HTable. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12783) Create efficient RegionLocator implementation


[ 
https://issues.apache.org/jira/browse/HBASE-12783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261125#comment-14261125
 ] 

Solomon Duskis commented on HBASE-12783:


For now, I'm simply creating a new implementation of RegionLocator.  We can 
slowly remove the implementation from HTable.

 Create efficient RegionLocator implementation
 -

 Key: HBASE-12783
 URL: https://issues.apache.org/jira/browse/HBASE-12783
 Project: HBase
  Issue Type: Bug
Affects Versions: 1.0.0, 2.0.0
Reporter: Solomon Duskis
Assignee: Solomon Duskis
 Attachments: HBASE-12783.patch


 A new HRegionLocator that only implements RegionLocator functionality will be 
 more efficient to instantiate than a full HTable. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-12783) Create efficient RegionLocator implementation


 [ 
https://issues.apache.org/jira/browse/HBASE-12783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Solomon Duskis updated HBASE-12783:
---
Attachment: HBASE-12783.patch

 Create efficient RegionLocator implementation
 -

 Key: HBASE-12783
 URL: https://issues.apache.org/jira/browse/HBASE-12783
 Project: HBase
  Issue Type: Bug
Affects Versions: 1.0.0, 2.0.0
Reporter: Solomon Duskis
Assignee: Solomon Duskis
 Attachments: HBASE-12783.patch


 A new HRegionLocator that only implements RegionLocator functionality will be 
 more efficient to instantiate than a full HTable. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12783) Create efficient RegionLocator implementation


[ 
https://issues.apache.org/jira/browse/HBASE-12783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261126#comment-14261126
 ] 

Solomon Duskis commented on HBASE-12783:


Hm... I guess I forgot tests.  I'll work on that.

 Create efficient RegionLocator implementation
 -

 Key: HBASE-12783
 URL: https://issues.apache.org/jira/browse/HBASE-12783
 Project: HBase
  Issue Type: Bug
Affects Versions: 1.0.0, 2.0.0
Reporter: Solomon Duskis
Assignee: Solomon Duskis
 Attachments: HBASE-12783.patch


 A new HRegionLocator that only implements RegionLocator functionality will be 
 more efficient to instantiate than a full HTable. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12728) buffered writes substantially less useful after removal of HTablePool


[ 
https://issues.apache.org/jira/browse/HBASE-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261147#comment-14261147
 ] 

Solomon Duskis commented on HBASE-12728:


Is HTableMultiplexer a good existing solution to replace the functionality of 
HTable without autoflush?

 buffered writes substantially less useful after removal of HTablePool
 -

 Key: HBASE-12728
 URL: https://issues.apache.org/jira/browse/HBASE-12728
 Project: HBase
  Issue Type: Bug
  Components: hbase
Affects Versions: 0.98.0
Reporter: Aaron Beppu

 In previous versions of HBase, when use of HTablePool was encouraged, HTable 
 instances were long-lived in that pool, and for that reason, if autoFlush was 
 set to false, the table instance could accumulate a full buffer of writes 
 before a flush was triggered. Writes from the client to the cluster could 
 then be substantially larger and less frequent than without buffering.
 However, when HTablePool was deprecated, the primary justification seems to 
 have been that creating HTable instances is cheap, so long as the connection 
 and executor service being passed to it are pre-provided. A use pattern was 
 encouraged where users should create a new HTable instance for every 
 operation, using an existing connection and executor service, and then close 
 the table. In this pattern, buffered writes are substantially less useful; 
 writes are as small and as frequent as they would have been with 
 autoflush=true, except the synchronous write is moved from the operation 
 itself to the table close call which immediately follows.
 More concretely :
 ```
 // Given these two helpers ...
 private HTableInterface getAutoFlushTable(String tableName) throws 
 IOException {
   // (autoflush is true by default)
   return storedConnection.getTable(tableName, executorService);
 }
 private HTableInterface getBufferedTable(String tableName) throws IOException 
 {
   HTableInterface table = getAutoFlushTable(tableName);
   table.setAutoFlush(false);
   return table;
 }
 // it's my contention that these two methods would behave almost identically,
 // except the first will hit a synchronous flush during the put call,
 and the second will
 // flush during the (hidden) close call on table.
 private void writeAutoFlushed(Put somePut) throws IOException {
   try (HTableInterface table = getAutoFlushTable(tableName)) {
 table.put(somePut); // will do synchronous flush
   }
 }
 private void writeBuffered(Put somePut) throws IOException {
   try (HTableInterface table = getBufferedTable(tableName)) {
 table.put(somePut);
   } // auto-close will trigger synchronous flush
 }
 ```
 For buffered writes to actually provide a performance benefit to users, one 
 of two things must happen:
 - The writeBuffer itself shouldn't live, flush and die with the lifecycle of 
 it's HTableInstance. If the writeBuffer were managed elsewhere and had a long 
 lifespan, this could cease to be an issue. However, if the same writeBuffer 
 is appended to by multiple tables, then some additional concurrency control 
 will be needed around it.
 - Alternatively, there should be some pattern for having long-lived HTable 
 instances. However, since HTable is not thread-safe, we'd need multiple 
 instances, and a mechanism for leasing them out safely -- which sure sounds a 
 lot like the old HTablePool to me.
 See discussion on mailing list here : 
 http://mail-archives.apache.org/mod_mbox/hbase-user/201412.mbox/%3CCAPdJLkEzmUQZ_kvD%3D8mrxi4V%3DhCmUp3g9MUZsddD%2Bmon%2BAvNtg%40mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12781) Listen port will bind always to the passed command line address


[ 
https://issues.apache.org/jira/browse/HBASE-12781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14261170#comment-14261170
 ] 

Hadoop QA commented on HBASE-12781:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12689499/12781-V1.patch
  against master branch at commit 0513a21dc8b86f57b5a6c1b742904821632f77f7.
  ATTACHMENT ID: 12689499

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 checkstyle{color}.  The applied patch does not increase the 
total number of checkstyle errors

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Checkstyle Errors: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//artifact/patchprocess/checkstyle-aggregate.html

  Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12258//console

This message is automatically generated.

 Listen port will bind always to the passed command line address
 ---

 Key: HBASE-12781
 URL: https://issues.apache.org/jira/browse/HBASE-12781
 Project: HBase
  Issue Type: Bug
  Components: Thrift
Affects Versions: 0.98.3
Reporter: Pankaj Kumar
Assignee: Pankaj Kumar
 Attachments: 12781-V1.patch


 In Thrift server,  listen port will bind always to the address which  is 
 passed through command line argument. 
 --
 InetSocketAddress inetSocketAddress = bindToPort(cmd.getOptionValue(bind), 
 listenPort);
 -
 private static InetSocketAddress bindToPort(String bindValue, int listenPort)
   throws UnknownHostException {
 try {
   if (bindValue == null) {
 return new InetSocketAddress(listenPort);
   } else {
 return new InetSocketAddress(InetAddress.getByName(bindValue), 
 listenPort);
   }
 } catch (UnknownHostException e) {
   throw new RuntimeException(Could not bind to provided ip address, e);
 }
   }
 In case when bind address is not passed through argument then it is binding 
 with any local  address. It

[jira] [Commented] (HBASE-12684) Add new AsyncRpcClient