[jira] [Commented] (HBASE-4890) fix possible NPE in HConnectionManager

2012-01-26 Thread Simon Dircks (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13194057#comment-13194057
 ] 

Simon Dircks commented on HBASE-4890:
-

I was also able to reproduce this:

hadoop-1.0 and hbase-0.92 with YCSB. 

2012/01/25 15:19:24 WARN client.HConnectionManager$HConnectionImplementation: 
Failed all from 
region=usertable,user3076346045817661344,1327530607222.bab55fba6adb17bc8757eb6cdee99a91.,
 hostname=datatask6.hadoop.telescope.tv, port=60020
java.util.concurrent.ExecutionException: java.lang.RuntimeException: 
java.lang.NullPointerException

Got this error on the LOAD part of YCSB

/usr/local/bin/java -cp "build/ycsb.jar:db/hbase/lib/*:db/hbase/conf/" 
com.yahoo.ycsb.Client -load -db com.yahoo.ycsb.db.HBaseClient -P 
workloads/workloada -p columnfamily=family1 -p recordcount=500 -s > load.dat



> fix possible NPE in HConnectionManager
> --
>
> Key: HBASE-4890
> URL: https://issues.apache.org/jira/browse/HBASE-4890
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.92.0
>Reporter: Jonathan Hsieh
>
> I was running YCSB against a 0.92 branch and encountered this error message:
> {code}
> 11/11/29 08:47:16 WARN client.HConnectionManager$HConnectionImplementation: 
> Failed all from 
> region=usertable,user3917479014967760871,1322555655231.f78d161e5724495a9723bcd972f97f41.,
>  hostname=c0316.hal.cloudera.com, port=57020
> java.util.concurrent.ExecutionException: java.lang.RuntimeException: 
> java.lang.NullPointerException
> at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
> at java.util.concurrent.FutureTask.get(FutureTask.java:83)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.processBatchCallback(HConnectionManager.java:1501)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.processBatch(HConnectionManager.java:1353)
> at org.apache.hadoop.hbase.client.HTable.flushCommits(HTable.java:898)
> at org.apache.hadoop.hbase.client.HTable.doPut(HTable.java:775)
> at org.apache.hadoop.hbase.client.HTable.put(HTable.java:750)
> at com.yahoo.ycsb.db.HBaseClient.update(Unknown Source)
> at com.yahoo.ycsb.DBWrapper.update(Unknown Source)
> at com.yahoo.ycsb.workloads.CoreWorkload.doTransactionUpdate(Unknown 
> Source)
> at com.yahoo.ycsb.workloads.CoreWorkload.doTransaction(Unknown Source)
> at com.yahoo.ycsb.ClientThread.run(Unknown Source)
> Caused by: java.lang.RuntimeException: java.lang.NullPointerException
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getRegionServerWithoutRetries(HConnectionManager.java:1315)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1327)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1325)
> at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
> at java.lang.Thread.run(Thread.java:619)
> Caused by: java.lang.NullPointerException
> at 
> org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:158)
> at $Proxy4.multi(Unknown Source)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1330)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1328)
> at 
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getRegionServerWithoutRetries(HConnectionManager.java:1309)
> ... 7 more
> {code}
> It looks like the NPE is caused by server being null in the MultiRespone 
> call() method.
> {code}
>  public MultiResponse call() throws IOException {
>  return getRegionServerWithoutRetries(
>  new ServerCallable(connection, tableName, null) {
>public MultiResponse call() throws IOException {
>  return server.multi(multi);
>}
>@Override
>public void connect(boolean reload) throws IOException {
>  server =
>connection.getHRegionConnection(loc.getHostname(), 
> loc.getPort());
>}
>  }
>  );
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, pleas

[jira] [Commented] (HBASE-3466) runtime exception -- cached an already cached block -- during compaction

2012-01-25 Thread Simon Dircks (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-3466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193444#comment-13193444
 ] 

Simon Dircks commented on HBASE-3466:
-

I just reproduced this with hadoop-1.0 and hbase-0.92 with YCSB. 


2012-01-25 23:23:51,556 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x134f70a343101a0 Successfully transitioned node 
162702503c650e551130e5fb588b3ec2 from RS_ZK_REGION_SPLIT to RS_ZK_REGION_SPLIT
2012-01-25 23:23:51,616 ERROR 
org.apache.hadoop.hbase.regionserver.HRegionServer: 
java.lang.RuntimeException: Cached an already cached block
at 
org.apache.hadoop.hbase.io.hfile.LruBlockCache.cacheBlock(LruBlockCache.java:268)
at 
org.apache.hadoop.hbase.io.hfile.HFileReaderV2.readBlock(HFileReaderV2.java:276)
at 
org.apache.hadoop.hbase.io.hfile.HFileReaderV2$ScannerV2.seekTo(HFileReaderV2.java:487)
at 
org.apache.hadoop.hbase.io.HalfStoreFileReader$1.seekTo(HalfStoreFileReader.java:168)
at 
org.apache.hadoop.hbase.regionserver.StoreFileScanner.seekAtOrAfter(StoreFileScanner.java:181)
at 
org.apache.hadoop.hbase.regionserver.StoreFileScanner.seek(StoreFileScanner.java:111)
at 
org.apache.hadoop.hbase.regionserver.StoreScanner.(StoreScanner.java:83)
at 
org.apache.hadoop.hbase.regionserver.Store.getScanner(Store.java:1721)
at 
org.apache.hadoop.hbase.regionserver.HRegion$RegionScannerImpl.(HRegion.java:2861)
at 
org.apache.hadoop.hbase.regionserver.HRegion.instantiateRegionScanner(HRegion.java:1432)
at 
org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1424)
at 
org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1400)
at org.apache.hadoop.hbase.regionserver.HRegion.get(HRegion.java:3688)
at org.apache.hadoop.hbase.regionserver.HRegion.get(HRegion.java:3581)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.get(HRegionServer.java:1771)
at sun.reflect.GeneratedMethodAccessor38.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:364)
at 
org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1325)
2012-01-25 23:23:51,656 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x134f70a343101a0 Attempting to transition node 
162702503c650e551130e5fb588b3ec2 from RS_ZK_REGION_SPLIT to RS_ZK_REGION_SPLIT






18 node cluster, dedicated namenode, zookeeper, hbasemaster, and YCSB client 
machine. 


/usr/local/bin/java -cp "build/ycsb.jar:db/hbase/lib/*:db/hbase/conf/" 
com.yahoo.ycsb.Client -load -db com.yahoo.ycsb.db.HBaseClient -P 
workloads/workloada -p columnfamily=family1 -p recordcount=500 -s > load.dat

loaded 5mil records, that created 8 regions. (balanced all onto the same RS)


/usr/local/bin/java -cp "build/ycsb.jar:db/hbase/lib/*:db/hbase/conf/" 
com.yahoo.ycsb.Client -t -db com.yahoo.ycsb.db.HBaseClient -P 
workloads/workloada -p columnfamily=family1 -p operationcount=500 -threads 
10 -s > transaction.dat


I also was able to reproduce the 
2/01/25 15:19:24 WARN client.HConnectionManager$HConnectionImplementation: 
Failed all from 
region=usertable,user3076346045817661344,1327530607222.bab55fba6adb17bc8757eb6cdee99a91.,
 hostname=datatask6.hadoop.telescope.tv, port=60020
java.util.concurrent.ExecutionException: java.lang.RuntimeException: 
java.lang.NullPointerException

found in https://issues.apache.org/jira/browse/HBASE-4890




> runtime exception -- cached an already cached block -- during compaction
> 
>
> Key: HBASE-3466
> URL: https://issues.apache.org/jira/browse/HBASE-3466
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.90.0
> Environment: ubuntu 9.10, kernel 2.6.31-14-generic SMP 8-core with 
> hyperthreading
>Reporter: M. C. Srivas
>Priority: Critical
>
> Happened while running ycsb against a single RS.  BlockSize was set to 64M to 
> tickle more splits. No compression, and replication factor set to 1.
>  
> I noticed that  https://issues.apache.org/jira/browse/HBASE-2455 applied to 
> 0.20.4, so opened this new one (didn't check to see if the code was the same 
> in 0.20.4 and 0.90.0)
> YCSB was run as follows:
> java -mx3000m -cp conf/:build/ycsb.jar:db/hbase/lib/* com.yahoo.ycsb.Client 
> -t -db com.yahoo.ycsb.db.HBaseClient -P workloads/workloada -p 
> columnfamily=family -p operationcount=1000 -s -threads 30 -target 3
> workloada was modified to do 1 billion records:
> --
> recordcount=10
> operationcount=1000
> wor