Monitoring HBase counters while running tests

2011-08-03 Thread Steinmaurer Thomas
Hello, we are in the process of running our first load test on a Cloudera test cluster with 8 region servers. We also have Ganglia in use for monitoring. I wonder which counters one should watch during the tests to identify possible bottlenecks, improvements, etc. Is there some kind of quick

Re: about 'number of empty REGIONINFO_QUALIFIER rows in .META.'

2011-08-03 Thread Xian Woo
In addition, when I opened the web-based UI and clicked on the testRow button in User Tables, I didn't get any information about Online Regions, although I can still scan or get this table. 2011/8/3 Xian Woo infinity0...@gmail.com Hello, guys, I've got a table of which rows range from 'row1'

YCSB Benchmarking for HBase

2011-08-03 Thread praveenesh kumar
Hi, Is anyone working on YCSB (Yahoo Cloud Service Benchmarking) for HBase? I am trying to run it, and it's giving me an error: $ java -cp build/ycsb.jar com.yahoo.ycsb.CommandLine -db com.yahoo.ycsb.db.HBaseClient YCSB Command Line client Type help for command line help Start with -help for usage info

Why the HBase master is not stopped?

2011-08-03 Thread shanmuganathan.r
Hi All, I have a problem in HBase fully distributed mode with a four-node cluster. My Hadoop and ZooKeeper are working properly. HBase also starts and runs properly, but it does not stop when I run the stop-hbase.sh command. Thanks in advance! Regards,

Re: YCSB Benchmarking for HBase

2011-08-03 Thread Eric Hauser
You need to copy both the Hadoop and HBase JAR to both YCSB/lib/ and YCSB/hbase/lib/. BTW - I never got a chance to respond, but thanks to those who responded to my inquiry last week. Turning on RPC debugging showed me that the issue was as simple as a problem with the box running YCSB as HBase

Re: Why the HBase master is not stopped?

2011-08-03 Thread Gan, Xiyun
Hi, I have run into the same situation. My solution is to stop the regionservers, then the master, then ZooKeeper manually. On Wed, Aug 3, 2011 at 7:47 PM, shanmuganathan.r shanmuganatha...@zohocorp.com wrote: Hi All, I have a problem in HBase fully distributed mode with four node

Re: Backup Master not started?

2011-08-03 Thread shanmuganathan.r
Hi Jean, The problem continues even after the active master is properly killed. How can we make the active master and backup master use the same ZooKeeper ensemble? Regards, Shanmuganathan On Tue, 02 Aug 2011 22:42:10 +0530 Jean-Daniel Cryans <jdcry...@apache.org> wrote

Re: Why the HBase master is not stopped?

2011-08-03 Thread Rui Li
Have you tried shutdown in hbase shell? Sent from my iPhone On Aug 3, 2011, at 4:48 AM, shanmuganathan.r shanmuganatha...@zohocorp.com wrote: Hi All, I have a problem in HBase fully distributed mode with four node cluster. My Hadoop and zookeeper are working properly. The HBase also

HBASE-2849 - 90.3 hbase clients recovering from invalid zk sessions

2011-08-03 Thread Douglas Campbell
Did this patch make it into 0.90.3? https://issues.apache.org/jira/browse/HBASE-2849 When I go into the HConnectionManager source from the 90.3 branch, evidence of the patch is missing. Pointers to where this logic is would be helpful. I'm trying to make sure my clients reconnect under zk and

Why the Master is waiting for the stopped regionserver?

2011-08-03 Thread shanmuganathan.r
Hi All, I have a problem in HBase fully distributed mode with a four-node cluster. My Hadoop and ZooKeeper are working properly. HBase also starts and runs properly, but it does not stop when I run the stop-hbase.sh command.

Re: Why the HBase master is not stopped?

2011-08-03 Thread Jean-Daniel Cryans
Look at the .log and .out files, something might just be blocking you. J-D On Wed, Aug 3, 2011 at 4:47 AM, shanmuganathan.r shanmuganatha...@zohocorp.com wrote: Hi All, I have a problem in HBase fully distributed mode with four node cluster. My Hadoop and zookeeper are working

0.90.3 IPC Reader threads are not daemons

2011-08-03 Thread Douglas Campbell
While debugging another issue, I noticed that the IPC Reader threads kicked off in HBaseServer are not daemon threads. As it is now, they could prevent the program from exiting if for some reason the thread pool isn't shut down after main exits. Other threads, like IPC Servers, are daemon threads
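
A minimal sketch of the daemon-thread point raised above, using only the JDK. The class and method names (DaemonReaderPool, daemonFactory, runsOnDaemonThread) are hypothetical and are not taken from HBaseServer; the sketch only shows that a pool built with the default thread factory runs work on non-daemon threads, while a pool given a custom factory can mark them as daemons.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class DaemonReaderPool {

    // Hypothetical factory: names each thread and marks it as a daemon,
    // so a forgotten pool shutdown cannot keep the JVM alive.
    public static ThreadFactory daemonFactory(final String prefix) {
        final AtomicInteger counter = new AtomicInteger();
        return new ThreadFactory() {
            @Override
            public Thread newThread(Runnable r) {
                Thread t = new Thread(r, prefix + "-" + counter.incrementAndGet());
                t.setDaemon(true);
                return t;
            }
        };
    }

    // Submits a probe task and reports whether the worker thread is a daemon.
    public static boolean runsOnDaemonThread(ExecutorService pool) {
        Callable<Boolean> probe = () -> Thread.currentThread().isDaemon();
        try {
            return pool.submit(probe).get(5, TimeUnit.SECONDS);
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        ExecutorService defaults = Executors.newFixedThreadPool(2);
        ExecutorService daemons = Executors.newFixedThreadPool(2, daemonFactory("reader"));

        System.out.println("default factory -> daemon? " + runsOnDaemonThread(defaults));
        System.out.println("custom factory  -> daemon? " + runsOnDaemonThread(daemons));

        // The default pool must be shut down explicitly; its non-daemon
        // worker would otherwise keep the JVM running after main returns.
        defaults.shutdown();
        daemons.shutdown();
    }
}
```

If the shutdown call on the default pool is removed, the JVM stays up after main returns, which is the behavior the message describes for the reader threads.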

Re: 0.90.3 IPC Reader threads are not daemons

2011-08-03 Thread Douglas Campbell
will do From: Ted Yu yuzhih...@gmail.com To: user@hbase.apache.org; Douglas Campbell deegs...@yahoo.com Sent: Wednesday, August 3, 2011 11:29 AM Subject: Re: 0.90.3 IPC Reader threads are not daemons Doug: Mind logging an issue ? Thanks On Wed, Aug 3, 2011 at

Re: How big a jump is safe?

2011-08-03 Thread Jean-Daniel Cryans
It's safe, all the big installs I know are either on some heavily patched 0.89 or a late version of 0.90. You might want to do an export before upgrading, or a complete HDFS distcp, just for good measure. Also read this guide from the online book:

Re: 0.90.3 IPC Reader threads are not daemons

2011-08-03 Thread Douglas Campbell
https://issues.apache.org/jira/browse/HBASE-4159 From: Douglas Campbell deegs...@yahoo.com To: user@hbase.apache.org user@hbase.apache.org Sent: Wednesday, August 3, 2011 11:41 AM Subject: Re: 0.90.3 IPC Reader threads are not daemons will do

Re: How big a jump is safe?

2011-08-03 Thread Greg Bledsoe
Nice. Thanks for the quick response and pointer. Greg On 8/3/11 2:42 PM, Jean-Daniel Cryans jdcry...@apache.org wrote: It's safe, all the big installs I know are either on some heavily patched 0.89 or a late version of 0.90. You might want to do an export before upgrading, or a complete HDFS

Re: YCSB Benchmarking for HBase

2011-08-03 Thread Edward Capriolo
On Wed, Aug 3, 2011 at 6:10 AM, praveenesh kumar praveen...@gmail.com wrote: Hi, Anyone working on YCSB (Yahoo Cloud Service Benchmarking) for HBase ?? I am trying to run it, its giving me error: $ java -cp build/ycsb.jar com.yahoo.ycsb.CommandLine -db com.yahoo.ycsb.db.HBaseClient YCSB

Re: about 'number of empty REGIONINFO_QUALIFIER rows in .META.'

2011-08-03 Thread Jean-Daniel Cryans
These used to be leftovers from old bugs we had with the old master, but I'm not aware of any current bug that causes that. Maybe a failed split that rolled back; only your master and region server logs can tell. Looking at the output it seems that your table is healthy, but that row in .META.

Re: Monitoring HBase counters while running tests

2011-08-03 Thread Jean-Daniel Cryans
Hi Thomas, There is no specific guide for monitoring performance, but I strongly recommend that you read the following sections of the book: http://hbase.apache.org/book/performance.html http://hbase.apache.org/book/trouble.html As for general advice, look out for swap and high iowait, verify that

Re: Apparent data loss on 90.4 rc2 after partial zookeeper network partition (on MapR)

2011-08-03 Thread Jean-Daniel Cryans
Hi Jacques, Sorry to hear about that. Regarding MapR, I personally don't have hands-on experience, so it's a little bit hard for me to help you. You might want to ping them and ask their opinion (and I know they are watching, Ted? Srivas?) What I can do is tell you if things look normal from

Re: MR on HBase - java.io.IOException: Pass a Delete or a Put

2011-08-03 Thread Suraj Varma
Yes - this was also the reason in the thread I had linked earlier. The reason is that there are two versions of the Reducer. One in the mapred package (that uses Iterator) and the new one in mapreduce package (that uses Iterable). It is likely that you used the new one with the old reduce
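
The Iterator/Iterable mix-up described above can be illustrated without Hadoop on the classpath. The sketch below uses hypothetical stand-in classes (BaseReducer, WrongReducer, RightReducer), not the real Hadoop Reducer: a subclass that declares reduce with an Iterator parameter only adds an overload, so the caller still invokes the base class's pass-through reduce.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ReduceSignatureDemo {

    // Stand-in for a new-API style base class whose default reduce
    // simply passes values through (hypothetical, not the Hadoop class).
    public static class BaseReducer {
        public String reduce(String key, Iterable<Integer> values) {
            return "identity";
        }

        // Stand-in for the framework: it always calls the Iterable version.
        public String run(String key, List<Integer> values) {
            return reduce(key, values);
        }
    }

    // Mistake: an Iterator parameter creates a new overload, not an override,
    // so run() never reaches this method.
    public static class WrongReducer extends BaseReducer {
        public String reduce(String key, Iterator<Integer> values) {
            return "custom";
        }
    }

    // Correct: same parameter types as the base method. @Override makes the
    // compiler reject the signature if it does not match.
    public static class RightReducer extends BaseReducer {
        @Override
        public String reduce(String key, Iterable<Integer> values) {
            return "custom";
        }
    }

    public static void main(String[] args) {
        List<Integer> values = Arrays.asList(1, 2, 3);
        System.out.println("wrong signature runs: " + new WrongReducer().run("k", values));
        System.out.println("right signature runs: " + new RightReducer().run("k", values));
    }
}
```

Annotating the reduce method with @Override is a cheap guard here: with the Iterator signature the compiler reports an error instead of silently accepting an unused overload.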

Re: HFileOutputFormat comment says multi family not supported

2011-08-03 Thread Gan, Xiyun
According to JIRA, it will be part of 92. On Thu, Aug 4, 2011 at 2:18 AM, David Capwell dcapw...@yahoo-inc.com wrote: It's in trunk, so it will be part of 92, correct? On Aug 2, 2011, at 7:00:04 PM, Gan, Xiyun wrote: Hi Capwell, Stack said HBASE-1861 is one legitimate approach to support

Re: HFileOutputFormat comment says multi family not supported

2011-08-03 Thread David Capwell
thanks On Aug 3, 2011, at 6:06:14 PM, Gan, Xiyun wrote: According to JIRA, it will be part of 92. On Thu, Aug 4, 2011 at 2:18 AM, David Capwell dcapw...@yahoo-inc.com wrote: It's in trunk, so it will be part of 92, correct? On Aug 2, 2011, at 7:00:04 PM, Gan, Xiyun wrote: Hi Capwell,

Re: Can't disable a table because of a region time out

2011-08-03 Thread Yi Liang
Hi J-D, I have tried to force unassign it with the shell command unassign 'HistoryNoticeInc,,1311223940614.aaa8d345f5b7b6a69b786fe6d14ed9fa.', true, but it didn't help. The error message still appears after executing that command, and it looks just the same as before: 2011-08-04 09:56:41,005 DEBUG