Re: Regarding Designing Hbase Table - for a banking scenario

2012-12-23 Thread Mohammad Tariq
Hello Ram, You schema looks fine to me. And, you can have anything as your rowkey, but keeping in mind the types of queries which your table is going to serve. It would be helpful for us to answer your question in a better manner if can tell us what kind of queries you face mostly? Thank

Re: how can thrift connect to hbase?

2012-12-23 Thread hua beatls
Hi, we have a c++ client, and the thrift server was installed on the Ndatanode ®ionservers. ( from the official document we know that it is not recommended for the thrift colocated with nn &jobtracker). as we understand the processs from client to hbase shown below: client--

Re: HBase table affinity to host(s)

2012-12-23 Thread yuzhihong
They would move over to live region servers. Cheers On Dec 23, 2012, at 2:59 PM, Brennon Church wrote: > I'm curious. If I disable the hbase balancer, what happens when a > regionserver goes offline? Will those regions get moved over to the > remaining hosts as would normally be the case

Hbase Question

2012-12-23 Thread Dalia Sobhy
Dear all, I have 50,000 row with diagnosis qualifier = "cardiac", and another 50,000 rows with "renal". When I type this in Hbase shell, import org.apache.hadoop.hbase.filter.CompareFilter import org.apache.hadoop.hbase.filter.SingleColumnValueFilter import org.apache.hadoop.hbase.filter.Subst

Re: HBase table affinity to host(s)

2012-12-23 Thread Brennon Church
I'm curious. If I disable the hbase balancer, what happens when a regionserver goes offline? Will those regions get moved over to the remaining hosts as would normally be the case? Or will they remain unavailable until the balancer is run manually? Thanks. --Brennon On 12/22/12 5:48 AM, T

Re: HBase table affinity to host(s)

2012-12-23 Thread Dimitry Goldin
On 22.12.2012 05:59, Ted Yu wrote: Dimitry: StochasticLoadBalancer doesn't provide the capability of grouping selected regions on subset of region servers. I mentioned the StochasticBalancer more as an example. It still should be possible to achieve something similar by extending the balancer t

Re: Hbase scalability performance

2012-12-23 Thread Mohammad Tariq
Hello Dalia, You can go the Hbase webUI to see the details, as Ted has specified earlier. But if you really want to monitor everything properly I would suggest to configure Ganglia to capture the metrics. To do a quick check you can also use "status" command from the Hbase shell. hbase> stat

Re: Hbase scalability performance

2012-12-23 Thread Dimitry Goldin
Hi, On 23.12.2012 14:38, Dalia Sobhy wrote: So do you have an example of multithreading program, because I am using the read-made Java API not thrift server, so I don't know how to write a multithreaded program using this API. You should take a loot at YCSB (https://github.com/brianfrankco

Re: Many scanner opening

2012-12-23 Thread Eugeny Morozov
Lars, We tried, but I didn't know there is such a contention issue. We have two different column families. First one contains data, that are partially used as a filter. And actual data lives in second column family. So, outer scanner (the first one) goes through the table and filter out keys tha

RE: Hbase scalability performance

2012-12-23 Thread Dalia Sobhy
I am using 3 region servers. Hbase version: 0.92 Cloudera Manager: 4.1 How to know the load is balanced Ted? > Date: Sat, 22 Dec 2012 08:06:59 -0800 > Subject: Re: Hbase scalability performance > From: yuzhih...@gmail.com > To: user@hbase.apache.org > > By '3 datanodes', did you mean that you

RE: Hbase scalability performance

2012-12-23 Thread Dalia Sobhy
Dear all, Thanks for your help. I am already using coprocessors for this table. I already tried a program similar to it but using thrift server and my cluster was 23 nodes on Rackspace cloud, but the same I didn't see any improved performance. Then I was advised to use actual machines (not vi

RE: Hbase scalability performance

2012-12-23 Thread Dalia Sobhy
So do you have an example of multithreading program, because I am using the read-made Java API not thrift server, so I don't know how to write a multithreaded program using this API. > Date: Sat, 22 Dec 2012 08:50:56 -0800 > Subject: Re: Hbase scalability performance > From: va...@pinterest.com