Re: HBase Write to Regionservers behavior

2009-06-16 Thread zsongbo
Thanks Bradford. On Tue, Jun 16, 2009 at 2:17 AM, Bradford Stephens < bradfordsteph...@gmail.com> wrote: > Right now, we're storing the documents in HBase. The indices are > stored in HDFS and then 'sharded' to each node using Katta. Not sure > if there's much of an advantage to storing the index

Re: HBase Write to Regionservers behavior

2009-06-15 Thread Bradford Stephens
Right now, we're storing the documents in HBase. The indices are stored in HDFS and then 'sharded' to each node using Katta. Not sure if there's much of an advantage to storing the index itself in HBase, though I'd be interested to see some use cases for it. On Sat, Jun 13, 2009 at 11:27 AM, zsong

Re: HBase Write to Regionservers behavior

2009-06-13 Thread zsongbo
Hi Bradford Stephens, Could you please share something about your practices on "Katta+HBase"? Do you store the documents or indexes in HBase? Schubert On Fri, Jun 12, 2009 at 1:19 PM, Bradford Stephens < bradfordsteph...@gmail.com> wrote: > That actually make a lot of sense. Thanks, awesome peop

Re: HBase Write to Regionservers behavior

2009-06-11 Thread Bradford Stephens
Oh, I misspoke. The MR job is over tab-delimited text files. I have 14 mappers and 7 reducers -- loading into an empty table. The total amount of regions generated after the job is done and some splits happen is 70. On Thu, Jun 11, 2009 at 10:47 PM, stack wrote: > Is your MR job over the whole tab

Re: HBase Write to Regionservers behavior

2009-06-11 Thread stack
Is your MR job over the whole table or a subset? If whole table then its odd that the 7 reducers are hitting only 2 regions. What happens if 70 reducers? St.Ack On Thu, Jun 11, 2009 at 10:42 PM, Bradford Stephens < bradfordsteph...@gmail.com> wrote: > About 70. > > On Thu, Jun 11, 2009 at 10:24

Re: HBase Write to Regionservers behavior

2009-06-11 Thread Bradford Stephens
About 70. On Thu, Jun 11, 2009 at 10:24 PM, stack wrote: > Hey, how many regions?  (smile) > St.Ack > > On Thu, Jun 11, 2009 at 10:19 PM, Bradford Stephens < > bradfordsteph...@gmail.com> wrote: > >> I meant, here 'till Midnight :) thanks! >> >> On Thu, Jun 11, 2009 at 10:19 PM, Bradford >> Stephe

Re: HBase Write to Regionservers behavior

2009-06-11 Thread stack
Hey, how many regions? (smile) St.Ack On Thu, Jun 11, 2009 at 10:19 PM, Bradford Stephens < bradfordsteph...@gmail.com> wrote: > I meant, here 'till Midnight :) thanks! > > On Thu, Jun 11, 2009 at 10:19 PM, Bradford > Stephens wrote: > > That actually make a lot of sense. Thanks, awesome people!

Re: HBase Write to Regionservers behavior

2009-06-11 Thread Bradford Stephens
I meant, here 'till Midnight :) thanks! On Thu, Jun 11, 2009 at 10:19 PM, Bradford Stephens wrote: > That actually make a lot of sense. Thanks, awesome people! Me and the > dev team are here to get Katta + HBase to play together, and it's > looking pretty nice. > > On Thu, Jun 11, 2009 at 9:47 PM,

Re: HBase Write to Regionservers behavior

2009-06-11 Thread Bradford Stephens
That actually make a lot of sense. Thanks, awesome people! Me and the dev team are here to get Katta + HBase to play together, and it's looking pretty nice. On Thu, Jun 11, 2009 at 9:47 PM, stack wrote: > On Thu, Jun 11, 2009 at 6:10 PM, Bradford Stephens < > bradfordsteph...@gmail.com> wrote: > >

Re: HBase Write to Regionservers behavior

2009-06-11 Thread stack
On Thu, Jun 11, 2009 at 6:10 PM, Bradford Stephens < bradfordsteph...@gmail.com> wrote: > > What I'm noticing is that it's writing to mostly one or two regions on > one box at a time, even though I have 7 reducers running. Monitoring > everything with dstat -v, I notice that only 2 of my servers a

RE: HBase Write to Regionservers behavior

2009-06-11 Thread Jim Kellerman (POWERSET)
servers. --- Jim Kellerman, Powerset (Live Search, Microsoft Corporation) > -Original Message- > From: Bradford Stephens [mailto:bradfordsteph...@gmail.com] > Sent: Thursday, June 11, 2009 6:10 PM > To: hbase-user@hadoop.apache.org > Subject: HBase Write to Regionservers behavio

Re: HBase Write to Regionservers behavior

2009-06-11 Thread Billy Pearson
once the table has split more you might look in to using org.apache.hadoop.hbase.mapred.HRegionPartitioner.java It will split up the data and only run one reduce per region so all that's regions rows will be sent to just one reducer but does not help much as when the table is small and you have

HBase Write to Regionservers behavior

2009-06-11 Thread Bradford Stephens
Hey there, So, I wiped my HDFS and reinstalled everything, and am running smaller loads... so far, so good. I've got 7 regionservers. My job basically takes a lot of documents and metadata with unique binary keys (like "055E51294F9D9CA331D968D04B72A11C"), combines them all in a reducer, then writ