Thanks Bradford.
On Tue, Jun 16, 2009 at 2:17 AM, Bradford Stephens <
bradfordsteph...@gmail.com> wrote:
> Right now, we're storing the documents in HBase. The indices are
> stored in HDFS and then 'sharded' to each node using Katta. Not sure
> if there's much of an advantage to storing the index
Right now, we're storing the documents in HBase. The indices are
stored in HDFS and then 'sharded' to each node using Katta. Not sure
if there's much of an advantage to storing the index itself in HBase,
though I'd be interested to see some use cases for it.
On Sat, Jun 13, 2009 at 11:27 AM, zsong
Hi Bradford Stephens,
Could you please share something about your practices on "Katta+HBase"?
Do you store the documents or indexes in HBase?
Schubert
On Fri, Jun 12, 2009 at 1:19 PM, Bradford Stephens <
bradfordsteph...@gmail.com> wrote:
> That actually make a lot of sense. Thanks, awesome peop
Oh, I misspoke. The MR job is over tab-delimited text files. I have 14
mappers and 7 reducers -- loading into an empty table. The total
amount of regions generated after the job is done and some splits
happen is 70.
On Thu, Jun 11, 2009 at 10:47 PM, stack wrote:
> Is your MR job over the whole tab
Is your MR job over the whole table or a subset? If whole table then its
odd that the 7 reducers are hitting only 2 regions. What happens if 70
reducers?
St.Ack
On Thu, Jun 11, 2009 at 10:42 PM, Bradford Stephens <
bradfordsteph...@gmail.com> wrote:
> About 70.
>
> On Thu, Jun 11, 2009 at 10:24
About 70.
On Thu, Jun 11, 2009 at 10:24 PM, stack wrote:
> Hey, how many regions? (smile)
> St.Ack
>
> On Thu, Jun 11, 2009 at 10:19 PM, Bradford Stephens <
> bradfordsteph...@gmail.com> wrote:
>
>> I meant, here 'till Midnight :) thanks!
>>
>> On Thu, Jun 11, 2009 at 10:19 PM, Bradford
>> Stephe
Hey, how many regions? (smile)
St.Ack
On Thu, Jun 11, 2009 at 10:19 PM, Bradford Stephens <
bradfordsteph...@gmail.com> wrote:
> I meant, here 'till Midnight :) thanks!
>
> On Thu, Jun 11, 2009 at 10:19 PM, Bradford
> Stephens wrote:
> > That actually make a lot of sense. Thanks, awesome people!
I meant, here 'till Midnight :) thanks!
On Thu, Jun 11, 2009 at 10:19 PM, Bradford
Stephens wrote:
> That actually make a lot of sense. Thanks, awesome people! Me and the
> dev team are here to get Katta + HBase to play together, and it's
> looking pretty nice.
>
> On Thu, Jun 11, 2009 at 9:47 PM,
That actually make a lot of sense. Thanks, awesome people! Me and the
dev team are here to get Katta + HBase to play together, and it's
looking pretty nice.
On Thu, Jun 11, 2009 at 9:47 PM, stack wrote:
> On Thu, Jun 11, 2009 at 6:10 PM, Bradford Stephens <
> bradfordsteph...@gmail.com> wrote:
>
>
On Thu, Jun 11, 2009 at 6:10 PM, Bradford Stephens <
bradfordsteph...@gmail.com> wrote:
>
> What I'm noticing is that it's writing to mostly one or two regions on
> one box at a time, even though I have 7 reducers running. Monitoring
> everything with dstat -v, I notice that only 2 of my servers a
servers.
---
Jim Kellerman, Powerset (Live Search, Microsoft Corporation)
> -Original Message-
> From: Bradford Stephens [mailto:bradfordsteph...@gmail.com]
> Sent: Thursday, June 11, 2009 6:10 PM
> To: hbase-user@hadoop.apache.org
> Subject: HBase Write to Regionservers behavio
once the table has split more you might look in to using
org.apache.hadoop.hbase.mapred.HRegionPartitioner.java
It will split up the data and only run one reduce per region so all that's
regions rows will be sent to just one reducer
but does not help much as when the table is small and you have
Hey there,
So, I wiped my HDFS and reinstalled everything, and am running smaller
loads... so far, so good. I've got 7 regionservers.
My job basically takes a lot of documents and metadata with unique
binary keys (like "055E51294F9D9CA331D968D04B72A11C"), combines them
all in a reducer, then writ
13 matches
Mail list logo