RE: Row Key Question

2011-04-22 Thread Peter Haidinyak
utes on my home system. -Pete -Original Message- From: jdcry...@gmail.com [mailto:jdcry...@gmail.com] On Behalf Of Jean-Daniel Cryans Sent: Friday, April 22, 2011 1:32 PM To: user@hbase.apache.org Subject: Re: Row Key Question That's almost exactly what mozilla is doing with soro

Re: Row Key Question

2011-04-22 Thread Jean-Daniel Cryans
rt the data would buy me anything or not. Anyway, I'm going to try this > out at home on my own cluster to see how it performs. > > Thanks > > -Pete > > -Original Message- > From: Buttler, David [mailto:buttl...@llnl.gov] > Sent: Friday, April 22, 2011 12:10

RE: Row Key Question

2011-04-22 Thread Peter Haidinyak
how it performs. Thanks -Pete -Original Message- From: Buttler, David [mailto:buttl...@llnl.gov] Sent: Friday, April 22, 2011 12:10 PM To: user@hbase.apache.org Subject: RE: Row Key Question Regions split when they are larger than the configuration parameter region size. Your data is

RE: Row Key Question

2011-04-22 Thread Buttler, David
-Original Message- From: Peter Haidinyak [mailto:phaidin...@local.com] Sent: Friday, April 22, 2011 10:41 AM To: user@hbase.apache.org Subject: Row Key Question I have a question on how HBase decides to save rows based on Row Keys. Say I have a million rows to insert into a new table in a

Re: Row Key Question

2011-04-22 Thread Jean-Daniel Cryans
The splitting is based on when a region reaches a configured size (default is 256MB). A table starts with 1 region, and splits as needed when you insert. For a bit more info see: http://hbase.apache.org/book.html#regions.arch J-D On Fri, Apr 22, 2011 at 10:40 AM, Peter Haidinyak wrote: > I have

Row Key Question

2011-04-22 Thread Peter Haidinyak
I have a question on how HBase decides to save rows based on Row Keys. Say I have a million rows to insert into a new table in a ten node cluster. Each row's key is some random 32 byte value and there are two columns per row, each column contains some random 32 byte value. My question is how do

Re: Row Key Question

2011-02-16 Thread Stack
int@gmail.com [mailto:saint@gmail.com] On Behalf Of >> Stack >> Sent: Wednesday, February 16, 2011 10:52 AM >> To: user@hbase.apache.org >> Cc: Peter Haidinyak >> Subject: Re: Row Key Question >> >> On Wed, Feb 16, 2011 at 10:48 AM, Peter Haidinyak &

RE: Row Key Question

2011-02-16 Thread Peter Haidinyak
gt; Cc: Peter Haidinyak > Subject: Re: Row Key Question > > On Wed, Feb 16, 2011 at 10:48 AM, Peter Haidinyak > wrote: >> I'm not using the Timestamp alone, it is part of a compound key. >> My old key included >> || >> >> My new key will include >> ||

Re: Row Key Question

2011-02-16 Thread Stack
On Wed, Feb 16, 2011 at 10:48 AM, Peter Haidinyak wrote: > I'm not using the Timestamp alone, it is part of a compound key. > My old key included > || > > My new key will include > || > Yes. Got that. Was just trying to give you a bit more background to highlight what the lads were saying befor

RE: Row Key Question

2011-02-16 Thread Peter Haidinyak
th my limited knowledge I don't know how to do that yet. Thanks -Pete -Original Message- From: saint@gmail.com [mailto:saint@gmail.com] On Behalf Of Stack Sent: Wednesday, February 16, 2011 10:15 AM To: user@hbase.apache.org Subject: Re: Row Key Question See http://hbase.

Re: Row Key Question

2011-02-16 Thread Stack
gt; Thanks again > > -Pete > > > > -Original Message- > From: Doug Meil [mailto:doug.m...@explorysmedical.com] > Sent: Wednesday, February 16, 2011 9:41 AM > To: user@hbase.apache.org > Subject: RE: Row Key Question > > Hi there- > > As was described in th

RE: Row Key Question

2011-02-16 Thread Peter Haidinyak
[mailto:doug.m...@explorysmedical.com] Sent: Wednesday, February 16, 2011 9:41 AM To: user@hbase.apache.org Subject: RE: Row Key Question Hi there- As was described in the HBase chapter in the Hadoop book by Tom White, you don't want to insert a lot of data at one time with incrementing keys. -MM-DD woul

RE: Row Key Question

2011-02-16 Thread Doug Meil
Haidinyak [mailto:phaidin...@local.com] Sent: Tuesday, February 15, 2011 7:38 PM To: user@hbase.apache.org Subject: Row Key Question Hi All, A couple of weeks ago I asked about how to distribute my rows across the servers if the key always starts with the date in the format... -MM-DD I

RE: Row Key Question

2011-02-15 Thread Gary Gilbert - SQLstream
00's x01's x02's etc. depending on requirements. Any thoughts? Gary -Original Message- From: Peter Haidinyak [mailto:phaidin...@local.com] Sent: Tuesday, February 15, 2011 6:38 PM To: user@hbase.apache.org Subject: Row Key Question Hi All, A couple of weeks ago I asked abo

Re: Row Key Question

2011-02-15 Thread Chris Tarnas
I've been playing with salting my keys as well keys. My current experiments are around hashing the rowkey and using digits of that to create the prefix. That would make your salts and your puts idempotent, but you do loose control of data-locality. -chris On Feb 15, 2011, at 4:38 PM, Peter Hai

Row Key Question

2011-02-15 Thread Peter Haidinyak
Hi All, A couple of weeks ago I asked about how to distribute my rows across the servers if the key always starts with the date in the format... -MM-DD I believe Stack, although I could be wrong, suggested pre-pending a 'X-' when 'X' is a number from 1 to the number of servers I have. Thi