Splitting is triggered when a region reaches a configured size (the default is 256MB). A table starts with one region and splits as needed as you insert data. For a bit more info, see: http://hbase.apache.org/book.html#regions.arch
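
To make that concrete, here is a rough toy model in Python, not HBase code. It assumes each row costs exactly 96 bytes (the 32 byte key plus two 32 byte values from your example) and ignores HBase's per-cell overhead, flushes, and compactions. ToyTable, ROW_BYTES, and max_region_bytes are names made up for this sketch. It only shows the idea: a region is a contiguous range of sorted row keys, and when it grows past the limit it is cut at a middle key into two regions.

```python
import bisect
import os

# Assumed row cost for this sketch: 32-byte key plus two 32-byte values.
# Real HBase stores extra bytes per cell, so actual sizes are larger.
ROW_BYTES = 32 + 2 * 32


class ToyTable:
    """Toy model: regions are contiguous sorted key ranges that split when too big."""

    def __init__(self, max_region_bytes):
        self.max_region_bytes = max_region_bytes
        self.start_keys = [b""]  # start key of each region, kept sorted
        self.regions = [[]]      # sorted row keys held by each region

    def put(self, key):
        # Locate the region whose key range contains this key.
        i = bisect.bisect_right(self.start_keys, key) - 1
        rows = self.regions[i]
        pos = bisect.bisect_left(rows, key)
        if pos < len(rows) and rows[pos] == key:
            return  # same row key again: treated as an overwrite, no growth
        rows.insert(pos, key)
        if len(rows) >= 2 and len(rows) * ROW_BYTES > self.max_region_bytes:
            self._split(i)

    def _split(self, i):
        # Cut the region at its middle key; the upper half becomes a new region.
        rows = self.regions[i]
        mid = len(rows) // 2
        self.start_keys.insert(i + 1, rows[mid])
        self.regions[i:i + 1] = [rows[:mid], rows[mid:]]


def demo():
    # A tiny limit (1000 rows' worth) so splits show up quickly.
    table = ToyTable(max_region_bytes=ROW_BYTES * 1000)
    for _ in range(10_000):
        table.put(os.urandom(32))
    print(len(table.regions), [len(r) for r in table.regions])


if __name__ == "__main__":
    demo()
```

With random keys the rows land all over the key space, so every region fills at about the same rate. The regions are then handed out across the region servers, which is how the data ends up spread over your nodes. Note that a million rows at roughly 96 bytes of raw payload is only about 96MB before HBase's own overhead, so with the default limit it is not obvious the table would split much, if at all.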
J-D

On Fri, Apr 22, 2011 at 10:40 AM, Peter Haidinyak <[email protected]> wrote:
> I have a question on how HBase decides to save rows based on Row Keys. Say I
> have a million rows to insert into a new table in a ten node cluster. Each
> row's key is some random 32 byte value and there are two columns per row,
> each column contains some random 32 byte value.
> My question is how does HBase know when to 'split' the table between the ten
> nodes? Or how does HBase 'split' the random keys between the ten nodes?
>
> Thanks
>
> -Pete
