In addition to what Jon said please be aware that if compression is specified 
in the table schema, it happens at the store file level -- compression happens 
after write I/O, before read I/O, so if you transmit a 100MB object that 
compresses to 30MB, the performance impact is that of 100MB, not 30MB. 

I also try not to go above 50MB as largest cell size, for the same reason. I 
have tried storing objects larger than 100MB but this can cause out of memory 
issues on busy regionservers no matter the size of the heap. When/if HBase RPC 
can send large objects in smaller chunks, this will be less of an issue. 

Best regards,

    - Andy

Why is this email five sentences or less?
http://five.sentenc.es/


--- On Mon, 9/6/10, Jonathan Gray <jg...@facebook.com> wrote:

> From: Jonathan Gray <jg...@facebook.com>
> Subject: RE: Limits on HBase
> To: "user@hbase.apache.org" <user@hbase.apache.org>
> Date: Monday, September 6, 2010, 4:10 PM
> I'm not sure what you mean by
> "optimized cell size" or whether you're just asking about
> practical limits?
> 
> HBase is generally used with cells in the range of tens of
> bytes to hundreds of kilobytes.  However, I have used
> it with cells that are several megabytes, up to about
> 50MB.  Up at that level, I have seen some weird
> performance issues.
> 
> The most important thing is to be sure to tweak all of your
> settings.  If you have 20MB cells, you need to be sure
> to increase the flush size beyond 64MB and the split size
> beyond 256MB.  You also need enough memory to support
> all this large object allocation.
> 
> And of course, test test test.  That's the easiest way
> to see if what you want to do will work :)
> 
> When you run into problems, e-mail the list.
> 
> As far as row size is concerned, the only issue is that a
> row can never span multiple regions so a given row can only
> be in one region and thus be hosted on one server at a
> time.
> 
> JG
> 
> > -----Original Message-----
> > From: William Kang [mailto:weliam.cl...@gmail.com]
> > Sent: Monday, September 06, 2010 1:57 PM
> > To: hbase-user
> > Subject: Limits on HBase
> > 
> > Hi folks,
> > I know this question may have been asked many times,
> but I am wondering
> > if
> > there is any update on the optimized cell size (in
> megabytes) and row
> > size
> > (in megabytes)? Many thanks.
> > 
> > 
> > William
> 




Reply via email to