> The blobs vary in size from smallish (10K) to largish (20MB).

20MB is quite large, but could be harmless if most of the rows are under 1MB

> They are too small to put into individual files in HDFS, but if I have too 
> many largish rows in a region, I think I would suffer.

Yeah, need more info about the size distribution.

>
> Would it be possible to put the blobs in their own column family that has a 
> significantly different block size (10x).  I hesitate to do this mostly 
> because I already have too many column families, but since I don't expect the 
> blobs to be touched very often, a separate column family would make them 
> mostly harmless.

The block size is dynamic, if you store a single cell of 20MB then
that will be 1 block of the same size. Instead of creating a new
family, you could also create a new table.

J-D

Reply via email to