> The blobs vary in size from smallish (10K) to largish (20MB). 20MB is quite large, but could be harmless if most of the rows are under 1MB
> They are too small to put into individual files in HDFS, but if I have too > many largish rows in a region, I think I would suffer. Yeah, need more info about the size distribution. > > Would it be possible to put the blobs in their own column family that has a > significantly different block size (10x). I hesitate to do this mostly > because I already have too many column families, but since I don't expect the > blobs to be touched very often, a separate column family would make them > mostly harmless. The block size is dynamic, if you store a single cell of 20MB then that will be 1 block of the same size. Instead of creating a new family, you could also create a new table. J-D