Hi all:
I learn Kudu's paper "Kudu: Storage for Fast Analytics on Fast Data" very hard 
to find
why performance of hbase' random query is superior to kudu. "the primary key 
intervals
 of different RowSets may intersect" may be one of the reasons.

My confusion is why not keep DiskRowSets ordered on primary key globally. When 
flush MemRowSet,
the rows of MemRowSet dispatch to deltaMemStore of correspanding DiskRowSets. 
And negative side
effects is fragment of DiskRowSets, but it is worth for globally orderd of 
DiskRowSets.

best
jie

Reply via email to