On Mon, Oct 19, 2009 at 7:58 PM, Anty <[email protected]> wrote:

> I do not mean to overwrite what is currently there in loadtable.rb.
> Consider the following scenario: if I partition a dataset according to
> the regions of an existing table in hbase and generate many HFiles, then we
> can move these HFiles to their appropriate directories. Do you think this is
> practicable?
>
>
That's something other than what hbase-48 does.

You'd need a partitioner that divided the key range based on the current
region set's start and end keys (not impossible; your partitioner on startup
would do a lookup on the current set of regions).
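
A rough sketch of that partitioning idea, in plain Python rather than as a
real Hadoop Partitioner. The function name and the start keys are made up
for illustration; a real job would read the start keys from the table at
startup. It assumes keys compare as raw bytes and that the first region
starts at the empty key:

```python
from bisect import bisect_right


def make_region_partitioner(region_start_keys):
    """Return a function mapping a row key to the index of its region.

    Assumption: region_start_keys is a snapshot of the table's region
    start keys, taken once before the job runs.
    """
    starts = sorted(region_start_keys)
    if not starts or starts[0] != b"":
        raise ValueError("first region must start at the empty key")

    def partition(row_key):
        # The owning region is the last one whose start key is <= row_key.
        return bisect_right(starts, row_key) - 1

    return partition


# Hypothetical snapshot: three regions starting at "", "g" and "p".
partition = make_region_partitioner([b"", b"g", b"p"])
```

With that snapshot, a key such as b"horse" lands in the second region
(index 1). The mapping only stays valid while the region set does not
change underneath the job.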

While this new job ran, the table should be flipped to be read-only so it
didn't split and make new regions.  Before setting it read-only, it should
flush so there is nothing in memstore.

Somehow you'd have to get the latest sequenceid from hbase.  I don't think
this knowledge is currently easy to get at.

You'd then run your job.  You'd write with timestamps that were newer
than anything currently in your table.

You'd need a version of loadtable.rb that then took the mapreduce output
and moved the hfiles to the right regions.
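
The move step could be planned something like the sketch below. This is
not loadtable.rb and it touches no filesystem; it only pairs files with
destinations. It assumes one output file per region, that output file
names sort in partition order, and that each region has one directory per
column family. plan_moves and every path shown are hypothetical:

```python
import posixpath


def plan_moves(output_files, region_dirs, family):
    """Pair the i-th job output file with the i-th region's family directory.

    Assumptions: region_dirs is ordered by region start key, and the
    sorted output file names follow the same order.
    """
    if len(output_files) != len(region_dirs):
        raise ValueError("expected exactly one output file per region")
    moves = []
    for src, region_dir in zip(sorted(output_files), region_dirs):
        dst = posixpath.join(region_dir, family, posixpath.basename(src))
        moves.append((src, dst))
    return moves
```

The resulting (source, destination) pairs would then be handed to
whatever does the actual renames.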

If you want to work on something that does this Anty, make a new issue and
I'll help you out.
St.Ack
