Late answer, I just came back from vacation.

On Mon, Jan 30, 2012 at 10:03 PM, Nicolas Spiegelberg <nspiegelb...@fb.com> wrote:
> I think HFile upgrade in particular is more complicated than you think.
> We currently have production traffic running with HFileV1. It has a 5-min
> SLA. We can't afford to take the entire downtime to rewrite 100GB (or
> whatever) worth of data. We need to do this while the cluster is live.

AFAIK that's how it's done: V1 files are rewritten to V2 whenever a compaction happens, so you don't have to do any offline processing before getting the cluster back online.

J-D
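
P.S. For anyone following the thread, here is a toy Python sketch of the upgrade-on-compaction idea. It is not HBase code: StoreFile, compact and read are hypothetical names made up for illustration, and a real compaction is far more involved. The only point is that readers accept both formats while compaction always writes the newest one, so old files disappear as a side effect of normal operation.

```python
from dataclasses import dataclass

CURRENT_VERSION = 2  # newest on-disk format in this toy model (assumption)


@dataclass(frozen=True)
class StoreFile:
    """Hypothetical stand-in for an on-disk file; not an HBase class."""
    version: int
    rows: tuple  # (key, value) pairs


def compact(files):
    """Merge the input files into one, always written in the current format."""
    merged = {}
    for f in files:  # files are ordered oldest to newest, so newer values win
        merged.update(f.rows)
    return StoreFile(CURRENT_VERSION, tuple(sorted(merged.items())))


def read(files, key):
    """Readers understand every version, so old files stay servable."""
    for f in reversed(files):  # check the newest file first
        for k, v in f.rows:
            if k == key:
                return v
    return None


# Two old-format files serving traffic.
store = [StoreFile(1, (("a", 1), ("b", 2))), StoreFile(1, (("b", 3),))]
before = read(store, "b")

# A routine compaction replaces them with a single new-format file.
store = [compact(store)]
after = read(store, "b")
```

Reads return the same value before and after the compaction; the only change is that the surviving file is in the new format.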