Each split I execute creates the regions for a new day. The row key design
is yyyyMMdd_URL, and the split points are yyyyMMdd_x, yyyyMMdd_y, etc.,
chosen so that the entire load is (almost) evenly spread across the new
regions.
The problem I described causes the bulk load to move all the files into the
last region of the previous day.
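
To make the key layout concrete, here is a minimal sketch of how a
yyyyMMdd_URL row key maps to a region given a sorted list of split points.
It is plain Java and does not call HBase; the class name, the split
prefixes (h, p) and the sample URLs are made up for illustration. It only
shows the symptom: if the lookup runs against the previous day's split
points, every key of the new day falls into the last region. It is not a
diagnosis of why that happens.

```java
import java.util.Arrays;

// Hypothetical helper, not part of HBase: maps a row key to a region index
// using only a sorted array of split points.
class SplitKeyLookup {

    // Region 0 covers [ "", splits[0] ), region i covers [ splits[i-1], splits[i] ),
    // and the last region covers [ splits[n-1], "" ).
    // String.compareTo matches HBase's byte ordering for ASCII keys only.
    static int regionIndexFor(String[] sortedSplits, String rowKey) {
        int pos = Arrays.binarySearch(sortedSplits, rowKey);
        // Exact match: the key is the start key of the next region.
        // Otherwise binarySearch returns -(insertionPoint) - 1.
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    public static void main(String[] args) {
        // Split points including the new day (assumed values).
        String[] current = {"20131215_h", "20131215_p", "20131216_h", "20131216_p"};
        // Split points as they were before the new day was pre-split.
        String[] previous = {"20131215_h", "20131215_p"};

        String rowKey = "20131216_news.example";
        System.out.println(regionIndexFor(current, rowKey));  // 3, a new-day region
        System.out.println(regionIndexFor(previous, rowKey)); // 2, last region of the previous day
    }
}
```

Note that even with the current split points, keys that sort before the
first split point of the new day (for example 20131216_a...) belong to the
last region of the previous day, since no split point exists at the bare
date prefix in this sketch.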
Thanks.
On Dec 16, 2013 3:43 AM, "Bijieshan" <bijies...@huawei.com> wrote:

> Hi Amit:
> Can you provide the split-keys of the new regions and your row-key design?
>
> Thank you.
> Jieshan.
> -----Original Message-----
> From: Amit Sela [mailto:am...@infolinks.com]
> Sent: Monday, December 16, 2013 7:09 AM
> To: user@hbase.apache.org
> Subject: Bulk load moving HFiles to the wrong region
>
> Hi all,
> I'm using Hadoop 1.0.4 and HBase 0.94.12.
> When trying to bulk load using the Java API I sometimes get the HFiles
> moved to the wrong directory.
> I'm pre-splitting regions and the new regions are always the last
> (lexicographically), so when this happens all files move to the last region
> pre-split. But the split does work. I see the new regions in the webapp
> before bulk load executes. Once a table has this problem (not all the time)
> it keeps on until I restart HBase.
>
> Anyone seen something similar ?
>
> Thanks,
> Amit.
>
