Each split I execute is for a new day. The row key design is yyyyMMdd_URL, and the split points are yyyyMMdd_x, yyyyMMdd_y, etc., chosen so that the entire load is (almost) evenly spread.

The problem I described causes the bulk load to move all files into the last region of the previous day.

Thanks.

On Dec 16, 2013 3:43 AM, "Bijieshan" <bijies...@huawei.com> wrote:
> Hi Amit:
> Can you provide the split-keys of the new regions and your row-key design?
>
> Thank you.
> Jieshan.
> -----Original Message-----
> From: Amit Sela [mailto:am...@infolinks.com]
> Sent: Monday, December 16, 2013 7:09 AM
> To: user@hbase.apache.org
> Subject: Bulk load moving HFiles to the wrong region
>
> Hi all,
> I'm using Hadoop 1.0.4 and HBase 0.94.12.
> When trying to bulk load using the Java API I sometimes get the HFiles
> moved to the wrong directory.
> I'm pre-splitting regions and the new regions are always the last
> (lexicographically), so when this happens all files move to the last region
> pre-split. But the split does work. I see the new regions in the webapp
> before bulk load executes. Once a table has this problem (not all the time)
> it keeps on until I restart HBase.
>
> Anyone seen something similar?
>
> Thanks,
> Amit.
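
To make the row key design above concrete, here is a minimal sketch in plain Java (no HBase dependency) of how a yyyyMMdd_URL key maps to a region by comparing it lexicographically against the region start keys. The class name, the sample split points ("_h", "_p") and the sample URL are hypothetical, chosen only for illustration. Java string comparison matches HBase's byte-wise ordering only for ASCII keys, which is assumed here. The second lookup shows where a key would land if the lookup saw only the previous day's split points; that matches the symptom described, but whether a stale view of the region boundaries is the actual cause is not established in this thread.

```java
import java.util.Arrays;
import java.util.List;
import java.util.TreeMap;

// Hypothetical illustration only: models region lookup with a sorted map,
// it does not call any HBase API.
final class RegionLookupSketch {
    // Region start key -> label. "" is the start key of the first region.
    private final TreeMap<String, String> regionsByStartKey = new TreeMap<>();

    RegionLookupSketch(List<String> splitPoints) {
        regionsByStartKey.put("", "region[start='']");
        for (String split : splitPoints) {
            regionsByStartKey.put(split, "region[start='" + split + "']");
        }
    }

    // A row belongs to the region with the greatest start key <= row key.
    String regionFor(String rowKey) {
        return regionsByStartKey.floorEntry(rowKey).getValue();
    }

    public static void main(String[] args) {
        // Assumed split points for two consecutive days.
        List<String> previousDay = Arrays.asList("20131215_h", "20131215_p");
        List<String> bothDays = Arrays.asList(
                "20131215_h", "20131215_p", "20131216_h", "20131216_p");

        String rowKey = "20131216_http://example.com/a";

        // Lookup that sees the new day's split points.
        System.out.println(new RegionLookupSketch(bothDays).regionFor(rowKey));
        // prints region[start='20131216_h']

        // Lookup that sees only the previous day's split points.
        System.out.println(new RegionLookupSketch(previousDay).regionFor(rowKey));
        // prints region[start='20131215_p']
    }
}
```

In the second case every key for the new day sorts after the last known start key, so all of them fall into the last region of the previous day.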