[
https://issues.apache.org/jira/browse/HBASE-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133677#comment-13133677
]
Jonathan Hsieh commented on HBASE-4552:
---------------------------------------
One more piece: Mechanism to atomically rollback if a partial failures
encountered when attempting to bulk load multiple families.
For example, let's say I want to bulk load a region with cfs A, B, C. I issue
a call to an RS region to atomically bulkload the HFiles. The RS loads A and B
successfully but fails on C (hdfs failure, or rs goes down, etc). We should
rollback A and B -- if we don't we would have A and B loaded but not C and have
an atomicity violation.
> multi-CF bulk load is not atomic across column families
> -------------------------------------------------------
>
> Key: HBASE-4552
> URL: https://issues.apache.org/jira/browse/HBASE-4552
> Project: HBase
> Issue Type: Bug
> Components: regionserver
> Affects Versions: 0.92.0
> Reporter: Todd Lipcon
> Assignee: Jonathan Hsieh
> Fix For: 0.92.0
>
>
> Currently the bulk load API simply imports one HFile at a time. With
> multi-column-family support, this is inappropriate, since different CFs show
> up separately. Instead, the IPC endpoint should take a of CF -> HFiles, so we
> can online them all under a single region-wide lock.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira