[
https://issues.apache.org/jira/browse/HBASE-4552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13129013#comment-13129013
]
Todd Lipcon commented on HBASE-4552:
------------------------------------
The trick is making sure it's atomic inside the region server - not just that
the client sends all of the files for a given region in one RPC. If there are
any concurrent scanners, then they should either see all of the new data or
none of the new data on a given row. So we need some region-wide coordination.
I think probably we have to take a write-lock on HRegion#lock
> multi-CF bulk load is not atomic across column families
> -------------------------------------------------------
>
> Key: HBASE-4552
> URL: https://issues.apache.org/jira/browse/HBASE-4552
> Project: HBase
> Issue Type: Bug
> Components: regionserver
> Affects Versions: 0.92.0
> Reporter: Todd Lipcon
> Fix For: 0.92.0
>
>
> Currently the bulk load API simply imports one HFile at a time. With
> multi-column-family support, this is inappropriate, since different CFs show
> up separately. Instead, the IPC endpoint should take a of CF -> HFiles, so we
> can online them all under a single region-wide lock.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira