[jira] [Commented] (HBASE-11882) Row level consistency may not be maintained with bulk load and compaction

ramkrishna.s.vasudevan (JIRA) Tue, 02 Sep 2014 21:36:07 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-11882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14119319#comment-14119319
 ]


ramkrishna.s.vasudevan commented on HBASE-11882:
------------------------------------------------

The reason why I was telling option 1 was it is a behaviour change in bulk 
load.  Previously all the KVs were getting visible from a bulk loaded file and 
this was true in cases of where a bulk load could get completed when a scan is 
in progress.
The HBASE-11591 changed that behaviour too and now bulk load will go through 
mvcc sequence as per the other normal kvs (during scan).
The patch v2 looks fine to me and it solves the problem mentioned in the JIRA.  
But I doubt whether this solves the test case issue.  Assume in the test case 
if there is a scanner that gets started and just after that a bulk load is 
completed, the kvs in the bulk loaded file is not visibile. Does this behaviour 
happen in the test case now? If we all agree with the new behaviour then I 
think the test case may need some tweak. That was my concern.

> Row level consistency may not be maintained with bulk load and compaction
> -------------------------------------------------------------------------
>
>                 Key: HBASE-11882
>                 URL: https://issues.apache.org/jira/browse/HBASE-11882
>             Project: HBase
>          Issue Type: Bug
>          Components: regionserver
>    Affects Versions: 0.99.0, 2.0.0
>            Reporter: Jerry He
>            Assignee: Jerry He
>            Priority: Critical
>             Fix For: 0.99.0, 2.0.0
>
>         Attachments: HBASE-11882-master-v1.patch, 
> HBASE-11882-master-v2.patch, TestHRegionServerBulkLoad.java.patch
>
>
> While looking into the TestHRegionServerBulkLoad failure for HBASE-11772, I 
> found the root cause is that row level atomicity may not be maintained with 
> bulk load together with compation.
> TestHRegionServerBulkLoad is used to test bulk load atomicity. The test uses 
> multiple threads to do bulk load and scan continuously and do compactions 
> periodically. 
> It verifies row level data is always consistent across column families.
> After HBASE-11591, we added readpoint checks for bulkloaded data using the 
> seqId at the time of bulk load. Now a scanner will not see the data from a 
> bulk load if the scanner's readpoint is earlier than the bulk load seqId.
> Previously, the atomic bulk load result is visible immediately to all 
> scanners.
> The problem is with compaction after bulk load. Compaction does not lock the 
> region and it is done one store (column family) at a time. It also compact 
> away the seqId marker of bulk load.
> Here is an event sequence where the row level consistency is broken.
> 1. A scanner is started to scan a region with cf1 and cf2. The readpoint is 
> 10.
> 2. There is a bulk load that loads into cf1 and cf2. The bulk load seqId is 
> 11. Bulk load is guarded by region write lock. So it is atomic.
> 3. There is a compaction that compacts cf1. It compacts away the seqId marker 
> of the bulk load.
> 4. The scanner tries to next to row-1001. It gets the bulk load data for cf1 
> since there is no seqId preventing it.  It does not get the bulk load data 
> for cf2 since the scanner's readpoint (10) is less than the bulk load seqId 
> (11).
> Now the row level consistency is broken in this case.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-11882) Row level consistency may not be maintained with bulk load and compaction

Reply via email to