[ 
https://issues.apache.org/jira/browse/HBASE-18098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pavel Salimov updated HBASE-18098:
----------------------------------
    Description: 
Somehow it happened that a region, say AB, was split onto A and B, but some 
portion of B's data were missing in B's H-files, still presenting in A's files. 
I am not completely sure, was it a result of just a failure during split, or of 
applying hbck repair after that. 

Anyway, data of some rows belonging to B were missing accessing normally. I was 
able to access them making a scan type query regarding these rows to region A 
(despite the rows are after its end). Hbck repairs, splitting A onto A0, A1 and 
(full) compaction did not change the situation: the data were still missing in 
B migrated to A1. 

Copying the file from A1 dir to B finally made the data accessible.
 
I am reporting the issue in hope to get an advice on how to detect such an 
inconsistency as well as hoping to clarify what leaded to such a state (and 
fixed if it is a bug).

  was:
Somehow it happened that a region, say AB, was split onto A and B, but some 
portion of B's data were missing in B's H-files, still presenting in A's files. 
I am not completely sure, was it a result of just a failure during split, or of 
applying hbck repair after that. 

Anyway, data of some rows belonging to B were missing accessing normally. I was 
able to access them making a scan type query regarding these rows to region A 
(despite the rows are after its end). Hbck repairs, splitting A onto A0, A1 and 
(full) compaction did not change the situation: the data were still missing in 
B migrated to A1. 
 
I am reporting the issue in hope to get an advice on how to detect such an 
inconsistency as well as hoping to clarify what leaded to such a state (and 
fixed if it is a bug).


> Region data ended up in a file of an another region
> ---------------------------------------------------
>
>                 Key: HBASE-18098
>                 URL: https://issues.apache.org/jira/browse/HBASE-18098
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Pavel Salimov
>
> Somehow it happened that a region, say AB, was split onto A and B, but some 
> portion of B's data were missing in B's H-files, still presenting in A's 
> files. I am not completely sure, was it a result of just a failure during 
> split, or of applying hbck repair after that. 
> Anyway, data of some rows belonging to B were missing accessing normally. I 
> was able to access them making a scan type query regarding these rows to 
> region A (despite the rows are after its end). Hbck repairs, splitting A onto 
> A0, A1 and (full) compaction did not change the situation: the data were 
> still missing in B migrated to A1. 
> Copying the file from A1 dir to B finally made the data accessible.
>  
> I am reporting the issue in hope to get an advice on how to detect such an 
> inconsistency as well as hoping to clarify what leaded to such a state (and 
> fixed if it is a bug).



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to