Hi Tim,,

Just read more details, it may not be related with the issue we fixed (mob 
compaction related).
I am doing a similar test to see if I can reproduce it.

Thanks,
Huaxiang
> On Oct 12, 2016, at 10:29 AM, Tim Robertson <timrobertson...@gmail.com> wrote:
> 
> Thanks Ted, Huaxiang
> 
> I'll move this to a Cloudera forum and comment back here if it appears
> unrelated.
> 
> On Wed, Oct 12, 2016 at 7:24 PM, Huaxiang Sun <h...@cloudera.com 
> <mailto:h...@cloudera.com>> wrote:
> 
>> By the way, I forgot the forum link: http://community.cloudera.com 
>> <http://community.cloudera.com/> <
>> http://community.cloudera.com/ <http://community.cloudera.com/>>
>> 
>> Thanks,
>> Huaxiang
>> 
>>> On Oct 12, 2016, at 10:10 AM, Huaxiang Sun <h...@cloudera.com 
>>> <mailto:h...@cloudera.com>> wrote:
>>> 
>>> Hi Tim,
>>> 
>>>   I believe that it runs into an issue which is specific to cloudera
>> release we fixed recently. For details, could you discuss it in cdh forum?
>>> Copy me(h...@cloudera.com <mailto:h...@cloudera.com> 
>>> <mailto:h...@cloudera.com <mailto:h...@cloudera.com>>) in the forum so I
>> can explain more there.
>>> 
>>>   Thanks,
>>>   Huaxiang
>>> 
>>>> On Oct 12, 2016, at 8:13 AM, Ted Yu <yuzhih...@gmail.com 
>>>> <mailto:yuzhih...@gmail.com> <mailto:
>> yuzhih...@gmail.com <mailto:yuzhih...@gmail.com>>> wrote:
>>>> 
>>>> Have you looked at HBASE-16578 ?
>>>> 
>>>> Cheers
>>>> 
>>>>> On Oct 12, 2016, at 8:02 AM, Tim Robertson <timrobertson...@gmail.com 
>>>>> <mailto:timrobertson...@gmail.com>
>> <mailto:timrobertson...@gmail.com <mailto:timrobertson...@gmail.com>>> wrote:
>>>>> 
>>>>> Hi devs,
>>>>> [Had a quick chat with Lars G. about this and before opening a Jira I
>>>>> thought I'd raise it here first]
>>>>> 
>>>>> We have just experienced data loss in HBase 1.0.0-cdh5.4.10.
>>>>> 
>>>>> Before I dig into this further, I'd like to just ask if anyone has seen
>>>>> this before?
>>>>> 
>>>>> The initial state was a table (tim_test) built with MOB support and a
>> few
>>>>> 10's million rows and 10's billions of cells.
>>>>> 
>>>>> I wanted to rename the table to get this into production and did so as
>>>>> follows:
>>>>> 
>>>>> snapshot 'tim_test', 'tim_test-snapshot'
>>>>> clone_snapshot 'tim_test-snapshot', 'prod_b_map'
>>>>> 
>>>>> At this stage the application all looked good, and so I continued with:
>>>>> 
>>>>> delete_snapshot 'tim_test-snapshot'
>>>>> disable 'tim_test'
>>>>> drop ‘tim_test’
>>>>> 
>>>>> Then things went... awry and data just started dropping out in the app.
>>>>> Before long, all MOB data seemingly is gone.
>>>>> 
>>>>> The references in the new table MOB folder appear to point to the
>> source
>>>>> table (e.g.
>>>>> /hbase/mobdir/data/default/prod_b_map/ba42a2e8e9b669d9fc85bdfeed2f5f
>> 2a/EPSG_4326/tim_test=14bf5f1737ac65c34615ed97c0b7de06-
>> d41d8cd98f00b204e9800998ecf8427e20161006ff8baa70d21f408caefe8ae6318dfba2).
>>>>> 
>>>>> The RS logs full of ERROR like:
>>>>> 
>>>>> 2016-10-12 15:19:14,640 ERROR org.apache.hadoop.hbase.
>> regionserver.HStore:
>>>>> The mob file
>>>>> d41d8cd98f00b204e9800998ecf8427e20161006b59865f80e604781a79e
>> bfa2ddd66b48
>>>>> could not be found in the locations
>>>>> [hdfs://ha-nn/hbase/mobdir/data/default/tim_test/
>> 14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326 <hdfs://ha-nn/hbase/mobdir/ 
>> <hdfs://ha-nn/hbase/mobdir/>
>> data/default/tim_test/14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326>,
>>>>> hdfs://ha-nn/hbase/archive/data/default/tim_test/ 
>>>>> <hdfs://ha-nn/hbase/archive/data/default/tim_test/>
>> 14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326] <hdfs://ha-nn/hbase/archive/
>> data/default/tim_test/14bf5f1737ac65c34615ed97c0b7de06/EPSG_4326]>
>>>>> 
>>>>> What I don't know is:
>>>>> 1) was this running a background task to copy the MOB data when the
>>>>> snapshot was cloned and I just deleted the source before the copy was
>>>>> complete?
>>>>> - or
>>>>> 2) when running "snapshot and clone" it just references the source MOB
>>>>> data until a (?) change?
>>>>> 3) snapshot and clone just doesn't support MOB?
>>>>> 
>>>>> Can anyone shed some light on this easily before I dig into it please?
>>>>> 
>>>>> While this situation exists (at least in 1.0.0) might it be good to get
>>>>> info about data loss for MOB tables into the snapshot clone docs?
>>>>> 
>>>>> Thanks,
>>>>> Tim

Reply via email to