[ 
https://issues.apache.org/jira/browse/HBASE-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-10786:
---------------------------

    Status: Patch Available  (was: Open)

> If snapshot verification fails with 'Regions moved', the message should 
> contain the name of region causing the failure
> ----------------------------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-10786
>                 URL: https://issues.apache.org/jira/browse/HBASE-10786
>             Project: HBase
>          Issue Type: Task
>            Reporter: Ted Yu
>            Assignee: Ted Yu
>            Priority: Minor
>         Attachments: 10786-v1.txt
>
>
> I was trying to find cause for test failure in 
> https://builds.apache.org/job/PreCommit-HBASE-Build/9036//testReport/org.apache.hadoop.hbase.snapshot/TestSecureExportSnapshot/testExportRetry/
>  :
> {code}
> org.apache.hadoop.hbase.snapshot.HBaseSnapshotException: 
> org.apache.hadoop.hbase.snapshot.HBaseSnapshotException: Snapshot { 
> ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 type=FLUSH } had an 
> error.  Procedure emptySnaptb0-1395177346656 { waiting=[] done=[] }
>       at 
> org.apache.hadoop.hbase.master.snapshot.SnapshotManager.isSnapshotDone(SnapshotManager.java:342)
>       at 
> org.apache.hadoop.hbase.master.HMaster.isSnapshotDone(HMaster.java:3007)
>       at 
> org.apache.hadoop.hbase.protobuf.generated.MasterProtos$MasterService$2.callBlockingMethod(MasterProtos.java:40494)
>       at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2020)
>       at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:98)
>       at 
> org.apache.hadoop.hbase.ipc.FifoRpcScheduler$1.run(FifoRpcScheduler.java:73)
>       at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>       at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>       at java.lang.Thread.run(Thread.java:662)
> Caused by: org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException via 
> Failed taking snapshot { ss=emptySnaptb0-1395177346656 
> table=testtb-1395177346656 type=FLUSH } due to exception:Regions moved during 
> the snapshot '{ ss=emptySnaptb0-1395177346656 table=testtb-1395177346656 
> type=FLUSH }'. expected=9 
> snapshotted=8:org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException: 
> Regions moved during the snapshot '{ ss=emptySnaptb0-1395177346656 
> table=testtb-1395177346656 type=FLUSH }'. expected=9 snapshotted=8
>       at 
> org.apache.hadoop.hbase.errorhandling.ForeignExceptionDispatcher.rethrowException(ForeignExceptionDispatcher.java:83)
>       at 
> org.apache.hadoop.hbase.master.snapshot.TakeSnapshotHandler.rethrowExceptionIfFailed(TakeSnapshotHandler.java:320)
>       at 
> org.apache.hadoop.hbase.master.snapshot.SnapshotManager.isSnapshotDone(SnapshotManager.java:332)
>       ... 11 more
> {code}
> However, it is not clear which region caused the verification to fail.
> I searched for log from balancer but found none.
> The exception message should include region name which caused the 
> verification to fail.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to