Sangeetha Hariharan created CLOUDSTACK-5469:
-----------------------------------------------

             Summary: Snapshot creation fails with following exception - 
"Failed to backup snapshot: qemu-img: Could not delete snapshot 
'89eced14-9121-44a7-bb97-26b567795726': -2 (No such file or directory)"
                 Key: CLOUDSTACK-5469
                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-5469
             Project: CloudStack
          Issue Type: Bug
      Security Level: Public (Anyone can view this level - this is the default.)
          Components: Management Server
    Affects Versions: 4.3.0
         Environment: Build from 4.3
            Reporter: Sangeetha Hariharan
            Priority: Critical
             Fix For: 4.3.0


Set up: 
Advanced Zone with 2 KVM (RHEL 6.3) hosts.
 2 NFS secondary stores set up. 
Steps to reproduce the problem:
 1. Deploy 5 Vms in each of the hosts with 10 GB ROOT volume size , so we start 
with 10 Vms. 
2. Start concurrent snapshots for ROOT volumes of all the Vms. 

1 of the secondary store -ss1- had the nfs server down for 1 and 1/2 hours. 
The other secondary store -ss2 - was always reachable. 
Snapshot tasks that went to the ss1 , succeeded after the nfs server was 
brought up (It temporarily halted when the the nfs server was down and resumed 
when the nsf server was made available). 

First set of snapshot tasks that went to the ss2 all succeeded.
 But the next hourly snapshot tasks, few of them failed with following 
exception: 2013-12-11 16:33:22,427 DEBUG [c.c.s.s.SnapshotManagerImpl] 
(Job-Executor-64:ctx-9c70ad77 ctx-3d959fa6) Failed t o create snapshot 
com.cloud.utils.exception.CloudRuntimeException: Failed to backup snapshot: 
qemu-img: Could not delete snapshot '89eced14-9121-44a7-bb97-26b567795726': -2 
(No such file or directory)Failed to delete snapshot 89eced14-9121-44 
a7-bb97-26b567795726 for path 
/mnt/c20ea198-e8ca-33c3-9f11-e361ec9b5532/71a5dce2-da7c-4692-8f25-ba37e5296886 
at 
org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:27
 5) at 
org.apache.cloudstack.storage.snapshot.XenserverSnapshotStrategy.backupSnapshot(XenserverSnapshotStra
 tegy.java:135) at 
org.apache.cloudstack.storage.snapshot.XenserverSnapshotStrategy.takeSnapshot(XenserverSnapshotStrate
 gy.java:294) at 
com.cloud.storage.snapshot.SnapshotManagerImpl.takeSnapshot(SnapshotManagerImpl.java:951)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:601) at 
org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)
 at 
org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocati
 on.java:183) at 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:
 150) at 
org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.ja
 va:91) at 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:
 172) at 
org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:204)
 at $Proxy161.takeSnapshot(Unknown Source) at 
org.apache.cloudstack.storage.volume.VolumeServiceImpl.takeSnapshot(VolumeServiceImpl.java:1341)
 at 
com.cloud.storage.VolumeApiServiceImpl.takeSnapshot(VolumeApiServiceImpl.java:1461)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:601) at 
org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:317)

 Copy to the secondary has succeed. Failure happens after this. 

[root@Rack3Host5 118]# ls -ltr
total 10002852
-rw-r--r--. 1 root root 3637903360 Dec 11 20:33 
89eced14-9121-44a7-bb97-26b567795726
-rw-r--r--. 1 root root 3638755328 Dec 11 21:37 
b38d93db-4c14-45a7-9274-639ad95a3f29
-rw-r--r--. 1 root root 2956619776 Dec 11 22:24 
452c8841-2025-41da-b6ec-49cea2a49da8
[root@Rack3Host5 118]#


Following are the volumes which are in "CreatedOnPrimary" state for which the 
failure occured. 
| 113 | BackedUp | 2013-12-11 20:57:03 | 
| 112 | BackedUp | 2013-12-11 20:57:03 |
 | 110 | BackedUp | 2013-12-11 20:57:03 |
 | 121 | CreatedOnPrimary | 2013-12-11 20:57:04 | 
| 118 | CreatedOnPrimary | 2013-12-11 20:57:04 |
 | 117 | BackedUp | 2013-12-11 20:57:04 |
 | 116 | CreatedOnPrimary | 2013-12-11 20:57:04 | 




--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Reply via email to