Hello Andrei,

As yourself noticed, ACS has a hardcoded threshold for secondary storages. In cases that the secondary storage has large capacities, 10% can mean a lot of storage. There is an open PR (https://github.com/apache/cloudstack/pull/4790) that externalize this threshold to allow operators decide how much they need. Also, the logs of secondary storage management were improved with PR https://github.com/apache/cloudstack/pull/4955.

With respect to KVM snapshots, volume snapshots are taken in a quite peculiar way. Instead of taking volume snapshots directly, ACS takes a full snapshot of the VM, which may cause freeze on it (VM) due to memory snapshot, and then extracts the disk from the VM snapshot. Due to it, it was open an issue https://github.com/apache/cloudstack/issues/5124 to discuss a new workflow to snapshots on KVM.

I am already implementing the solution for the issue 5124, and to improve this whole snapshot process for KVM; However, it is a complex and long standing job. As soon as we have something, I would appreciate to receive some feedback from you.

Regards,
Guto


On 2021/06/16 16:15:51, Andrei Mikhailovsky wrote:
> Hello,>
>
> I've done some more investigation and indeed, the snapshots were not taken because the secondary storage was over 90% used. I have started cleaning some of the older volumes and noticed another problem. After removing snapshots, they do not seem to be removed from the secondary storage. I've removed all snapshots over 24 hours ago and it looks like the disk space hasn't been freed up at all.>
>
> Looks like there are issues with snapshotting function after all.>
>
> Andrei>
>
>
>
> ----- Original Message ----->
> > From: "Harikrishna Patnala" >
> > To: "users" >
> > Sent: Tuesday, 8 June, 2021 03:33:57>
> > Subject: Re: Snapshots are not working after upgrading to 4.15.0>
>
> > Hi Andrei,>
> > >
> > Can you check the following things and let us know?>
> > >
> > >
> > 1. Can you try creating a new volume and then create snapshot of that, to check>
> > if this an issue with old entries>
> > 2. For the snapshots which are failing can you check if you are seeing any> > > error messages like this "Can't find an image storage in zone with less than".>
> > This is to check if secondary storage free space check failed.>
> > 3. For the snapshots which are failing and if it is delta snapshot can you> > > check if its parent's snapshot entry exists in "snapshot_store_ref" table with> > > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This is> > > to find the secondary storage where the parent snapshot backup is located.>
> > >
> > Regards,>
> > Harikrishna>
> > ________________________________>
> > From: Andrei Mikhailovsky >
> > Sent: Monday, June 7, 2021 7:00 PM>
> > To: users >
> > Subject: Snapshots are not working after upgrading to 4.15.0>
> > >
> > Hello everyone,>
> > >
> > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None> > > of the volumes are being snapshotted regardless if the snapshot is initiated> > > manually or from the schedule. The strange thing is that if I manually take the> > > snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error>
> > status. Here is what I see in the management server logs:>
> > >
> > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]>
> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done>
> > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143>
> > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor]>
> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Remove>
> > job-86143 from job monitoring>
> > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl]>
> > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot> > > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores>
> > at>
> > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271)>
> > at>
> > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171)>
> > at>
> > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238)>
> > at>
> > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48)>
> > at>
> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55)>
> > at>
> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102)>
> > at>
> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52)>
> > at>
> > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45)>
> > at>
> > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)>
> > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)>
> > at>
> > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304)>
> > at>
> > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)>
> > at>
> > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)>
> > at java.base/java.lang.Thread.run(Thread.java:829)>
> > 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl]>
> > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot>
> > failed, for snapshot with ID 53531, left with 2 more attempts>
> > >
> > >
> > I've checked and the Secondary storage is configured and visible in the GUI. I> > > can also mount it manually from the management server and a couple of host> > > servers that I've tested. In addition, I can successfully upload an ISO image> > > and that registers just fine and I can create new VMs using the newly uploaded>
> > ISO image.>
> > >
> > I've had no such problems with 4.13.x ACS, so the issue seems to have been>
> > introduced after doing the upgrade to 4.15.0.>
> > >
> > Could you please let me know how do I fix the issue?>
> > >
> > Cheers>
> > >
> > andrei>
>

Reply via email to