[ https://issues.apache.org/jira/browse/CLOUDSTACK-3715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sailaja Mada reopened CLOUDSTACK-3715: -------------------------------------- Issue is happening even on a single node Management server: Hence reopening the defect. 013-09-18 13:37:53,488 DEBUG [cloud.storage.VolumeManagerImpl] (Job-Executor-43:job-65 = [ 8e1a2c00-2309-4c98-b792-d10f6ad21972 ]) Failed to migrated vm VM[User|dvsi1] along with its volumes. com.cloud.utils.exception.CloudRuntimeException: Error while migrating the vm VM[User|dvsi1] to host Host[-8-Routing]. Exception: javax.xml.ws.WebServiceException Message: java.net.SocketTimeoutException: Read timed out Stack: javax.xml.ws.WebServiceException: java.net.SocketTimeoutException: Read timed out at com.sun.xml.internal.ws.transport.http.client.HttpClientTransport.readResponseCodeAndMessage(HttpClientTransport.java:201) at com.sun.xml.internal.ws.transport.http.client.HttpTransportPipe.process(HttpTransportPipe.java:151) at com.sun.xml.internal.ws.transport.http.client.HttpTransportPipe.processRequest(HttpTransportPipe.java:83) at com.sun.xml.internal.ws.transport.DeferredTransportPipe.processRequest(DeferredTransportPipe.java:78) at com.sun.xml.internal.ws.api.pipe.Fiber.__doRun(Fiber.java:587) at com.sun.xml.internal.ws.api.pipe.Fiber._doRun(Fiber.java:546) at com.sun.xml.internal.ws.api.pipe.Fiber.doRun(Fiber.java:531) at com.sun.xml.internal.ws.api.pipe.Fiber.runSync(Fiber.java:428) at com.sun.xml.internal.ws.client.Stub.process(Stub.java:211) at com.sun.xml.internal.ws.client.sei.SEIStub.doProcess(SEIStub.java:124) at com.sun.xml.internal.ws.client.sei.SyncMethodHandler.invoke(SyncMethodHandler.java:98) at com.sun.xml.internal.ws.client.sei.SyncMethodHandler.invoke(SyncMethodHandler.java:78) at com.sun.xml.internal.ws.client.sei.SEIStub.invoke(SEIStub.java:107) at $Proxy90.waitForUpdates(Unknown Source) at com.cloud.hypervisor.vmware.util.VmwareClient.waitForValues(VmwareClient.java:428) at com.cloud.hypervisor.vmware.util.VmwareClient.waitForTask(VmwareClient.java:371) at com.cloud.hypervisor.vmware.mo.VirtualMachineMO.changeDatastore(VirtualMachineMO.java:344) at com.cloud.hypervisor.vmware.resource.VmwareResource.execute(VmwareResource.java:4275) at com.cloud.hypervisor.vmware.resource.VmwareResource.executeRequest(VmwareResource.java:461) at com.cloud.agent.manager.DirectAgentAttache$Task.run(DirectAgentAttache.java:186) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:165) at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:679) Caused by: java.net.SocketTimeoutException: Read timed out at java.net.SocketInputStream.socketRead0(Native Method) at java.net.SocketInputStream.read(SocketInputStream.java:146) at sun.security.ssl.InputRecord.readFully(InputRecord.java:312) at sun.security.ssl.InputRecord.read(InputRecord.java:350) at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:850) at sun.security.ssl.SSLSocketImpl.readDataRecord(SSLSocketImpl.java:807) at sun.security.ssl.AppInputStream.read(AppInputStream.java:94) at java.io.BufferedInputStream.fill(BufferedInputStream.java:235) at java.io.BufferedInputStream.read1(BufferedInputStream.java:275) at java.io.BufferedInputStream.read(BufferedInputStream.java:334) at sun.net.www.http.HttpClient.parseHTTPHeader(HttpClient.java:688) at sun.net.www.http.HttpClient.parseHTTP(HttpClient.java:633) at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1162) at java.net.HttpURLConnection.getResponseCode(HttpURLConnection.java:397) at sun.net.www.protocol.https.HttpsURLConnectionImpl.getResponseCode(HttpsURLConnectionImpl.java:338) at com.sun.xml.internal.ws.transport.http.client.HttpClientTransport.readResponseCodeAndMessage(HttpClientTransport.java:198) ... 27 more 2013-09-18 13:37:53,489 INFO [cloud.vm.VirtualMachineManagerImpl] (Job-Executor-43:job-65 = [ 8e1a2c00-2309-4c98-b792-d10f6ad21972 ]) Migration was unsuccessful. Cleaning up: VM[User|dvsi1] 2013-09-18 13:37:53,490 WARN [apache.cloudstack.alerts] (Job-Executor-43:job-65 = [ 8e1a2c00-2309-4c98-b792-d10f6ad21972 ]) alertType:: 17 // dataCenterId:: 1 // podId:: 1 // clusterId:: null // message:: Unable to migrate vm i-3-13-VM from host 10.102.192.20 in zone 307Zone1 and pod 307Zone1 2013-09-18 13:37:53,492 DEBUG [cloud.alert.AlertManagerImpl] (Job-Executor-43:job-65 = [ 8e1a2c00-2309-4c98-b792-d10f6ad21972 ]) Have already sent: 1 emails for alert type '17' -- skipping send ema > Live Migration of Virtual instances operation is getting timedout on a > multinode mgmt setup > ------------------------------------------------------------------------------------------- > > Key: CLOUDSTACK-3715 > URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3715 > Project: CloudStack > Issue Type: Bug > Security Level: Public(Anyone can view this level - this is the > default.) > Components: Management Server > Affects Versions: 4.2.0 > Reporter: Sailaja Mada > Assignee: Alena Prokharchyk > Priority: Blocker > Fix For: 4.2.0 > > Attachments: 195113management-server.log.gz, > 195117management-server.log.gz, apilog.log, cloud-backup.dmp.gz, > cloud-backup.sql.gz, management-server.log > > > Setup: Multinode Management setup. > Steps: > 1. Configure Adv Zone with 2 VMWARE clusters each with one hosts with Zone > wide primary storage ( Standard vSwitch cluster) > 2. Deploy VM using User account > 3. Tried to Live migrate VM from cluster1 (host 1) to Cluster 2 (host2 ) > Observation: > 1. Migration took very log time and finally failed saying operation timed out > : > 2013-07-22 17:46:06,288 DEBUG [cloud.capacity.CapacityManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) VM state > transitted from :Migrating to Running with event: OperationFailedvm's > original host id: 4 new host id: 4 host id before state transition: 1 > 2013-07-22 17:46:06,292 INFO [vmware.resource.VmwareResource] > (DirectAgent-421:10.102.192.18) VM i-4-9-VM is no longer in vSphere > 2013-07-22 17:46:06,293 DEBUG [agent.manager.DirectAgentAttache] > (DirectAgent-421:null) Seq 1-1311245319: Response Received: > 2013-07-22 17:46:06,294 DEBUG [agent.transport.Request] > (DirectAgent-421:null) Seq 1-1311245319: Processing: { Ans: , MgmtId: > 94838926819810, via: 1, Ver: v1, Flags: 10, > [{"com.cloud.agent.api.StopAnswer":{"vncPort":0,"result":true,"details":"VM > i-4-9-VM is no longer in vSphere","wait":0}}] } > 2013-07-22 17:46:06,294 DEBUG [agent.manager.AgentAttache] > (DirectAgent-421:null) Seq 1-1311245319: Unable to find listener. > 2013-07-22 17:46:06,307 DEBUG [cloud.capacity.CapacityManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) Hosts's > actual total CPU: 9572 and CPU after applying overprovisioning: 9572 > 2013-07-22 17:46:06,307 DEBUG [cloud.capacity.CapacityManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) Hosts's > actual total RAM: 17166258176 and RAM after applying overprovisioning: > 17166258176 > 2013-07-22 17:46:06,307 DEBUG [cloud.capacity.CapacityManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) release > cpu from host: 1, old used: 2000,reserved: 0, actual total: 9572, total with > overprovisioning: 9572; new used: 200,reserved:0; movedfromreserved: > false,moveToReserveredfalse > 2013-07-22 17:46:06,307 DEBUG [cloud.capacity.CapacityManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) release > mem from host: 1, old used: 2013265920,reserved: 0, total: 17166258176; new > used: 2013265920,reserved:0; movedfromreserved: false,moveToReserveredfalse > 2013-07-22 17:46:06,345 ERROR [cloud.async.AsyncJobManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) > Unexpected exception while executing > org.apache.cloudstack.api.command.admin.vm.MigrateVirtualMachineWithVolumeCmd > com.cloud.utils.exception.CloudRuntimeException: Failed to migrated vm > VM[User|newuser1i1] along with its volumes. > com.cloud.exception.AgentUnavailableException: Resource [Host:1] is > unreachable: Host 1: Operation timed out on storage motion for > VM[User|newuser1i1] > at > com.cloud.storage.VolumeManagerImpl.migrateVolumes(VolumeManagerImpl.java:2263) > at > com.cloud.vm.VirtualMachineManagerImpl.migrateWithStorage(VirtualMachineManagerImpl.java:1780) > at > com.cloud.vm.UserVmManagerImpl.migrateVirtualMachineWithVolume(UserVmManagerImpl.java:4046) > at > com.cloud.utils.component.ComponentInstantiationPostProcessor$InterceptorDispatcher.intercept(ComponentInstantiationPostProcessor.java:125) > at > org.apache.cloudstack.api.command.admin.vm.MigrateVirtualMachineWithVolumeCmd.execute(MigrateVirtualMachineWithVolumeCmd.java:137) > at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:158) > at > com.cloud.async.AsyncJobManagerImpl$1.run(AsyncJobManagerImpl.java:531) > at > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) > at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) > at java.util.concurrent.FutureTask.run(FutureTask.java:166) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) > at java.lang.Thread.run(Thread.java:679) > 2013-07-22 17:46:06,350 DEBUG [cloud.async.AsyncJobManagerImpl] > (Job-Executor-40:job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ]) Complete > async job-133 = [ 4a2f6d84-236d-4bdd-a2e5-72fa336e0274 ], jobStatus: 2, > resultCode: 530, result: Error Code: 530 Error text: Failed to migrated vm > VM[User|newuser1i1] along with its volumes. > com.cloud.exception.AgentUnavailableException: Resource [Host:1] is > unreachable: Host 1: Operation timed out on storage motion for > VM[User|newuser1i1] -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira