Re: [Users] Cannot stop or restart container
Hey there. Thanks for the tip about vzps, Thorsten. We are still in that situation I described a few days ago, with a container which won't stop (operation timed out). vzps shows this. # vzps -E 84 ax VEID PID TTY STAT TIME COMMAND 84 32226 ?Z 0:00 [init ] Does this offer any clue to what went wrong, how to prevent it, and how possibly to undo it without rebooting the HN? I understand what zombies/defunct processes are, and that they tend not to go away; but if their hanging around can cause a VE to cease functioning, that's not so great. Any ideas on how we can fix this? -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
I found my bug report from October 2009 about not being able to stop containers. http://bugzilla.openvz.org/show_bug.cgi?id=1345 Back then we couldn't replicate it, and we chalked it up to "must have already been fixed" or to issues with the VE configs, but here it is today. Any ideas as to how I can figure this out and get this VE restarted, preferably without pulling the plug on the HN? -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
On Tue, Mar 30, 2010 at 4:42 PM, Gregor at HostGIS wrote: >> It gets locked up trying to stop nfsmount. I believe >> the problem is the virtual nic is down before the nfsmount tries to >> stop and nfsmount waits forever to close the connection to the >> server.. > > Interesting. No connections made here, though; no NFS or similar. > > Would that apply to open listening sockets as well, e.g. httpd still > running? > I believe that is doubtful. BTW, if you open a second terminal and vzctl enter you can see what's going on. That is how I determined the cause of my problem.. John ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
It gets locked up trying to stop nfsmount. I believe the problem is the virtual nic is down before the nfsmount tries to stop and nfsmount waits forever to close the connection to the server.. Interesting. No connections made here, though; no NFS or similar. Would that apply to open listening sockets as well, e.g. httpd still running? -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
Thorsten Schifferdecker wrote: and any entries logged in kern.log/dmesg as well ? Nope. Just the same message "CT: 85: failed to start with err=-12" -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
What template are you using and where did you get it from? Slackware 11 I created it myself, based on a working system. This is the same template which we had used previously those months ago, when we weren't able to replicate the bug. Can you vzctl enter it? Nope. "enter into CT 84 failed" -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
Hi, and any entries logged in kern.log/dmesg as well ? Bye, Thorsten Steven Crothers schrieb: > What template are you using and where did you get it from? > > I recently had a similar result from an Archlinux template, however I would > need to know more about the container to be able to relate it to my case. > > Can you vzctl enter it? > > -Original Message- > From: users-boun...@openvz.org [mailto:users-boun...@openvz.org] On Behalf > Of Gregor at HostGIS > Sent: Tuesday, March 30, 2010 3:53 PM > To: OpenVZ Maling List > Subject: [Users] Cannot stop or restart container > > Hey guys. I reported a bug some months ago, not being able to stop and > restart a container, getting a timeout instead. When I reported the bug > months ago, I was unable to replicate it at the time. But here it is > once again: I cannot stop or restart the VE. > > This clip shows the results which I see: > > # vzctl restart Customer > Restarting container > Stopping container ... > Unable to stop container: operation timed out > > # vzlist Customer > CTID NPROC STATUS IP_ADDR HOSTNAME >84 3 running - Customer > > The 3 processes, I don't know what they are but they may be related to > this, and may be a clue: After the initial vzctl restart, it got down to > 1 process before it timed out. I ran "vzctl exec Customer ps ax" and now > I notice that the NPROC increments each time I do this. However from the > HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not > the ones in the VE. > > Any thoughts on how I can finally stop this thing and restart it? Note > that this is a production HN with production VEs, so rebooting it is not > an attractive option. > > HN OS is Fedora 9 > Kernel is 2.6.24.ovz009.1 > vzctl-3.0.23-1.x86_64 > vzctl-lib-3.0.23-1.x86_64 ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
Re: [Users] Cannot stop or restart container
> I recently had a similar result from an Archlinux template, however I would > need to know more about the container to be able to relate it to my case. > > Can you vzctl enter it? > I have a restart problem on a self made gentoo container however I know the cause. It gets locked up trying to stop nfsmount. I believe the problem is the virtual nic is down before the nfsmount tries to stop and nfsmount waits forever to close the connection to the server.. John ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
RE: [Users] Cannot stop or restart container
What template are you using and where did you get it from? I recently had a similar result from an Archlinux template, however I would need to know more about the container to be able to relate it to my case. Can you vzctl enter it? -Original Message- From: users-boun...@openvz.org [mailto:users-boun...@openvz.org] On Behalf Of Gregor at HostGIS Sent: Tuesday, March 30, 2010 3:53 PM To: OpenVZ Maling List Subject: [Users] Cannot stop or restart container Hey guys. I reported a bug some months ago, not being able to stop and restart a container, getting a timeout instead. When I reported the bug months ago, I was unable to replicate it at the time. But here it is once again: I cannot stop or restart the VE. This clip shows the results which I see: # vzctl restart Customer Restarting container Stopping container ... Unable to stop container: operation timed out # vzlist Customer CTID NPROC STATUS IP_ADDR HOSTNAME 84 3 running - Customer The 3 processes, I don't know what they are but they may be related to this, and may be a clue: After the initial vzctl restart, it got down to 1 process before it timed out. I ran "vzctl exec Customer ps ax" and now I notice that the NPROC increments each time I do this. However from the HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not the ones in the VE. Any thoughts on how I can finally stop this thing and restart it? Note that this is a production HN with production VEs, so rebooting it is not an attractive option. HN OS is Fedora 9 Kernel is 2.6.24.ovz009.1 vzctl-3.0.23-1.x86_64 vzctl-lib-3.0.23-1.x86_64 -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up - only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users
[Users] Cannot stop or restart container
Hey guys. I reported a bug some months ago, not being able to stop and restart a container, getting a timeout instead. When I reported the bug months ago, I was unable to replicate it at the time. But here it is once again: I cannot stop or restart the VE. This clip shows the results which I see: # vzctl restart Customer Restarting container Stopping container ... Unable to stop container: operation timed out # vzlist Customer CTID NPROC STATUS IP_ADDR HOSTNAME 84 3 running - Customer The 3 processes, I don't know what they are but they may be related to this, and may be a clue: After the initial vzctl restart, it got down to 1 process before it timed out. I ran "vzctl exec Customer ps ax" and now I notice that the NPROC increments each time I do this. However from the HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not the ones in the VE. Any thoughts on how I can finally stop this thing and restart it? Note that this is a production HN with production VEs, so rebooting it is not an attractive option. HN OS is Fedora 9 Kernel is 2.6.24.ovz009.1 vzctl-3.0.23-1.x86_64 vzctl-lib-3.0.23-1.x86_64 -- HostGIS, Open Source solutions for the global GIS community Greg Allensworth - SysAdmin, Programmer, GIS Person, Security Network+ Server+ A+ Security+ Linux+ PHP PostgreSQL MySQL DHTML/JavaScript/AJAX "No one cares if you can back up — only if you can recover." ___ Users mailing list Users@openvz.org https://openvz.org/mailman/listinfo/users