Re: [Users] Cannot stop or restart container

2010-04-02 Thread Gregor at HostGIS

Hey there. Thanks for the tip about vzps, Thorsten.

We are still in that situation I described a few days ago, with a 
container which won't stop (operation timed out). vzps shows this.


# vzps -E 84 ax
 VEID   PID TTY  STAT   TIME COMMAND
   84 32226 ?Z  0:00 [init ]

Does this offer any clue to what went wrong, how to prevent it, and how 
possibly to undo it without rebooting the HN? I understand what 
zombies/defunct processes are, and that they tend not to go away; but if 
their hanging around can cause a VE to cease functioning, that's not so 
great. Any ideas on how we can fix this?


--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread Gregor at HostGIS
I found my bug report from October 2009 about not being able to stop 
containers.

http://bugzilla.openvz.org/show_bug.cgi?id=1345

Back then we couldn't replicate it, and we chalked it up to "must have 
already been fixed" or to issues with the VE configs, but here it is today.


Any ideas as to how I can figure this out and get this VE restarted, 
preferably without pulling the plug on the HN?


--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread John Drescher
On Tue, Mar 30, 2010 at 4:42 PM, Gregor at HostGIS  wrote:
>> It gets locked up trying to stop nfsmount. I believe
>> the problem is the virtual nic is down before the nfsmount tries to
>> stop and nfsmount waits forever to close the connection to the
>> server..
>
> Interesting. No connections made here, though; no NFS or similar.
>
> Would that apply to open listening sockets as well, e.g. httpd still
> running?
>

I believe that is doubtful.

BTW, if you open a second terminal and vzctl enter you can see what's
going on. That is how I determined the cause of my problem..

John
___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread Gregor at HostGIS

It gets locked up trying to stop nfsmount. I believe
the problem is the virtual nic is down before the nfsmount tries to
stop and nfsmount waits forever to close the connection to the
server..


Interesting. No connections made here, though; no NFS or similar.

Would that apply to open listening sockets as well, e.g. httpd still 
running?


--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread Gregor at HostGIS

Thorsten Schifferdecker wrote:

and any entries logged in kern.log/dmesg as well ?


Nope. Just the same message "CT: 85: failed to start with err=-12"

--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread Gregor at HostGIS

What template are you using and where did you get it from?


Slackware 11  I created it myself, based on a working system.

This is the same template which we had used previously those months ago, 
when we weren't able to replicate the bug.




Can you vzctl enter it?


Nope. "enter into CT 84 failed"

--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread Thorsten Schifferdecker
Hi,

and any entries logged in kern.log/dmesg as well ?

Bye,
Thorsten

Steven Crothers schrieb:
> What template are you using and where did you get it from?
> 
> I recently had a similar result from an Archlinux template, however I would
> need to know more about the container to be able to relate it to my case.
> 
> Can you vzctl enter it?
> 
> -Original Message-
> From: users-boun...@openvz.org [mailto:users-boun...@openvz.org] On Behalf
> Of Gregor at HostGIS
> Sent: Tuesday, March 30, 2010 3:53 PM
> To: OpenVZ Maling List
> Subject: [Users] Cannot stop or restart container
> 
> Hey guys. I reported a bug some months ago, not being able to stop and 
> restart a container, getting a timeout instead. When I reported the bug 
> months ago, I was unable to replicate it at the time. But here it is 
> once again: I cannot stop or restart the VE.
> 
> This clip shows the results which I see:
> 
> # vzctl restart Customer
> Restarting container
> Stopping container ...
> Unable to stop container: operation timed out
> 
> # vzlist Customer
> CTID  NPROC STATUS  IP_ADDR HOSTNAME
>84  3 running -   Customer
> 
> The 3 processes, I don't know what they are but they may be related to 
> this, and may be a clue: After the initial vzctl restart, it got down to 
> 1 process before it timed out. I ran "vzctl exec Customer ps ax" and now 
> I notice that the NPROC increments each time I do this. However from the 
> HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not 
> the ones in the VE.
> 
> Any thoughts on how I can finally stop this thing and restart it? Note 
> that this is a production HN with production VEs, so rebooting it is not 
> an attractive option.
> 
> HN OS is Fedora 9
> Kernel is 2.6.24.ovz009.1
> vzctl-3.0.23-1.x86_64
> vzctl-lib-3.0.23-1.x86_64

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


Re: [Users] Cannot stop or restart container

2010-03-30 Thread John Drescher
> I recently had a similar result from an Archlinux template, however I would
> need to know more about the container to be able to relate it to my case.
>
> Can you vzctl enter it?
>

I have a restart problem on a self made gentoo container however I
know the cause. It gets locked up trying to stop nfsmount. I believe
the problem is the virtual nic is down before the nfsmount tries to
stop and nfsmount waits forever to close the connection to the
server..

John
___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


RE: [Users] Cannot stop or restart container

2010-03-30 Thread Steven Crothers
What template are you using and where did you get it from?

I recently had a similar result from an Archlinux template, however I would
need to know more about the container to be able to relate it to my case.

Can you vzctl enter it?

-Original Message-
From: users-boun...@openvz.org [mailto:users-boun...@openvz.org] On Behalf
Of Gregor at HostGIS
Sent: Tuesday, March 30, 2010 3:53 PM
To: OpenVZ Maling List
Subject: [Users] Cannot stop or restart container

Hey guys. I reported a bug some months ago, not being able to stop and 
restart a container, getting a timeout instead. When I reported the bug 
months ago, I was unable to replicate it at the time. But here it is 
once again: I cannot stop or restart the VE.

This clip shows the results which I see:

# vzctl restart Customer
Restarting container
Stopping container ...
Unable to stop container: operation timed out

# vzlist Customer
CTID  NPROC STATUS  IP_ADDR HOSTNAME
   84  3 running -   Customer

The 3 processes, I don't know what they are but they may be related to 
this, and may be a clue: After the initial vzctl restart, it got down to 
1 process before it timed out. I ran "vzctl exec Customer ps ax" and now 
I notice that the NPROC increments each time I do this. However from the 
HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not 
the ones in the VE.

Any thoughts on how I can finally stop this thing and restart it? Note 
that this is a production HN with production VEs, so rebooting it is not 
an attractive option.

HN OS is Fedora 9
Kernel is 2.6.24.ovz009.1
vzctl-3.0.23-1.x86_64
vzctl-lib-3.0.23-1.x86_64

-- 
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
Network+   Server+   A+   Security+   Linux+
PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up - only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users


[Users] Cannot stop or restart container

2010-03-30 Thread Gregor at HostGIS
Hey guys. I reported a bug some months ago, not being able to stop and 
restart a container, getting a timeout instead. When I reported the bug 
months ago, I was unable to replicate it at the time. But here it is 
once again: I cannot stop or restart the VE.


This clip shows the results which I see:

# vzctl restart Customer
Restarting container
Stopping container ...
Unable to stop container: operation timed out

# vzlist Customer
CTID  NPROC STATUS  IP_ADDR HOSTNAME
  84  3 running -   Customer

The 3 processes, I don't know what they are but they may be related to 
this, and may be a clue: After the initial vzctl restart, it got down to 
1 process before it timed out. I ran "vzctl exec Customer ps ax" and now 
I notice that the NPROC increments each time I do this. However from the 
HN, "ps ax | grep ps" only shows the one 'ps' which I am running, not 
the ones in the VE.


Any thoughts on how I can finally stop this thing and restart it? Note 
that this is a production HN with production VEs, so rebooting it is not 
an attractive option.


HN OS is Fedora 9
Kernel is 2.6.24.ovz009.1
vzctl-3.0.23-1.x86_64
vzctl-lib-3.0.23-1.x86_64

--
HostGIS, Open Source solutions for the global GIS community
Greg Allensworth - SysAdmin, Programmer, GIS Person, Security
   Network+   Server+   A+   Security+   Linux+
   PHP   PostgreSQL   MySQL   DHTML/JavaScript/AJAX

"No one cares if you can back up — only if you can recover."

___
Users mailing list
Users@openvz.org
https://openvz.org/mailman/listinfo/users