Re: what causes a userland to stop, but allows kernel to continue?

2002-05-06 Thread Terry Lambert
Patrick Thomas wrote: Are NMBCLUSTERS and mbuf determined by 'maxusers' ? I have maxusers=512 ... comments ? When you suggest 'clamp the total number of sockets that are permittedto be open' ... how is this done - is there a sysctl that corresponds to total number of sockets that are

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-06 Thread Patrick Thomas
No denied requests. It's not mbufs. It must be something else. How do you feel about this: # vmstat -z ITEMSIZE LIMITUSEDFREE REQUESTS PIPE:160,0,702,522, 236316 SWAPMETA:160, 509724,452,136, 1125 unpcb:

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-06 Thread Terry Lambert
Patrick Thomas wrote: No denied requests. It's not mbufs. It must be something else. How do you feel about this: [ ... ] You have 24M in vnodes, which is surprising for a machine whose job is supposedly postgres. You have another 17M in PV ENTRY values, which is for page mapping. You

what causes a userland to stop, but allows kernel to continue ?

2002-05-05 Thread Patrick Thomas
So, based on a previous thread, it looks like I have a server whose userland halted, essentially, but the kernel continued running. As evidenced by: - you can still ping the server just fine - you can still connect to running services just fine - if you ssh to it, `ssh -v` (verbose) claims a

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-05 Thread Mike Silbersack
On Sun, 5 May 2002, Patrick Thomas wrote: So, based on a previous thread, it looks like I have a server whose userland halted, essentially, but the kernel continued running. My guess would be that userland apps are not necessarily stopped, but perhaps a few processes are somehow locking out

Re: what causes a userland to stop, but allows kernel to continue ?

2002-05-05 Thread Leo Bicknell
In a message written on Sun, May 05, 2002 at 04:31:36PM -0700, Patrick Thomas wrote: So, based on a previous thread, it looks like I have a server whose userland halted, essentially, but the kernel continued running. I've seen something similar to this before on an Ultrix box. What happened

Re: what causes a userland to stop, but allows kernel to continue ?

2002-05-05 Thread Anthony Schneider
FWIW, I've very recently had something similar happen to a 4.5-STABLE box. The machine was NOT SMP, and the cause, as far as we know, was that /var had been filled by apache's error_log -- a funky new mod_throttle install with lots of critical_acquire() failed: Permission denied

Re: what causes a userland to stop, but allows kernel to continue ?

2002-05-05 Thread Terry Lambert
Anthony Schneider wrote: Livelock, maybe? Is there some sort of internal kernel semaphore table which might be getting filled up or something? I'd also like to find out more about this, but sadly, the machine is a remote one and I can't drop into ddb as suggested... Thanks you all very

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-05 Thread Ian
So, based on a previous thread, it looks like I have a server whose userland halted, essentially, but the kernel continued running. My guess would be that userland apps are not necessarily stopped, but perhaps a few processes are somehow locking out all others. I think the best way to

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-05 Thread Mike Silbersack
On Sun, 5 May 2002, Ian wrote: break into how? He said the local console keyboard was totally unresponsive and you can't ssh in. If hitting the capslock key doesn't cause the corresponding LED to toggle, doesn't that imply that the keyboard interupts aren't getting handled? It's times

Re: what causes a userland to stop, but allows kernel to continue?

2002-05-05 Thread Patrick Thomas
Are NMBCLUSTERS and mbuf determined by 'maxusers' ? I have maxusers=512 ... comments ? When you suggest 'clamp the total number of sockets that are permittedto be open' ... how is this done - is there a sysctl that corresponds to total number of sockets that are permitted to be open ? I am